|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
public interface Text
Superclass of all unstructured text representation objects. Provides access
to offsets, the Document
, and surrounding Text
objects.
Method Summary | ||
---|---|---|
AnnotationType |
getAnnotationType()
Returns the AnnotationType that describes this Text . |
|
|
getCongruentAnnotations(AnnotationType<T> type)
Returns all the annotations congruent with this Text that
correspond to the given AnnotationType . |
|
Document |
getDocument()
Returns the Document in which this Text exists. |
|
String |
getDocumentID()
Returns the Document ID that identifies this Text . |
|
int |
getEndCharOffset()
Returns the (exclusive) end character offset for this Text
object within the (processed) Document . |
|
|
getIntersectingAnnotations(AnnotationType<T> type)
Returns all the annotations that intersect this Text and
correspond to the given AnnotationType . |
|
String |
getRawString()
Returns the raw String value that this Text
spans. |
|
int |
getStartCharOffset()
Returns the start character offset for this Text object within
the (processed) Document . |
|
|
getSubAnnotations(AnnotationType<T> type)
Returns all the annotations within this Text that correspond
to the given AnnotationType . |
|
|
getSuperAnnotations(AnnotationType<T> type)
Returns all the annotations that contain this Text as a
subspan and correspond to the given AnnotationType . |
Method Detail |
---|
AnnotationType getAnnotationType()
AnnotationType
that describes this Text
.
AnnotationType
of this Text
.AnnotationType
String getDocumentID()
Document
ID that identifies this Text
.
Document
that contains this
Text
.Document getDocument()
Document
in which this Text
exists.
Document
that contains this Text
.int getStartCharOffset()
Text
object within
the (processed) Document
. This offset does not necessarily line up
with the start character offset from the unprocessed document/file.
int getEndCharOffset()
Text
object within the (processed) Document
. This offset does not
necessarily line up with the start character offset from the unprocessed
document/file.
String getRawString()
String
value that this Text
spans. Includes whitespace (spaces, tabs, newlines, etc) but not the
original markup (html tags, pdf and MS word markup, etc). Might contain
additional newlines along paragraph boundaries for some large documents.
<T extends Text> Collection<T> getCongruentAnnotations(AnnotationType<T> type)
Text
that
correspond to the given AnnotationType
.
T
- The Text
sub-class corresponding to the given
AnnotationType
.type
- The AnnotationType
that all returned items will match.
Collection
of objects matching the given
AnnotationType
. The annotations will be in a semi-sorted
order. This means that non-intersecting objects will be sorted
by their order in the document. No guarantee will be placed on the
order of intersecting objects.<T extends Text> Collection<T> getSubAnnotations(AnnotationType<T> type)
Text
that correspond
to the given AnnotationType
.
T
- The Text
sub-class corresponding to the given
AnnotationType
.type
- The AnnotationType
that all returned items will match.
Collection
of objects matching the given
AnnotationType
. The annotations will be in a semi-sorted
order. This means that non-intersecting objects will be sorted
by their order in the document. No guarantee will be placed on the
order of intersecting objects.<T extends Text> Collection<T> getSuperAnnotations(AnnotationType<T> type)
Text
as a
subspan and correspond to the given AnnotationType
.
T
- The Text
sub-class corresponding to the given
AnnotationType
.type
- The AnnotationType
that all returned items will match.
Collection
of objects matching the given
AnnotationType
. The annotations will be in a semi-sorted
order. This means that non-intersecting objects will be sorted
by their order in the document. No guarantee will be placed on the
order of intersecting objects.<T extends Text> Collection<T> getIntersectingAnnotations(AnnotationType<T> type)
Text
and
correspond to the given AnnotationType
.
T
- The Text
sub-class corresponding to the given
AnnotationType
.type
- The AnnotationType
that all returned items will match.
Collection
of objects matching the given
AnnotationType
. The annotations will be in a semi-sorted
order. This means that non-intersecting objects will be sorted
by their order in the document. No guarantee will be placed on the
order of intersecting objects.
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |