public interface Text
Base interface for all unstructured text representation objects. Provides
access to character offsets, the containing Document, and surrounding Text
objects.
| Method Summary | | |
|---|---|---|
| AnnotationType | getAnnotationType() | Returns the AnnotationType that describes this Text. |
| <T extends Text> Collection<T> | getCongruentAnnotations(AnnotationType<T> type) | Returns all the annotations congruent with this Text that correspond to the given AnnotationType. |
| Document | getDocument() | Returns the Document in which this Text exists. |
| String | getDocumentID() | Returns the Document ID that identifies this Text. |
| int | getEndCharOffset() | Returns the (exclusive) end character offset for this Text object within the (processed) Document. |
| <T extends Text> Collection<T> | getIntersectingAnnotations(AnnotationType<T> type) | Returns all the annotations that intersect this Text and correspond to the given AnnotationType. |
| String | getRawString() | Returns the raw String value that this Text spans. |
| int | getStartCharOffset() | Returns the start character offset for this Text object within the (processed) Document. |
| <T extends Text> Collection<T> | getSubAnnotations(AnnotationType<T> type) | Returns all the annotations within this Text that correspond to the given AnnotationType. |
| <T extends Text> Collection<T> | getSuperAnnotations(AnnotationType<T> type) | Returns all the annotations that contain this Text as a subspan and correspond to the given AnnotationType. |
| Method Detail |
|---|
AnnotationType getAnnotationType()
Returns the AnnotationType that describes this Text.
Returns: The AnnotationType of this Text.
See Also: AnnotationType
String getDocumentID()
Returns the Document ID that identifies this Text.
Returns: The ID of the Document that contains this Text.
Document getDocument()
Returns the Document in which this Text exists.
Returns: The Document that contains this Text.
int getStartCharOffset()
Returns the start character offset for this Text object within
the (processed) Document. This offset does not necessarily line up
with the start character offset from the unprocessed document/file.
int getEndCharOffset()
Returns the (exclusive) end character offset for this Text
object within the (processed) Document. This offset does not
necessarily line up with the end character offset from the unprocessed
document/file.
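Because the end offset is exclusive, a span's length is simply end minus start, which matches the convention of Java's String.substring. The following is a minimal sketch of that arithmetic, not the library's implementation. The class OffsetSketch, its fields, and the sample sentence are hypothetical, and it assumes that the start offset is inclusive and that both offsets index into the processed text held as a String.

```java
// Hypothetical stand-in for a Text span; not part of the documented API.
final class OffsetSketch {
    final int start; // assumed inclusive, in the role of getStartCharOffset()
    final int end;   // exclusive, in the role of getEndCharOffset()

    OffsetSketch(int start, int end) {
        if (start < 0 || end < start) {
            throw new IllegalArgumentException("invalid span: [" + start + ", " + end + ")");
        }
        this.start = start;
        this.end = end;
    }

    // With an exclusive end offset, the length is a plain difference.
    int length() {
        return end - start;
    }

    // String.substring also takes an inclusive start and an exclusive end,
    // so the offsets can be passed through unchanged.
    String coveredText(String processedText) {
        return processedText.substring(start, end);
    }

    public static void main(String[] args) {
        String processed = "Acme Corp hired Jane Doe.";
        OffsetSketch span = new OffsetSketch(16, 24);
        System.out.println(span.length());               // prints 8
        System.out.println(span.coveredText(processed)); // prints Jane Doe
    }
}
```

Since the offsets refer to the processed Document, applying them to the unprocessed file contents would not be expected to select the same characters.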
String getRawString()
Returns the raw String value that this Text
spans. Includes whitespace (spaces, tabs, newlines, etc.) but not the
original markup (HTML tags, PDF and MS Word markup, etc.). Might contain
additional newlines along paragraph boundaries for some large documents.
<T extends Text> Collection<T> getCongruentAnnotations(AnnotationType<T> type)
Returns all the annotations congruent with this Text that
correspond to the given AnnotationType.
Type Parameters: T - The Text sub-class corresponding to the given
AnnotationType.
Parameters: type - The AnnotationType that all returned items will match.
Returns: A Collection of objects matching the given
AnnotationType. The annotations will be in a semi-sorted
order. This means that non-intersecting objects will be sorted
by their order in the document. No guarantee is made about the
order of intersecting objects.
<T extends Text> Collection<T> getSubAnnotations(AnnotationType<T> type)
Returns all the annotations within this Text that correspond
to the given AnnotationType.
Type Parameters: T - The Text sub-class corresponding to the given
AnnotationType.
Parameters: type - The AnnotationType that all returned items will match.
Returns: A Collection of objects matching the given
AnnotationType. The annotations will be in a semi-sorted
order. This means that non-intersecting objects will be sorted
by their order in the document. No guarantee is made about the
order of intersecting objects.
<T extends Text> Collection<T> getSuperAnnotations(AnnotationType<T> type)
Returns all the annotations that contain this Text as a
subspan and correspond to the given AnnotationType.
Type Parameters: T - The Text sub-class corresponding to the given
AnnotationType.
Parameters: type - The AnnotationType that all returned items will match.
Returns: A Collection of objects matching the given
AnnotationType. The annotations will be in a semi-sorted
order. This means that non-intersecting objects will be sorted
by their order in the document. No guarantee is made about the
order of intersecting objects.
<T extends Text> Collection<T> getIntersectingAnnotations(AnnotationType<T> type)
Returns all the annotations that intersect this Text and
correspond to the given AnnotationType.
Type Parameters: T - The Text sub-class corresponding to the given
AnnotationType.
Parameters: type - The AnnotationType that all returned items will match.
Returns: A Collection of objects matching the given
AnnotationType. The annotations will be in a semi-sorted
order. This means that non-intersecting objects will be sorted
by their order in the document. No guarantee is made about the
order of intersecting objects.
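The four annotation lookups differ only in the span relation they test, so it may help to see those relations written out as offset comparisons. The sketch below is illustrative only and does not show how the library computes its results. The class SpanRelationSketch and all of its methods are hypothetical. It assumes that "congruent" means identical start and end offsets, that containment is non-strict (a span is within itself), and that sorting by start offset is one acceptable way to produce the semi-sorted order described above.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical half-open span [start, end); a stand-in for a Text, not the library's class.
final class SpanRelationSketch {
    final int start; // inclusive
    final int end;   // exclusive

    SpanRelationSketch(int start, int end) {
        this.start = start;
        this.end = end;
    }

    // Congruent (assumed meaning): both offsets are identical.
    boolean isCongruentWith(SpanRelationSketch other) {
        return start == other.start && end == other.end;
    }

    // Sub-annotation test: this span lies entirely within the other span.
    boolean isWithin(SpanRelationSketch other) {
        return other.start <= start && end <= other.end;
    }

    // Super-annotation test: this span contains the other span as a subspan.
    boolean contains(SpanRelationSketch other) {
        return other.isWithin(this);
    }

    // Intersection test: for non-empty spans, true when they share a character position.
    boolean intersects(SpanRelationSketch other) {
        return start < other.end && other.start < end;
    }

    // One ordering consistent with "semi-sorted": by start offset, then end offset.
    // Non-intersecting spans end up in document order; ties among intersecting
    // spans are broken arbitrarily by this sketch.
    static List<SpanRelationSketch> inDocumentOrder(List<SpanRelationSketch> spans) {
        List<SpanRelationSketch> sorted = new ArrayList<>(spans);
        sorted.sort(Comparator
                .comparingInt((SpanRelationSketch s) -> s.start)
                .thenComparingInt(s -> s.end));
        return sorted;
    }

    public static void main(String[] args) {
        SpanRelationSketch sentence = new SpanRelationSketch(0, 25);
        SpanRelationSketch name = new SpanRelationSketch(16, 24);
        System.out.println(name.isWithin(sentence));
        System.out.println(sentence.contains(name));
        System.out.println(name.isCongruentWith(sentence));
        System.out.println(name.intersects(new SpanRelationSketch(24, 30)));
    }
}
```

Under these assumptions, every congruent span is also both a sub-span and a super-span, and every sub-span or super-span of a non-empty span also intersects it. Whether the library's methods overlap in the same way is not stated on this page.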