This library was created as part of the SEMPRE project.

This module makes it possible to create, read, modify, and write GATE documents from within perl and use the abstractions for document content, document annotation sets, annotations, and annotation features in a way very similar to the JAVA API.

Here is an overview of the current features and limitations:

  • Create a GATE document from a string
  • De-serialize the XML representation of GATE documents
  • Create a XML serialization of a GATE document
  • Handle annotation sets, annotations, annotation spans, and features
  • Only supports documents and feature values encoded in UTF8
  • Only supports feature values of type java.lang.String.
  • Features and offsets are stored directly with annotations, not in separate data structures (nodes, feature sets)



Key facts