Why do we need/use annotation formats?
Documents are often human readable but not machine readable
What are annotations?
added like markup to a text document (not the original format)
What is important when creating annotations?
A convention must be followed
What is the purpose of annotation (3)
What are the types of annotation formats (3)
What is boundary notation?
The application of annotation to each individual token,
Give an example of boundary notation
BIO- begin, inside, outside
What are the benefits (1) and disadvantages (1) of BIO
- cant handle hierarchical or structured annotations like nesting, relations or events
Give an example of a nested entity
“the British prime minister boris johnson”
Given an example of an event
The iraq, us war
What are the benefits (1) and disadvantages (2) of inline markup
Give an example overlapping/intersecting annotation
“the Iraq city of Basra”
What are stand off annotations?
Annotations stored separately from the document (requires a way to link). We link annotations using indexing on character offsets
What are the benefits (2) and disadvantages (1) of stand off annotations