Ruprecht-Karls-Universität Heidelberg

K. Erk and S. Pado: A powerful and versatile XML Format for representing role-semantic annotation. Proceedings of LREC-2004, Lisbon.


We present two XML formats for the description and encoding of semantic role information in corpora. The TIGER/SALSA XML format provides a modular representation for semantic roles and syntactic structure. The Text-SALSA XML format is a lightweight version of TIGER/SALSA XML designed for manual annotation with an XML editor rather than a special tool. Both formats can deal with underspecification, roles crossing the sentence boundary, compound splitting, and whole-sentence tags for meta-level comments.



@InProceedings{erk04:_xml_format,
  author = 	 {Katrin Erk and Sebastian Pado},
  title = 	 {A powerful and versatile XML Format for representing role-semantic annotation},
  booktitle =	 {Proceedings of LREC-2004},
  year =	 2004,
  address =	 {Lisbon, Portugal}
}