Code

Code I published can generally be found on my GitHub profile. The most important contributions are listed below.

CorefAnnotator

LicenseDOILatest releaseLanguagePlatform

CorefAnnotator is a tool for annotating coreference in texts. It has been developed to be able to cope with long texts with many different chains. The core idea is not to annotate binary relations between mentions (e.g., cataphoric), but to consider all mentions that co-refer to form an equivalence set. Consequently, the tool displays a list of entities on the right and allows free assignment of text spans to these entities.

https://github.com/nilsreiter/CorefAnnotator

Citation

Ina Rösiger, Sarah Schulz, Nils Reiter. Towards Coreference for Literary Text: Analyzing Domain-Specific Phenomena. In Proceedings of the Second Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, August 2018.
 

DramaAnalysis

LicenseDOILatest releaseLanguage

This R-package is developed in the context of the QuaDramA project and is a collection of frequently used functions for the analysis of dramatic texts. It takes texts as input that are formatted in a CSV-like format (and that are produced by DramaNLP).

https://github.com/quadrama/DramaAnalysis

 

SimpleXmiViewer

Latest releaseLanguagePlatform

A very simple viewer for XMI files, which is an extension and re-packaging of the UIMA Annotation Viewer. On Mac OS X, it nicely integrates with Finder.app, allowing to open and inspect XMI files with a double click or open on the command line.

http://nilsreiter.github.io/SimpleXmiViewer/

 

TreeAnno

LicenseDOILatest releaseLanguagePlatform

TreeAnno is a web-based tool to annotate tree structures on texts. Instead of drawing relations with pointing device, sentences or tokens can be indented. This makes the tool similar to an outliner (which is also its inspiration source). Under the hood, indentations are stored as trees on text snippets encoded via their character offsets. The tool has been used to annotate information structure in interactive texts.

https://github.com/nilsreiter/treeanno

Citation

Kordula De Kuthy, Nils Reiter, Arndt Riester. QUD-Based Annotation of Discourse Structure and Information Structure: Tool and Evaluation. In Nicoletta Calzolari (Conference chair), Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, Takenobu Tokunaga (eds.): Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Paris, France, May 2018.
 

GenericXmlReader

License platform language

A class to read in arbitrary XML content (including TEI) into UIMA, translating some structural annotation to stand off

https://github.com/nilsreiter/generic-xml-reader

 

DramaNLP

LicenseDOILatest releaseLanguagePlatform

DramaNLP is a collection of UIMA components that we use to process dramatic texts in the project QuaDramA. The collection contains importers from various TEI dialects as well as NLP components specific to the text type. This package is developed jointly with Janis Pagel.

https://github.com/quadrama/DramaNLP