Data access
Using the corpus in ANNIS
The current version of the corpus can be accessed via the ANNIS installation from the FDM (https://annis.fdm.uni-hamburg.de/annis-gui-3.6.0/ren). A short manual for searching in the corpus using ANNIS is available here. A more detailed manual is available in German and is contained in each corpus release at the FDM starting with version 0.6.
Please note that the number of tokens shown in ANNIS next to the name of the corpus does not match with the actual number of tokens as two different tokenization layers are used. In order to identify the correct number of tokens, please use the search field in ANNIS to query the relevant tokens (e.g. “tok_dipl“ or “tok_anno“) – the number of matches is the number of tokens.
Publication
Till the end of the project (July 2019) the corpus is published gradually, that means that as soon as new texts are completely annotated each of those texts will be published in a new version of the corpus. The different versions of the corpus can be identified by the date behind the name of the particular corpus. At the end of the project, a final version will be released. All versions will get a persistent identifier (PID) and will be permanently available at the Zentrum für nachhaltiges Forschungsdatenmanagement (FDM). A list of all the so far published corpus versions can be found at the bottom of this web site.
There may be differences between der versions of the corpus concerning the design of the annotation (e.g. the names of categories) and the transcription and the annotation (e.g. error corrections).
The corpus data is available in the two following four formats:
- relAnnis: format that can be directly imported into ANNIS
- CorA-XML: format used by the annotation tool CorA
- TEI: XML-Format according to the guidelines of the Text Encoding Initiative
- Leseversion: A simplified version of the transcriptions in pdf as well as in html.
Corpus versions
- ReN-Team. (2021). Reference Corpus Middle Low German/Low Rhenish (1200–1650); Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200–1650) (Version 1.1) [Data set]. http://doi.org/10.25592/uhhfdm.9195
- ReN-Team. (2019). Reference Corpus Middle Low German/Low Rhenish (1200–1650); Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200–1650) (Version 1.0) [Data set]. http://doi.org/10.25592/uhhfdm.1697
- ReN-Team. (2019). Reference Corpus Middle Low German/Low Rhenish (1200-1650); Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650) (Version 0.9) [Data set]. http://doi.org/10.25592/uhhfdm.1696
- ReN-Team. (2018). Reference Corpus Middle Low German/Low Rhenish (1200-1650); Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650) (Version 0.8) [Data set]. http://doi.org/10.25592/uhhfdm.1695
- ReN-Team. (2018). Reference Corpus Middle Low German/Low Rhenish (1200-1650); Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650) (Version 0.7) [Data set]. http://doi.org/10.25592/uhhfdm.1694
- ReN-Team. (2018). Reference Corpus Middle Low German/Low Rhenish (1200-1650); Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650) (Version 0.6) [Data set]. http://doi.org/10.25592/uhhfdm.1691
- ReN-Team. (2017). Reference Corpus Middle Low German/Low Rhenish (1200-1650); Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650) (Version 0.5) [Data set]. http://doi.org/10.25592/uhhfdm.1690
- ReN-Team. (2017). Reference Corpus Middle Low German/Low Rhenish (1200-1650); Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650) (Version 0.4) [Data set]. http://doi.org/10.25592/uhhfdm.1689
- ReN-Team. (2017). Reference Corpus Middle Low German/Low Rhenish (1200-1650); Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650) (Version 0.3) [Data set]. http://doi.org/10.25592/uhhfdm.1687
- ReN-Team. (2017). Reference Corpus Middle Low German/Low Rhenish (1200-1650); Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650) (Version 0.2) [Data set]. http://doi.org/10.25592/uhhfdm.1684
- Schröder, Ingrid, & Peters, Robert. (2016). Reference Corpus Middle Low German/Low Rhenish (1200-1650); Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650) (Version 0.1) [Data set]. http://doi.org/10.25592/uhhfdm.1669