Publication INEL Corpus Nganasan
19 May 2025, by IFUU

Photo: Brykina
We are pleased to announce that the INEL corpus Nganasan has been published!
The INEL Nganasan corpus was created within the long-term INEL project (Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages), 2016–2033. The corpus is largely based on the Nganasan Spoken Language Corpus, which has been adapted to the INEL standards and supplemented with new texts. The corpus enables typologically oriented, corpus-based research on Nganasan and expands the documentation of the lesser-described indigenous languages of Northern Eurasia. The INEL Nganasan corpus consists of two parts. The glossed (searchable) part of the corpus includes texts provided with source media files (whenever available) and annotated transcripts. The archival part of the corpus contains non-glossed texts, represented either by audio recordings (optionally with preliminary transcriptions) or by scanned pages of manuscripts or publications. The corpus includes Nganasan texts recorded between 1933 and 2019.
You can find the INEL corpus Nganasan as well as all previously published corpora in the INEL community of the UHH Centre for Sustainable Research Data Management and on the INEL resource portal.