Methods
In order to ensure the publication of complex annotated online corpora via the INEL corpus platform at the end of the respective sub-projects, the project operates a digital infrastructure that is constantly being adapted and expanded. It is based on the preliminary work of the Hamburg Centre for Language Corpora and includes the following areas of work, among others:
- Resource creation
Object language data is digitised, transcribed and time-aligned (in the case of audio data) and enriched with metadata, glossed and annotated in depth using established tools and standards (e.g. EXMARaLDA and ISO-TEI). In addition to language corpora, further resources are created, including cartographic data, catalogues and the INEL bibliography. - Workflow Management
The aim of developing and adapting digital workflows is to ensure more efficient and user-friendly data processing in the sub-projects. The workflows developed extend the work on corpus creation to include components for monitoring, versioning, consistency checks and the analysis and visualisation of the language corpora currently being created. - Provision of resources
The provision and publication of the project resources under open access conditions takes place under the aspects of sustainable storage and intuitive searchability, both for the international community and for interested parties. For this purpose, both repositories in Hamburg (for example at the Centre for Sustainable Research Data Management, FDM and the Hamburg Centre for Language Corpora, HZSK) and existing solutions for visualisation and corpus search (TsaKorpus platform) are used.