Methods
To ensure that complex annotated online corpora can be published via the INEL Corpus Platform at the end of the respective sub-projects, the project operates a digital infrastructure that is continuously adapted and expanded. It builds on the preliminary work of the Hamburg Centre for Language Corpora and includes, among others, the following work areas:
- Resource Creation
Object language data are digitised, transcribed, time-aligned (in the case of audio data), enriched with metadata, glossed, and deeply annotated using established tools and standards (e.g., EXMARaLDA and ISO-TEI). In addition to language corpora, further resources are created, including cartographic data, catalogues, and the INEL bibliography.
- Workflow Management
Digital workflows are developed and adapted to make data preparation in the sub-projects more efficient and user-friendly. These workflows extend corpus creation with components for monitoring, versioning, and consistency checks, as well as for analysing and visualising the language corpora under construction.
- Providing Resources
The project resources are published under Open Access conditions, with a view to sustainable storage and intuitive searchability, for both the international research community and the interested public. For this purpose, the project uses repositories in Hamburg (for example, at the Centre for Sustainable Research Data Management, FDM, and the Hamburg Centre for Language Corpora, HZSK) as well as existing solutions for visualisation and corpus search (TsaKorpus platform).