Research on Modern Latvian Language and
Development of Language Technology



LATE kick-off meeting 
On 18 January, the LATE project kick-off meeting took place on Zoom. It was attended by representatives from all partner institutions:
LU MII: Ilze Auziņa, Baiba Saulīte, Normunds Grūzītis, Inguna Skadiņa, Vita Matule
LU LVI: Edmunds Trumpa, Agris Timuška, Sanda Rapa, Marita Silkāne
LU HZF: Andra Kalnača, Ilze Lokmane
LU LFMI: Sanita Reinsone
LiepU: Dina Bethere, Airita Lindberga

Online hands-on workshop on platform and searching the corpora, organized by the Institute of Mathematics and Computer Sciences of the University of Latvia in cooperation with CLARIN Latvia. Instructors: Ilze Auziņa and Baiba Saulīte.

Information about the project

The project "Research on Modern Latvian Language and Development of Language Technology” is implemented within the framework of the National Research Programme "Letonika – Fostering a Latvian and European Society".

Project No: VPP-LETONIKA-2021/1-0006

Implementation period: 20.12.2021–19.12.2024

Project funding: 1 068 000 EUR

Funded by: Latvian Council of Science of the Ministry of Education and Science

Project partners: Institute of Mathematics and Computer Science of the University of Latvia (leading partner), University of Latvia (Latvian Language Institute of the University of Latvia and Faculty of Humanities of the University of Latvia), Institute of Literature, Folklore and Art of the University of Latvia, Liepāja University


Project leader: Ilze Auziņa

The aim of the project is to advance research on the grammatical, lexical-semantic, phonetic and phonological system of the modern Latvian language, and Latvian sign language using data-driven methods, as well as to develop sustainable Latvian language resources and tools. In order to achieve the goal, the Latvian speech corpus, the pilot corpus of Latvian sign language will be developed, and and “Dictionary of Contemporary Latvian” will be improved. Based on Latvian grammar studies, “Latvian Treebank” will be enhanced. These resources will be integrated into a single Latvian language research infrastructure, as well into the CLARIN-LV repository. During the project, a LATE platform for speech transcription and subtitling will be created.

Project's research group:  10 lead participants and 33 participants (including 14 student-participants).