Culture-Sensitive Assessment and Adjustment of Large Language Models – Adaptation to the Nordic-Baltic Societies (CAALLM)

The objective of the project is to facilitate the adaption of large language models towards a more responsible coverage and functionality that encompass the linguistic, cultural, and societal diversity in the Nordic and Baltic regions.

The project brings together language institutions and NLP research groups from the Nordic-Baltic region and will compile a number of open-source linguistic and cultural multi-parallel datasets for Danish, Swedish, Bokmål, Nynorsk, Faroese, and Latvian, which will systematically draw on and make explicit the central aspects of the linguistic and cultural diversity of our regions. Based on these data, we hope to advance state-of-the-art methods for explaining, assessing, and aligning LLMs across languages and cultures, with particular focus on the linguistic idiosyncrasy and cultural heritage of our regions.

 

University of the Faroe Islands

University of Oslo

University of Gothenburg

The Society of Danish Language and Literature

Institute of Mathematics and Computer Science, University of Latvia

 

Researchers

Name Title
Ali Basirat Associate Professor Billede af Ali Basirat
Bolette Sandford Pedersen Professor, Deputy Head of Department Billede af Bolette Sandford Pedersen
Sussi Olsen Academic Research Staff Billede af Sussi Olsen

Funding

The project is funded by NordForsk.

Project period: 1 March 2026 – 28 February 2029

Principal investigator: Bolette Sandford Pedersen