ParlaMint: Comparable Corpora of European Parliamentary Data

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

Standard

ParlaMint: Comparable Corpora of European Parliamentary Data. / Erjavec, Tomaž; Ogrodniczuk, Maciej; Osenova, Petya; Petya Osenova, Petya; Pancur, Andrej ; Ljubešic, Nikola ; Agnoloni, Tommaso ; Barkarson, StarkaDur ; Calzada Pérez, María ; Çöltekin, Çagrı; Coole, Matthew; Dargis, Roberts ; de Macedo, Luciana D.; de Does, Jesse; Depuydt, Katrien ; Diwersy, Sascha ; Hansen, Dorte Haltrup; Kopp, Matyáš ; Krilavicius, Tomas ; Luxardo, Giancarlo; Marx, Maarten ; Morkevicius, Vaidas ; Navarretta, Costanza; Rayson, Paul ; Ring, Orsolya ; Rudolf, Michał ; Simov, Kiril; Steingrímsson, Steinþór; Üveges, István ; van Heusden, Ruben ; Venturi, Giulia.

Proceedings of CLARIN Annual Conference 2021. CLARIN ERIC, 2021. s. 19-24.

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

Harvard

Erjavec, T, Ogrodniczuk, M, Osenova, P, Petya Osenova, P, Pancur, A, Ljubešic, N, Agnoloni, T, Barkarson, S, Calzada Pérez, M, Çöltekin, Ç, Coole, M, Dargis, R, de Macedo, LD, de Does, J, Depuydt, K, Diwersy, S, Hansen, DH, Kopp, M, Krilavicius, T, Luxardo, G, Marx, M, Morkevicius, V, Navarretta, C, Rayson, P, Ring, O, Rudolf, M, Simov, K, Steingrímsson, S, Üveges, I, van Heusden, R & Venturi, G 2021, ParlaMint: Comparable Corpora of European Parliamentary Data. i Proceedings of CLARIN Annual Conference 2021. CLARIN ERIC, s. 19-24. <https://office.clarin.eu/v/CE-2021-1923-CLARIN2021_ConferenceProceedings.pdf>

APA

Erjavec, T., Ogrodniczuk, M., Osenova, P., Petya Osenova, P., Pancur, A., Ljubešic, N., Agnoloni, T., Barkarson, S., Calzada Pérez, M., Çöltekin, Ç., Coole, M., Dargis, R., de Macedo, L. D., de Does, J., Depuydt, K., Diwersy, S., Hansen, D. H., Kopp, M., Krilavicius, T., ... Venturi, G. (2021). ParlaMint: Comparable Corpora of European Parliamentary Data. I Proceedings of CLARIN Annual Conference 2021 (s. 19-24). CLARIN ERIC. https://office.clarin.eu/v/CE-2021-1923-CLARIN2021_ConferenceProceedings.pdf

Vancouver

Erjavec T, Ogrodniczuk M, Osenova P, Petya Osenova P, Pancur A, Ljubešic N o.a. ParlaMint: Comparable Corpora of European Parliamentary Data. I Proceedings of CLARIN Annual Conference 2021. CLARIN ERIC. 2021. s. 19-24

Author

Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Petya Osenova, Petya ; Pancur, Andrej ; Ljubešic, Nikola ; Agnoloni, Tommaso ; Barkarson, StarkaDur ; Calzada Pérez, María ; Çöltekin, Çagrı ; Coole, Matthew ; Dargis, Roberts ; de Macedo, Luciana D. ; de Does, Jesse ; Depuydt, Katrien ; Diwersy, Sascha ; Hansen, Dorte Haltrup ; Kopp, Matyáš ; Krilavicius, Tomas ; Luxardo, Giancarlo ; Marx, Maarten ; Morkevicius, Vaidas ; Navarretta, Costanza ; Rayson, Paul ; Ring, Orsolya ; Rudolf, Michał ; Simov, Kiril ; Steingrímsson, Steinþór ; Üveges, István ; van Heusden, Ruben ; Venturi, Giulia. / ParlaMint: Comparable Corpora of European Parliamentary Data. Proceedings of CLARIN Annual Conference 2021. CLARIN ERIC, 2021. s. 19-24

Bibtex

@inproceedings{540ad753599a4277b9f641a30d419948,
title = "ParlaMint: Comparable Corpora of European Parliamentary Data",
abstract = "This paper outlines the ParlaMint project from the perspective of its goals, tasks, participants, results and applications potential. The project produced language corpora from the sessions of the national parliaments of 17 countries, almost half a billion words in total. The corpora are split into COVID-related subcorpora (from November 2019) and reference corpora (to October 2019). The corpora are uniformly encoded according to the ParlaMint schema with the same Universal Dependencies linguistic annotations. Samples of the corpora and conversion scripts are available from the project{\textquoteright}s GitHub repository. The complete corpora are openly available via the CLARIN.SI repository for download, and through the NoSketch Engine and KonText concordancers as well as through the Parlameter4 interface for exploration and analysis.",
author = "Toma{\v z} Erjavec and Maciej Ogrodniczuk and Petya Osenova and {Petya Osenova}, Petya and Andrej Pancur and Nikola Ljube{\v s}ic and Tommaso Agnoloni and StarkaDur Barkarson and {Calzada P{\'e}rez}, Mar{\'i}a and {\c C}agrı {\c C}{\"o}ltekin and Matthew Coole and Roberts Dargis and {de Macedo}, {Luciana D.} and {de Does}, Jesse and Katrien Depuydt and Sascha Diwersy and Hansen, {Dorte Haltrup} and Maty{\'a}{\v s} Kopp and Tomas Krilavicius and Giancarlo Luxardo and Maarten Marx and Vaidas Morkevicius and Costanza Navarretta and Paul Rayson and Orsolya Ring and Micha{\l} Rudolf and Kiril Simov and Stein{\th}{\'o}r Steingr{\'i}msson and Istv{\'a}n {\"U}veges and {van Heusden}, Ruben and Giulia Venturi",
year = "2021",
language = "English",
pages = "19--24",
booktitle = "Proceedings of CLARIN Annual Conference 2021",
publisher = "CLARIN ERIC",

}

RIS

TY - GEN

T1 - ParlaMint: Comparable Corpora of European Parliamentary Data

AU - Erjavec, Tomaž

AU - Ogrodniczuk, Maciej

AU - Osenova, Petya

AU - Petya Osenova, Petya

AU - Pancur, Andrej

AU - Ljubešic, Nikola

AU - Agnoloni, Tommaso

AU - Barkarson, StarkaDur

AU - Calzada Pérez, María

AU - Çöltekin, Çagrı

AU - Coole, Matthew

AU - Dargis, Roberts

AU - de Macedo, Luciana D.

AU - de Does, Jesse

AU - Depuydt, Katrien

AU - Diwersy, Sascha

AU - Hansen, Dorte Haltrup

AU - Kopp, Matyáš

AU - Krilavicius, Tomas

AU - Luxardo, Giancarlo

AU - Marx, Maarten

AU - Morkevicius, Vaidas

AU - Navarretta, Costanza

AU - Rayson, Paul

AU - Ring, Orsolya

AU - Rudolf, Michał

AU - Simov, Kiril

AU - Steingrímsson, Steinþór

AU - Üveges, István

AU - van Heusden, Ruben

AU - Venturi, Giulia

PY - 2021

Y1 - 2021

N2 - This paper outlines the ParlaMint project from the perspective of its goals, tasks, participants, results and applications potential. The project produced language corpora from the sessions of the national parliaments of 17 countries, almost half a billion words in total. The corpora are split into COVID-related subcorpora (from November 2019) and reference corpora (to October 2019). The corpora are uniformly encoded according to the ParlaMint schema with the same Universal Dependencies linguistic annotations. Samples of the corpora and conversion scripts are available from the project’s GitHub repository. The complete corpora are openly available via the CLARIN.SI repository for download, and through the NoSketch Engine and KonText concordancers as well as through the Parlameter4 interface for exploration and analysis.

AB - This paper outlines the ParlaMint project from the perspective of its goals, tasks, participants, results and applications potential. The project produced language corpora from the sessions of the national parliaments of 17 countries, almost half a billion words in total. The corpora are split into COVID-related subcorpora (from November 2019) and reference corpora (to October 2019). The corpora are uniformly encoded according to the ParlaMint schema with the same Universal Dependencies linguistic annotations. Samples of the corpora and conversion scripts are available from the project’s GitHub repository. The complete corpora are openly available via the CLARIN.SI repository for download, and through the NoSketch Engine and KonText concordancers as well as through the Parlameter4 interface for exploration and analysis.

M3 - Article in proceedings

SP - 19

EP - 24

BT - Proceedings of CLARIN Annual Conference 2021

PB - CLARIN ERIC

ER -

ID: 279629163