The ParlaMint corpora of parliamentary proceedings

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

The ParlaMint corpora of parliamentary proceedings. / Erjavec, Tomaž; Ogrodniczuk, Maciej; Osenova, Petya; Ljubešic, Nikola ; Simov, Kiril; Pancur, Andrej ; Rudolf, Michał ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór; Çöltekin, Çagrı; de Does, Jesse; Depuydt, Katrien ; Agnoloni, Tommaso ; Venturi, Giulia; Calzada Pérez, María ; de Macedo, Luciana D.; Navarretta, Costanza; Luxardo, Giancarlo; Coole, Matthew; Rayson, Paul ; Morkevicius, Vaidas ; Krilavicius, Tomas ; Dargis, Roberts ; Ring, Orsolya ; van Heusden, Ruben ; Marx, Maarten ; Fiser, Darja.

I: Language Resources and Evaluation, Bind 57, 2023, s. 415-448.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Erjavec, T, Ogrodniczuk, M, Osenova, P, Ljubešic, N, Simov, K, Pancur, A, Rudolf, M, Kopp, M, Barkarson, S, Steingrímsson, S, Çöltekin, Ç, de Does, J, Depuydt, K, Agnoloni, T, Venturi, G, Calzada Pérez, M, de Macedo, LD, Navarretta, C, Luxardo, G, Coole, M, Rayson, P, Morkevicius, V, Krilavicius, T, Dargis, R, Ring, O, van Heusden, R, Marx, M & Fiser, D 2023, 'The ParlaMint corpora of parliamentary proceedings', Language Resources and Evaluation, bind 57, s. 415-448. https://doi.org/10.1007/s10579-021-09574-0

APA

Erjavec, T., Ogrodniczuk, M., Osenova, P., Ljubešic, N., Simov, K., Pancur, A., Rudolf, M., Kopp, M., Barkarson, S., Steingrímsson, S., Çöltekin, Ç., de Does, J., Depuydt, K., Agnoloni, T., Venturi, G., Calzada Pérez, M., de Macedo, L. D., Navarretta, C., Luxardo, G., ... Fiser, D. (2023). The ParlaMint corpora of parliamentary proceedings. Language Resources and Evaluation, 57, 415-448. https://doi.org/10.1007/s10579-021-09574-0

Vancouver

Erjavec T, Ogrodniczuk M, Osenova P, Ljubešic N, Simov K, Pancur A o.a. The ParlaMint corpora of parliamentary proceedings. Language Resources and Evaluation. 2023;57:415-448. https://doi.org/10.1007/s10579-021-09574-0

Author

Erjavec, Tomaž ; Ogrodniczuk, Maciej ; Osenova, Petya ; Ljubešic, Nikola ; Simov, Kiril ; Pancur, Andrej ; Rudolf, Michał ; Kopp, Matyáš ; Barkarson, Starkaður ; Steingrímsson, Steinþór ; Çöltekin, Çagrı ; de Does, Jesse ; Depuydt, Katrien ; Agnoloni, Tommaso ; Venturi, Giulia ; Calzada Pérez, María ; de Macedo, Luciana D. ; Navarretta, Costanza ; Luxardo, Giancarlo ; Coole, Matthew ; Rayson, Paul ; Morkevicius, Vaidas ; Krilavicius, Tomas ; Dargis, Roberts ; Ring, Orsolya ; van Heusden, Ruben ; Marx, Maarten ; Fiser, Darja. / The ParlaMint corpora of parliamentary proceedings. I: Language Resources and Evaluation. 2023 ; Bind 57. s. 415-448.

Bibtex

@article{613cae9f83434dfcadad71a830a682f9,
title = "The ParlaMint corpora of parliamentary proceedings",
abstract = "This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17 European national parliaments with half a billion words. The corpora are uniformly encoded, contain rich meta-data about 11 thousand speakers, and are linguistically annotated following the Universal Dependencies formalism and with named entities. Samples of the corpora and conversion scripts are available from the project{\textquoteright}s GitHub repository, and the complete corpora are openly available via the CLARIN.SI repository for download, as well as through the NoSketch Engine and KonText concordancers and the Parlameter interface for on-line exploration and analysis.",
author = "Toma{\v z} Erjavec and Maciej Ogrodniczuk and Petya Osenova and Nikola Ljube{\v s}ic and Kiril Simov and Andrej Pancur and Micha{\l} Rudolf and Maty{\'a}{\v s} Kopp and Starka{\dh}ur Barkarson and Stein{\th}{\'o}r Steingr{\'i}msson and {\c C}agrı {\c C}{\"o}ltekin and {de Does}, Jesse and Katrien Depuydt and Tommaso Agnoloni and Giulia Venturi and {Calzada P{\'e}rez}, Mar{\'i}a and {de Macedo}, {Luciana D.} and Costanza Navarretta and Giancarlo Luxardo and Matthew Coole and Paul Rayson and Vaidas Morkevicius and Tomas Krilavicius and Roberts Dargis and Orsolya Ring and {van Heusden}, Ruben and Maarten Marx and Darja Fiser",
year = "2023",
doi = "10.1007/s10579-021-09574-0",
language = "English",
volume = "57",
pages = "415--448",
journal = "Language Resources and Evaluation",
issn = "1574-020X",
publisher = "Springer",

}

RIS

TY - JOUR

T1 - The ParlaMint corpora of parliamentary proceedings

AU - Erjavec, Tomaž

AU - Ogrodniczuk, Maciej

AU - Osenova, Petya

AU - Ljubešic, Nikola

AU - Simov, Kiril

AU - Pancur, Andrej

AU - Rudolf, Michał

AU - Kopp, Matyáš

AU - Barkarson, Starkaður

AU - Steingrímsson, Steinþór

AU - Çöltekin, Çagrı

AU - de Does, Jesse

AU - Depuydt, Katrien

AU - Agnoloni, Tommaso

AU - Venturi, Giulia

AU - Calzada Pérez, María

AU - de Macedo, Luciana D.

AU - Navarretta, Costanza

AU - Luxardo, Giancarlo

AU - Coole, Matthew

AU - Rayson, Paul

AU - Morkevicius, Vaidas

AU - Krilavicius, Tomas

AU - Dargis, Roberts

AU - Ring, Orsolya

AU - van Heusden, Ruben

AU - Marx, Maarten

AU - Fiser, Darja

PY - 2023

Y1 - 2023

N2 - This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17 European national parliaments with half a billion words. The corpora are uniformly encoded, contain rich meta-data about 11 thousand speakers, and are linguistically annotated following the Universal Dependencies formalism and with named entities. Samples of the corpora and conversion scripts are available from the project’s GitHub repository, and the complete corpora are openly available via the CLARIN.SI repository for download, as well as through the NoSketch Engine and KonText concordancers and the Parlameter interface for on-line exploration and analysis.

AB - This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17 European national parliaments with half a billion words. The corpora are uniformly encoded, contain rich meta-data about 11 thousand speakers, and are linguistically annotated following the Universal Dependencies formalism and with named entities. Samples of the corpora and conversion scripts are available from the project’s GitHub repository, and the complete corpora are openly available via the CLARIN.SI repository for download, as well as through the NoSketch Engine and KonText concordancers and the Parlameter interface for on-line exploration and analysis.

U2 - 10.1007/s10579-021-09574-0

DO - 10.1007/s10579-021-09574-0

M3 - Journal article

C2 - 35125984

VL - 57

SP - 415

EP - 448

JO - Language Resources and Evaluation

JF - Language Resources and Evaluation

SN - 1574-020X

ER -

ID: 291220591