Semi-automatic identification of Danish discourse deictics

Publication: Contribution to book/anthology/report, Conference contribution in proceedings, Research, Peer-reviewed

In this paper we present an algorithm for the (semi-)automatic identification of anaphors whose antecedents are verbal phrases, clauses or discourse segments in Danish dialogues. Although these anaphors are quite frequent, especially in conversations, they have usually been neglected in computational linguistics. The algorithm we propose contains defeasible rules for distinguishing these anaphors from those that have individual nominals as antecedents. The rules have been identified by looking at the occurrences of these types of anaphor in the transcriptions of two dialogue collections. The algorithm has been manually tested on four Danish dialogues and the obtained results have been evaluated.

Original language: English
Title: Text, Speech and Dialogue - 4th International Conference, TSD 2001, Proceedings
Editors: Vaclav Matousek, Pavel Mautner, Roman Moucek, Karel Tauser
Number of pages: 7
Publisher: Springer Verlag
Publication date: 2001
Pages: 396-402
ISBN (Print): 9783540425571
Status: Published - 2001
Event: 4th International Conference on Text, Speech and Dialogue, TSD 2001 - Zelezna Ruda, Czech Republic
Duration: 11 Sep 2001 - 13 Sep 2001

Conference

Conference: 4th International Conference on Text, Speech and Dialogue, TSD 2001
Country: Czech Republic
City: Zelezna Ruda
Period: 11/09/2001 - 13/09/2001
Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 2166
ISSN: 0302-9743

Bibliographical note

Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 2001.

ID: 301814657