Search CORE

639 research outputs found

Challenges in Annotating Medieval Latin Charters

Author: Korkiakangas Timo
Passarotti Marco
Publication venue: German Society for Computational Linguistics and Language Technology (GSCL)
Publication date: 01/07/2011
Field of study

Crossref

Journal for Language Technology and Computational Linguistics (JLCL)

Preface

Author: Mambrini Francesco
Passarotti Marco Carlo
Publication venue: place:Lisbona
Publication date: 01/01/2012
Field of study

Preface of the proceedings of the international workshop ACRH-

PubliCatt

Preface

Author: Dickinson Markus
Müürisep Kaili
Passarotti Marco
Publication venue
Publication date: 01/12/2010
Field of study

Proceedings of the Ninth International Workshop on Treebanks and Linguistic Theories. Editors: Markus Dickinson, Kaili Müürisep and Marco Passarotti. NEALT Proceedings Series, Vol. 9 (2010), iii-iv. © 2010 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/15891

ADA University of Tartu

Proceedings

Author: Dickinson Markus
Müürisep Kaili
Passarotti Marco
Publication venue
Publication date: 01/12/2010
Field of study

Proceedings of the Ninth International Workshop on Treebanks and Linguistic Theories. Editors: Markus Dickinson, Kaili Müürisep and Marco Passarotti. NEALT Proceedings Series, Vol. 9 (2010), 268 pages. © 2010 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/15891

ADA University of Tartu

The Lemma Bank of the LiITA Knowledge Base of Interoperable Resources for Italian

Author: Eleonora Litta
Francesco Mambrini
Marco Passarotti
Publication venue: place:Pisa
Publication date: 01/01/2024
Field of study

The paper introduces the LiITA Knowledge Base of interoperable linguistic resources for Italian. After describing the principles of the Linked Data paradigm, on which LiITA is grounded, the paper presents the lemma-centred architecture of the Knowledge Base and details its core component, consisting of a large collection of Italian lemmas (called the Lemma Bank) used to interlink distributed lexical and textual resources

PubliCatt

Proceedings of the Second Workshop on Annotation of Corpora for Research in the Humanities (ACRH-2). 29 November 2012, Lisbon, Portugal

Author: Mambrini Francesco
Passarotti Marco Carlo
Sporleder Caroline
Publication venue: place:Lisbona
Publication date: 01/01/2012
Field of study

Proceedings of the Second Workshop on Annotation of Corpora for Research in the Humanities (ACRH-2), held in Lisbon, Portugal on 29 November 2012

PubliCatt

A New Latin Treebank for Universal Dependencies : Charters between Ancient Latin and Romance Languages

Author: Cecchini Flavio Massimiliano
Korkiakangas Timo
Passarotti Marco
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/01/2020
Field of study

The present work introduces a new Latin treebank that follows the Universal Dependencies (UD) annotation standard. The treebank is obtained from the automated conversion of the Late Latin Charter Treebank 2 (LLCT2), originally in the Prague Dependency Treebank (PDT) style. As this treebank consists of Early Medieval legal documents, its language variety differs considerably from both the Classical and Medieval learned varieties prevalent in the other currently available UD Latin treebanks. Consequently, besides significant phenomena from the perspective of diachronic linguistics, this treebank also poses several challenging technical issues for the current and future syntactic annotation of Latin in the UD framework. Some of the most relevant cases are discussed in depth, with comparisons between the original PDT and the resulting UD annotations. Additionally, an overview of the UD-style structure of the treebank is given, and some diachronic aspects of the transition from Latin to Romance languages are highlighted.Peer reviewe

PubliCatt

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Helsingin yliopiston digitaalinen arkisto

UDante: First Steps Towards the Universal Dependencies Treebank of Dante’s Latin Works

Author: Cecchini Flavio
Moretti Giovanni
Passarotti Marco
Sprugnoli Rachele
Publication venue: place:Torino
Publication date: 01/01/2020
Field of study

This paper1 presents the early stages of the development of a new treebank containing all of Dante Alighieri’s Latin works. In particular, it describes the conversion of the original TEI-XML files to CoNLL-U, the creation of a gold standard, the process of training four annotators and the evaluation of the syntactic annotation in terms of inter-annotator agreement and LA, UAS and LAS. The aim is to release a new resource, in view of the celebrations for the 700th anniversary of Dante’s death, which can support the development of the Vocabolario Dantesco

Archivio istituzionale della Ricerca - Università degli Studi di Parma

Representing Compounding with OntoLex : An Evaluation of Vocabularies for Word Formation Resources

Author: E. Benzoni
F. Dede'
M. Passarotti
M. Pellegrini
Publication venue: ELRA and ICCL
Publication date: 01/01/2024
Field of study

This paper explores how compounds are represented in resources documenting word formation, and proposes ways to convert them into Linked Open Data using the OntoLex model. The ultimate purpose is to offer a broad empirical evaluation of which of the two OntoLex modules allowing for the representation of compounds {--} Decomp and Morph {--} fits best the different formats and theoretical approaches of the resources we examine. We show that the vocabulary of Decomp alone is rarely sufficient to account for all relevant facts; in almost all cases, it is necessary to resort to the vocabulary of Morph, either to reify the relation between compounds and their constituents or to represent specifically morphological information or other aspects. Special attention is devoted to the format of the Universal Derivations project: the modelling strategy that we propose can be applied to all resources harmonized in that format, potentially allowing for the conversion into Linked Open Data of a large amount of structured data

AIR Universita degli studi di Milano

A Collaborative Model of Treebank Development

Author: Bamman David
Crane Gregory
Passarotti Marco
Raynaud Savina
Publication venue
Publication date: 01/01/2007
Field of study

Proceedings of the Sixth International Workshop on Treebanks and Linguistic Theories. Editors: Koenraad De Smedt, Jan Hajič and Sandra Kübler. NEALT Proceedings Series, Vol. 1 (2007), 1-6. © 2007 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/4476

PubliCatt

ADA University of Tartu