Knowledge Center Catalog

Building corpora for the development of a dependency parser for spanish using Maltparser

Herrera, J.

Building corpora for the development of a dependency parser for spanish using Maltparser - Spain : Sociedad EspaƱola para el Procesamiento del Lenguaje Natural, 2007.

Peer review Abstract in Spanish and English

The present paper details the process followed for creating training and test corpora for a dependency parser generator (Maltparser). The starting point is the Cast3LB corpus, which contains constituency analyses of Spanish texts. These constituency analyses are automatically transformed into dependency analyses. In addition, the empirically and semiautomatically obtention of a set of syntactic function labels for the training corpus is described. As a result of the process followed, it has been obtained a dependency parser for Spanish showing a 91% precision when determining dependencies.


Text in English

1135-5948 1989-7553 (Online)


Machine learning
Languages
Text mining

International Maize and Wheat Improvement Center (CIMMYT) © Copyright 2021.
Carretera México-Veracruz. Km. 45, El Batán, Texcoco, México, C.P. 56237.
If you have any question, please contact us at
CIMMYT-Knowledge-Center@cgiar.org