Building corpora for the development of a dependency parser for spanish using Maltparser
Material type: ArticleLanguage: Spanish Publication details: Spain : Sociedad Española para el Procesamiento del Lenguaje Natural, 2007.ISSN:- 1135-5948
- 1989-7553 (Online)
Item type | Current library | Collection | Call number | Status | Date due | Barcode | Item holds | |
---|---|---|---|---|---|---|---|---|
Article | CIMMYT Knowledge Center: John Woolston Library | Reprints Collection | Available |
Peer review
Abstract in Spanish and English
The present paper details the process followed for creating training and test corpora for a dependency parser generator (Maltparser). The starting point is the Cast3LB corpus, which contains constituency analyses of Spanish texts. These constituency analyses are automatically transformed into dependency analyses. In addition, the empirically and semiautomatically obtention of a set of syntactic function labels for the training corpus is described. As a result of the process followed, it has been obtained a dependency parser for Spanish showing a 91% precision when determining dependencies.
Text in English
Herrera, J. : No CIMMYT Affiliation