This paper seeks to describe the creation of a Spanish lexicon with semantic annotation in order to analyse more extensive corpora in the Spanish language. The semantic resources most employed nowadays are WordNet, FrameNet, PDEV and USAS, but they have been used mainly for English language research. The creation of a large Spanish lexicon will permit a greater amount of studies of corpora in Spanish can be undertaken. In the description of the steps followed for the construction of the lexicon, the difficulties encountered in its creation, and the solutions used to overcome them will be described. Finally, the construction of the lexicon will allow specific research tasks to be carried out, such as metaphor analysis, ACD studies and even NLP studies.
|Title of host publication||Proceedings of the 35th Edition of the International Conference of The Spanish Association of Applied Linguistics|
|Subtitle of host publication||Languages at the Crossroads: Training, Accreditation and Context of Use|
|Editors||Francisco Javier Díez Pérez, María Águeda Moreno Moreno|
|Place of Publication||Jaén|
|Publisher||Publicaciones de la Universidad de Jaén|
|Number of pages||13|
|Publication status||Published - May 2017|
|Event||International Conference of the Spanish Association of Applied Linguistics - Universidad de Jaén, Andalucia, Spain|
Duration: 4 May 2017 → 6 May 2017
|Conference||International Conference of the Spanish Association of Applied Linguistics|
|Period||4/05/17 → 6/05/17|
Jiménez-Yáñez, R., Sanjurjo-González, H., Rayson, P., & Piao, S. (2017). Building a Spanish Lexicon for Corpus Analysis. In F. J. Díez Pérez, & M. Á. Moreno Moreno (Eds.), Proceedings of the 35th Edition of the International Conference of The Spanish Association of Applied Linguistics : Languages at the Crossroads: Training, Accreditation and Context of Use (Vol. 1, pp. 227-239). Jaén: Publicaciones de la Universidad de Jaén.