Building a Spanish Lexicon for Corpus Analysis

Ricardo Jiménez-Yáñez, H. Sanjurjo-González, Paul Rayson, Scott Piao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review


This paper seeks to describe the creation of a Spanish lexicon with semantic annotation in order to analyse more extensive corpora in the Spanish language. The semantic resources most employed nowadays are WordNet, FrameNet, PDEV and USAS, but they have been used mainly for English language research. The creation of a large Spanish lexicon will permit a greater amount of studies of corpora in Spanish can be undertaken. In the description of the steps followed for the construction of the lexicon, the difficulties encountered in its creation, and the solutions used to overcome them will be described. Finally, the construction of the lexicon will allow specific research tasks to be carried out, such as metaphor analysis, ACD studies and even NLP studies.
Original languageEnglish
Title of host publicationProceedings of the 35th Edition of the International Conference of The Spanish Association of Applied Linguistics
Subtitle of host publicationLanguages at the Crossroads: Training, Accreditation and Context of Use
EditorsFrancisco Javier Díez Pérez, María Águeda Moreno Moreno
Place of PublicationJaén
PublisherPublicaciones de la Universidad de Jaén
Number of pages13
ISBN (Print)9788491591085
Publication statusPublished - May 2017
Externally publishedYes
EventInternational Conference of the Spanish Association of Applied Linguistics - Universidad de Jaén, Andalucia, Spain
Duration: 4 May 20176 May 2017


ConferenceInternational Conference of the Spanish Association of Applied Linguistics
Internet address


Dive into the research topics of 'Building a Spanish Lexicon for Corpus Analysis'. Together they form a unique fingerprint.

Cite this