WordNet-SHP: Towards the building of a lexical database for a Peruvian minority language

Bibliographic Details
Authors: Maguiño-Valencia D., Oncevay-Marcos A., Sobrevilla Cabezudo M.A.
Format: conference object
Publication Date: 2019
Institution: Consejo Nacional de Ciencia Tecnología e Innovación
Repository: CONCYTEC-Institucional
Language: English
OAI Identifier: oai:repositorio.concytec.gob.pe:20.500.12390/819
Resource Link: https://hdl.handle.net/20.500.12390/819
Access Level: open access
Subject: Wordnet
Computational linguistics
Database systems
Natural language processing systems
Ships
Bilingual dictionary
Digital resources
Lexical database
Machine translations
Minority languages
Research and application
Word Sense Disambiguation
Ontology
https://purl.org/pe-repo/ocde/ford#6.02.06
Description
Summary: WordNet-like resources are lexical databases containing highly relevant information and data that can be exploited in more complex computational linguistics research and applications. Building one requires both manual and automatic tasks, which can be more arduous if the language is a minority one with fewer digital resources. This study focuses on the construction of an initial WordNet database for a low-resource indigenous language of Peru: Shipibo-Konibo (shp). First, the stages of development starting from a scarce-resource scenario (a bilingual shp-es dictionary) are described. Then, a synset alignment method is proposed that compares the definition glosses in the dictionary (written in Spanish) with the content of a Spanish WordNet. Word2vec similarity was the metric chosen to measure proximity. Finally, the synsets are evaluated against a manually annotated Gold Standard in Shipibo-Konibo. The results obtained are promising, and this resource is expected to serve well in further applications, such as word sense disambiguation and even machine translation for the shp-es language pair.
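The gloss-comparison idea in the summary can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not describe how gloss vectors are built or which candidates are compared. The sketch assumes that each gloss is represented by the average of its word vectors and scored with cosine similarity. The tiny `TOY_VECTORS` table, the synset identifiers, and the function names are all hypothetical stand-ins for a trained Spanish word2vec model and a real Spanish WordNet.

```python
import math

# Hypothetical toy embeddings standing in for a trained Spanish word2vec model.
TOY_VECTORS = {
    "animal": [0.9, 0.1, 0.0],
    "pez": [0.8, 0.2, 0.1],
    "agua": [0.6, 0.1, 0.5],
    "río": [0.5, 0.0, 0.6],
    "casa": [0.0, 0.9, 0.1],
    "edificio": [0.1, 0.8, 0.2],
    "vivir": [0.2, 0.7, 0.3],
}


def gloss_vector(gloss):
    """Average the vectors of the known words in a gloss (None if none are known)."""
    vectors = [TOY_VECTORS[w] for w in gloss.lower().split() if w in TOY_VECTORS]
    if not vectors:
        return None
    return [sum(component) / len(vectors) for component in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def align(dictionary_gloss, candidate_synsets):
    """Return (synset_id, score) for the most similar candidate gloss, or (None, 0.0)."""
    query = gloss_vector(dictionary_gloss)
    best = (None, 0.0)
    if query is None:
        return best
    for synset_id, synset_gloss in candidate_synsets.items():
        candidate = gloss_vector(synset_gloss)
        if candidate is None:
            continue
        score = cosine(query, candidate)
        if score > best[1]:
            best = (synset_id, score)
    return best


# Hypothetical Spanish WordNet glosses keyed by made-up synset identifiers.
candidates = {
    "synset-pez": "animal que vive en el agua",
    "synset-casa": "edificio para vivir",
}
best_id, best_score = align("animal del río", candidates)
print(best_id, round(best_score, 3))
```

In a real setting the toy table would be replaced by vectors from a trained model, and a similarity threshold would likely be needed to reject weak alignments.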
Important note:
The information contained in this record is the sole responsibility of the institution that manages the institutional repository where this document or dataset is held. CONCYTEC is not responsible for the contents (publications and/or data) accessible through the Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).