Ship-lemmatagger: Building an nlp toolkit for a peruvian native language
Descripción del Articulo
Natural Language Processing deals with the understanding and generation of texts through computer programs. There are many different functionalities used in this area, but among them there are some functions that are the support of the remaining ones. These methods are related to the core processing...
Autores: | , , , , |
---|---|
Formato: | objeto de conferencia |
Fecha de Publicación: | 2017 |
Institución: | Consejo Nacional de Ciencia Tecnología e Innovación |
Repositorio: | CONCYTEC-Institucional |
Lenguaje: | inglés |
OAI Identifier: | oai:repositorio.concytec.gob.pe:20.500.12390/773 |
Enlace del recurso: | https://hdl.handle.net/20.500.12390/773 https://doi.org/10.1007/978-3-319-64206-2_53 |
Nivel de acceso: | acceso abierto |
Materia: | Text processing Automation Computational linguistics Ships Automatic identification Core processing Lemmatization Low resource languages Machine translations Native language Part of speech tagging Shipibo-konibo Natural language processing systems https://purl.org/pe-repo/ocde/ford#2.00.00 |
id |
CONC_b7e04ae9fb1f4bb649af66650afa2e1d |
---|---|
oai_identifier_str |
oai:repositorio.concytec.gob.pe:20.500.12390/773 |
network_acronym_str |
CONC |
network_name_str |
CONCYTEC-Institucional |
repository_id_str |
4689 |
dc.title.none.fl_str_mv |
Ship-lemmatagger: Building an nlp toolkit for a peruvian native language |
title |
Ship-lemmatagger: Building an nlp toolkit for a peruvian native language |
spellingShingle |
Ship-lemmatagger: Building an nlp toolkit for a peruvian native language Pereira-Noriega J. Text processing Automation Computational linguistics Ships Automatic identification Core processing Lemmatization Low resource languages Machine translations Native language Part of speech tagging Shipibo-konibo Natural language processing systems https://purl.org/pe-repo/ocde/ford#2.00.00 |
title_short |
Ship-lemmatagger: Building an nlp toolkit for a peruvian native language |
title_full |
Ship-lemmatagger: Building an nlp toolkit for a peruvian native language |
title_fullStr |
Ship-lemmatagger: Building an nlp toolkit for a peruvian native language |
title_full_unstemmed |
Ship-lemmatagger: Building an nlp toolkit for a peruvian native language |
title_sort |
Ship-lemmatagger: Building an nlp toolkit for a peruvian native language |
author |
Pereira-Noriega J. |
author_facet |
Pereira-Noriega J. Mercado-Gonzales R. Melgar A. Sobrevilla-Cabezudo M. Oncevay-Marcos A. |
author_role |
author |
author2 |
Mercado-Gonzales R. Melgar A. Sobrevilla-Cabezudo M. Oncevay-Marcos A. |
author2_role |
author author author author |
dc.contributor.author.fl_str_mv |
Pereira-Noriega J. Mercado-Gonzales R. Melgar A. Sobrevilla-Cabezudo M. Oncevay-Marcos A. |
dc.subject.none.fl_str_mv |
Text processing |
topic |
Text processing Automation Computational linguistics Ships Automatic identification Core processing Lemmatization Low resource languages Machine translations Native language Part of speech tagging Shipibo-konibo Natural language processing systems https://purl.org/pe-repo/ocde/ford#2.00.00 |
dc.subject.es_PE.fl_str_mv |
Automation Computational linguistics Ships Automatic identification Core processing Lemmatization Low resource languages Machine translations Native language Part of speech tagging Shipibo-konibo Natural language processing systems |
dc.subject.ocde.none.fl_str_mv |
https://purl.org/pe-repo/ocde/ford#2.00.00 |
description |
Natural Language Processing deals with the understanding and generation of texts through computer programs. There are many different functionalities used in this area, but among them there are some functions that are the support of the remaining ones. These methods are related to the core processing of the morphology of the language (such as lemmatization) and automatic identification of the part-of-speech tag. Thereby, this paper describes the implementation of a basic NLP toolkit for a new language, focusing in the features mentioned before, and testing them in an own corpus built for the occasion. The obtained results exceeded the expected results and could be used for more complex tasks such as machine translation. |
publishDate |
2017 |
dc.date.accessioned.none.fl_str_mv |
2024-05-30T23:13:38Z |
dc.date.available.none.fl_str_mv |
2024-05-30T23:13:38Z |
dc.date.issued.fl_str_mv |
2017 |
dc.type.none.fl_str_mv |
info:eu-repo/semantics/conferenceObject |
format |
conferenceObject |
dc.identifier.isbn.none.fl_str_mv |
urn:isbn:9783319642055 |
dc.identifier.uri.none.fl_str_mv |
https://hdl.handle.net/20.500.12390/773 |
dc.identifier.doi.none.fl_str_mv |
https://doi.org/10.1007/978-3-319-64206-2_53 |
dc.identifier.scopus.none.fl_str_mv |
2-s2.0-85028645758 |
identifier_str_mv |
urn:isbn:9783319642055 2-s2.0-85028645758 |
url |
https://hdl.handle.net/20.500.12390/773 https://doi.org/10.1007/978-3-319-64206-2_53 |
dc.language.iso.none.fl_str_mv |
eng |
language |
eng |
dc.relation.ispartof.none.fl_str_mv |
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
dc.rights.none.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
Springer Verlag |
publisher.none.fl_str_mv |
Springer Verlag |
dc.source.none.fl_str_mv |
reponame:CONCYTEC-Institucional instname:Consejo Nacional de Ciencia Tecnología e Innovación instacron:CONCYTEC |
instname_str |
Consejo Nacional de Ciencia Tecnología e Innovación |
instacron_str |
CONCYTEC |
institution |
CONCYTEC |
reponame_str |
CONCYTEC-Institucional |
collection |
CONCYTEC-Institucional |
repository.name.fl_str_mv |
Repositorio Institucional CONCYTEC |
repository.mail.fl_str_mv |
repositorio@concytec.gob.pe |
_version_ |
1839175492586962944 |
spelling |
Publicationrp00955500rp00954500rp01007500rp01987600rp00570500Pereira-Noriega J.Mercado-Gonzales R.Melgar A.Sobrevilla-Cabezudo M.Oncevay-Marcos A.2024-05-30T23:13:38Z2024-05-30T23:13:38Z2017urn:isbn:9783319642055https://hdl.handle.net/20.500.12390/773https://doi.org/10.1007/978-3-319-64206-2_532-s2.0-85028645758Natural Language Processing deals with the understanding and generation of texts through computer programs. There are many different functionalities used in this area, but among them there are some functions that are the support of the remaining ones. These methods are related to the core processing of the morphology of the language (such as lemmatization) and automatic identification of the part-of-speech tag. Thereby, this paper describes the implementation of a basic NLP toolkit for a new language, focusing in the features mentioned before, and testing them in an own corpus built for the occasion. The obtained results exceeded the expected results and could be used for more complex tasks such as machine translation.Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica - ConcytecengSpringer VerlagLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)info:eu-repo/semantics/openAccessText processingAutomation-1Computational linguistics-1Ships-1Automatic identification-1Core processing-1Lemmatization-1Low resource languages-1Machine translations-1Native language-1Part of speech tagging-1Shipibo-konibo-1Natural language processing systems-1https://purl.org/pe-repo/ocde/ford#2.00.00-1Ship-lemmatagger: Building an nlp toolkit for a peruvian native languageinfo:eu-repo/semantics/conferenceObjectreponame:CONCYTEC-Institucionalinstname:Consejo Nacional de Ciencia Tecnología e Innovacióninstacron:CONCYTEC20.500.12390/773oai:repositorio.concytec.gob.pe:20.500.12390/7732024-05-30 15:58:58.846http://purl.org/coar/access_right/c_14cbinfo:eu-repo/semantics/closedAccessmetadata only accesshttps://repositorio.concytec.gob.peRepositorio Institucional CONCYTECrepositorio@concytec.gob.pe#PLACEHOLDER_PARENT_METADATA_VALUE##PLACEHOLDER_PARENT_METADATA_VALUE##PLACEHOLDER_PARENT_METADATA_VALUE##PLACEHOLDER_PARENT_METADATA_VALUE##PLACEHOLDER_PARENT_METADATA_VALUE#<Publication xmlns="https://www.openaire.eu/cerif-profile/1.1/" id="351c87dc-7cb2-4ee3-95e8-fe679965d455"> <Type xmlns="https://www.openaire.eu/cerif-profile/vocab/COAR_Publication_Types">http://purl.org/coar/resource_type/c_1843</Type> <Language>eng</Language> <Title>Ship-lemmatagger: Building an nlp toolkit for a peruvian native language</Title> <PublishedIn> <Publication> <Title>Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</Title> </Publication> </PublishedIn> <PublicationDate>2017</PublicationDate> <DOI>https://doi.org/10.1007/978-3-319-64206-2_53</DOI> <SCP-Number>2-s2.0-85028645758</SCP-Number> <ISBN>urn:isbn:9783319642055</ISBN> <Authors> <Author> <DisplayName>Pereira-Noriega J.</DisplayName> <Person id="rp00955" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> <Author> <DisplayName>Mercado-Gonzales R.</DisplayName> <Person id="rp00954" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> <Author> <DisplayName>Melgar A.</DisplayName> <Person id="rp01007" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> <Author> <DisplayName>Sobrevilla-Cabezudo M.</DisplayName> <Person id="rp01987" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> <Author> <DisplayName>Oncevay-Marcos A.</DisplayName> <Person id="rp00570" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> </Authors> <Editors> </Editors> <Publishers> <Publisher> <DisplayName>Springer Verlag</DisplayName> <OrgUnit /> </Publisher> </Publishers> <Keyword>Text processing</Keyword> <Keyword>Automation</Keyword> <Keyword>Computational linguistics</Keyword> <Keyword>Ships</Keyword> <Keyword>Automatic identification</Keyword> <Keyword>Core processing</Keyword> <Keyword>Lemmatization</Keyword> <Keyword>Low resource languages</Keyword> <Keyword>Machine translations</Keyword> <Keyword>Native language</Keyword> <Keyword>Part of speech tagging</Keyword> <Keyword>Shipibo-konibo</Keyword> <Keyword>Natural language processing systems</Keyword> <Abstract>Natural Language Processing deals with the understanding and generation of texts through computer programs. There are many different functionalities used in this area, but among them there are some functions that are the support of the remaining ones. These methods are related to the core processing of the morphology of the language (such as lemmatization) and automatic identification of the part-of-speech tag. Thereby, this paper describes the implementation of a basic NLP toolkit for a new language, focusing in the features mentioned before, and testing them in an own corpus built for the occasion. The obtained results exceeded the expected results and could be used for more complex tasks such as machine translation.</Abstract> <Access xmlns="http://purl.org/coar/access_right" > </Access> </Publication> -1 |
score |
13.4481325 |
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).