Ship-lemmatagger: Building an nlp toolkit for a peruvian native language

Pereira-Noriega J.; Mercado-Gonzales R.; Melgar A.; Sobrevilla-Cabezudo M.; Oncevay-Marcos A.

Ship-lemmatagger: Building an nlp toolkit for a peruvian native language

Descripción del Articulo

Natural Language Processing deals with the understanding and generation of texts through computer programs. There are many different functionalities used in this area, but among them there are some functions that are the support of the remaining ones. These methods are related to the core processing...

Descripción completa

Detalles Bibliográficos
Autores:	Pereira-Noriega J., Mercado-Gonzales R., Melgar A., Sobrevilla-Cabezudo M., Oncevay-Marcos A.
Formato:	objeto de conferencia
Fecha de Publicación:	2017
Institución:	Consejo Nacional de Ciencia Tecnología e Innovación
Repositorio:	CONCYTEC-Institucional
Lenguaje:	inglés
OAI Identifier:	oai:repositorio.concytec.gob.pe:20.500.12390/773
Enlace del recurso:	https://hdl.handle.net/20.500.12390/773 https://doi.org/10.1007/978-3-319-64206-2_53
Nivel de acceso:	acceso abierto
Materia:	Text processing Automation Computational linguistics Ships Automatic identification Core processing Lemmatization Low resource languages Machine translations Native language Part of speech tagging Shipibo-konibo Natural language processing systems https://purl.org/pe-repo/ocde/ford#2.00.00

id	CONC_b7e04ae9fb1f4bb649af66650afa2e1d
oai_identifier_str	oai:repositorio.concytec.gob.pe:20.500.12390/773
network_acronym_str	CONC
network_name_str	CONCYTEC-Institucional
repository_id_str	4689
dc.title.none.fl_str_mv	Ship-lemmatagger: Building an nlp toolkit for a peruvian native language
title	Ship-lemmatagger: Building an nlp toolkit for a peruvian native language
spellingShingle	Ship-lemmatagger: Building an nlp toolkit for a peruvian native language Pereira-Noriega J. Text processing Automation Computational linguistics Ships Automatic identification Core processing Lemmatization Low resource languages Machine translations Native language Part of speech tagging Shipibo-konibo Natural language processing systems https://purl.org/pe-repo/ocde/ford#2.00.00
title_short	Ship-lemmatagger: Building an nlp toolkit for a peruvian native language
title_full	Ship-lemmatagger: Building an nlp toolkit for a peruvian native language
title_fullStr	Ship-lemmatagger: Building an nlp toolkit for a peruvian native language
title_full_unstemmed	Ship-lemmatagger: Building an nlp toolkit for a peruvian native language
title_sort	Ship-lemmatagger: Building an nlp toolkit for a peruvian native language
author	Pereira-Noriega J.
author_facet	Pereira-Noriega J. Mercado-Gonzales R. Melgar A. Sobrevilla-Cabezudo M. Oncevay-Marcos A.
author_role	author
author2	Mercado-Gonzales R. Melgar A. Sobrevilla-Cabezudo M. Oncevay-Marcos A.
author2_role	author author author author
dc.contributor.author.fl_str_mv	Pereira-Noriega J. Mercado-Gonzales R. Melgar A. Sobrevilla-Cabezudo M. Oncevay-Marcos A.
dc.subject.none.fl_str_mv	Text processing
topic	Text processing Automation Computational linguistics Ships Automatic identification Core processing Lemmatization Low resource languages Machine translations Native language Part of speech tagging Shipibo-konibo Natural language processing systems https://purl.org/pe-repo/ocde/ford#2.00.00
dc.subject.es_PE.fl_str_mv	Automation Computational linguistics Ships Automatic identification Core processing Lemmatization Low resource languages Machine translations Native language Part of speech tagging Shipibo-konibo Natural language processing systems
dc.subject.ocde.none.fl_str_mv	https://purl.org/pe-repo/ocde/ford#2.00.00
description	Natural Language Processing deals with the understanding and generation of texts through computer programs. There are many different functionalities used in this area, but among them there are some functions that are the support of the remaining ones. These methods are related to the core processing of the morphology of the language (such as lemmatization) and automatic identification of the part-of-speech tag. Thereby, this paper describes the implementation of a basic NLP toolkit for a new language, focusing in the features mentioned before, and testing them in an own corpus built for the occasion. The obtained results exceeded the expected results and could be used for more complex tasks such as machine translation.
publishDate	2017
dc.date.accessioned.none.fl_str_mv	2024-05-30T23:13:38Z
dc.date.available.none.fl_str_mv	2024-05-30T23:13:38Z
dc.date.issued.fl_str_mv	2017
dc.type.none.fl_str_mv	info:eu-repo/semantics/conferenceObject
format	conferenceObject
dc.identifier.isbn.none.fl_str_mv	urn:isbn:9783319642055
dc.identifier.uri.none.fl_str_mv	https://hdl.handle.net/20.500.12390/773
dc.identifier.doi.none.fl_str_mv	https://doi.org/10.1007/978-3-319-64206-2_53
dc.identifier.scopus.none.fl_str_mv	2-s2.0-85028645758
identifier_str_mv	urn:isbn:9783319642055 2-s2.0-85028645758
url	https://hdl.handle.net/20.500.12390/773 https://doi.org/10.1007/978-3-319-64206-2_53
dc.language.iso.none.fl_str_mv	eng
language	eng
dc.relation.ispartof.none.fl_str_mv	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.rights.none.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.publisher.none.fl_str_mv	Springer Verlag
publisher.none.fl_str_mv	Springer Verlag
dc.source.none.fl_str_mv	reponame:CONCYTEC-Institucional instname:Consejo Nacional de Ciencia Tecnología e Innovación instacron:CONCYTEC
instname_str	Consejo Nacional de Ciencia Tecnología e Innovación
instacron_str	CONCYTEC
institution	CONCYTEC
reponame_str	CONCYTEC-Institucional
collection	CONCYTEC-Institucional
repository.name.fl_str_mv	Repositorio Institucional CONCYTEC
repository.mail.fl_str_mv	repositorio@concytec.gob.pe
_version_	1854395723892654080
spelling	Publicationrp00955500rp00954500rp01007500rp01987600rp00570500Pereira-Noriega J.Mercado-Gonzales R.Melgar A.Sobrevilla-Cabezudo M.Oncevay-Marcos A.2024-05-30T23:13:38Z2024-05-30T23:13:38Z2017urn:isbn:9783319642055https://hdl.handle.net/20.500.12390/773https://doi.org/10.1007/978-3-319-64206-2_532-s2.0-85028645758Natural Language Processing deals with the understanding and generation of texts through computer programs. There are many different functionalities used in this area, but among them there are some functions that are the support of the remaining ones. These methods are related to the core processing of the morphology of the language (such as lemmatization) and automatic identification of the part-of-speech tag. Thereby, this paper describes the implementation of a basic NLP toolkit for a new language, focusing in the features mentioned before, and testing them in an own corpus built for the occasion. The obtained results exceeded the expected results and could be used for more complex tasks such as machine translation.Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica - ConcytecengSpringer VerlagLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)info:eu-repo/semantics/openAccessText processingAutomation-1Computational linguistics-1Ships-1Automatic identification-1Core processing-1Lemmatization-1Low resource languages-1Machine translations-1Native language-1Part of speech tagging-1Shipibo-konibo-1Natural language processing systems-1https://purl.org/pe-repo/ocde/ford#2.00.00-1Ship-lemmatagger: Building an nlp toolkit for a peruvian native languageinfo:eu-repo/semantics/conferenceObjectreponame:CONCYTEC-Institucionalinstname:Consejo Nacional de Ciencia Tecnología e Innovacióninstacron:CONCYTEC20.500.12390/773oai:repositorio.concytec.gob.pe:20.500.12390/7732024-05-30 15:58:58.846http://purl.org/coar/access_right/c_14cbinfo:eu-repo/semantics/closedAccessmetadata only accesshttps://repositorio.concytec.gob.peRepositorio Institucional CONCYTECrepositorio@concytec.gob.pe#PLACEHOLDER_PARENT_METADATA_VALUE##PLACEHOLDER_PARENT_METADATA_VALUE##PLACEHOLDER_PARENT_METADATA_VALUE##PLACEHOLDER_PARENT_METADATA_VALUE##PLACEHOLDER_PARENT_METADATA_VALUE#<Publication xmlns="https://www.openaire.eu/cerif-profile/1.1/" id="351c87dc-7cb2-4ee3-95e8-fe679965d455"> <Type xmlns="https://www.openaire.eu/cerif-profile/vocab/COAR_Publication_Types">http://purl.org/coar/resource_type/c_1843</Type> <Language>eng</Language> <Title>Ship-lemmatagger: Building an nlp toolkit for a peruvian native language</Title> <PublishedIn> <Publication> <Title>Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</Title> </Publication> </PublishedIn> <PublicationDate>2017</PublicationDate> <DOI>https://doi.org/10.1007/978-3-319-64206-2_53</DOI> <SCP-Number>2-s2.0-85028645758</SCP-Number> <ISBN>urn:isbn:9783319642055</ISBN> <Authors> <Author> <DisplayName>Pereira-Noriega J.</DisplayName> <Person id="rp00955" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> <Author> <DisplayName>Mercado-Gonzales R.</DisplayName> <Person id="rp00954" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> <Author> <DisplayName>Melgar A.</DisplayName> <Person id="rp01007" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> <Author> <DisplayName>Sobrevilla-Cabezudo M.</DisplayName> <Person id="rp01987" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> <Author> <DisplayName>Oncevay-Marcos A.</DisplayName> <Person id="rp00570" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> </Authors> <Editors> </Editors> <Publishers> <Publisher> <DisplayName>Springer Verlag</DisplayName> <OrgUnit /> </Publisher> </Publishers> <Keyword>Text processing</Keyword> <Keyword>Automation</Keyword> <Keyword>Computational linguistics</Keyword> <Keyword>Ships</Keyword> <Keyword>Automatic identification</Keyword> <Keyword>Core processing</Keyword> <Keyword>Lemmatization</Keyword> <Keyword>Low resource languages</Keyword> <Keyword>Machine translations</Keyword> <Keyword>Native language</Keyword> <Keyword>Part of speech tagging</Keyword> <Keyword>Shipibo-konibo</Keyword> <Keyword>Natural language processing systems</Keyword> <Abstract>Natural Language Processing deals with the understanding and generation of texts through computer programs. There are many different functionalities used in this area, but among them there are some functions that are the support of the remaining ones. These methods are related to the core processing of the morphology of the language (such as lemmatization) and automatic identification of the part-of-speech tag. Thereby, this paper describes the implementation of a basic NLP toolkit for a new language, focusing in the features mentioned before, and testing them in an own corpus built for the occasion. The obtained results exceeded the expected results and could be used for more complex tasks such as machine translation.</Abstract> <Access xmlns="http://purl.org/coar/access_right" > </Access> </Publication> -1
score	13.918711

Ship-lemmatagger: Building an nlp toolkit for a peruvian native language

Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).

Ship-lemmatagger: Building an nlp toolkit for a peruvian native language

Descripción del Articulo

Ejemplares Similares