Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru

Mercado-Gonzales R.; Pereira-Noriega J.; Sobrevilla M.; Oncevay A.

Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru

Descripción del Articulo

Linguistic corpus annotation is one of the most important phases for solving Natural Language Processing (NLP) tasks, as these methods are deeply involved with corpus-based techniques. However, meta-data annotation is a highly laborious manual task. A supportive alternative requires the use of compu...

Descripción completa

Detalles Bibliográficos
Autores:	Mercado-Gonzales R., Pereira-Noriega J., Sobrevilla M., Oncevay A.
Formato:	objeto de conferencia
Fecha de Publicación:	2019
Institución:	Consejo Nacional de Ciencia Tecnología e Innovación
Repositorio:	CONCYTEC-Institucional
Lenguaje:	inglés
OAI Identifier:	oai:repositorio.concytec.gob.pe:20.500.12390/547
Enlace del recurso:	https://hdl.handle.net/20.500.12390/547
Nivel de acceso:	acceso abierto
Materia:	Ships Data mining Learning algorithms Learning systems Natural language processing systems Agglutinative language Annotation tool Computational tools Corpus annotations Linguistic annotations https://purl.org/pe-repo/ocde/ford#6.02.06

id	CONC_9d3a6658c568976e7ff62f50045d82ae
oai_identifier_str	oai:repositorio.concytec.gob.pe:20.500.12390/547
network_acronym_str	CONC
network_name_str	CONCYTEC-Institucional
repository_id_str	4689
dc.title.none.fl_str_mv	Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru
title	Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru
spellingShingle	Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru Mercado-Gonzales R. Ships Data mining Learning algorithms Learning systems Natural language processing systems Agglutinative language Annotation tool Computational tools Corpus annotations Linguistic annotations https://purl.org/pe-repo/ocde/ford#6.02.06
title_short	Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru
title_full	Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru
title_fullStr	Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru
title_full_unstemmed	Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru
title_sort	Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru
author	Mercado-Gonzales R.
author_facet	Mercado-Gonzales R. Pereira-Noriega J. Sobrevilla M. Oncevay A.
author_role	author
author2	Pereira-Noriega J. Sobrevilla M. Oncevay A.
author2_role	author author author
dc.contributor.author.fl_str_mv	Mercado-Gonzales R. Pereira-Noriega J. Sobrevilla M. Oncevay A.
dc.subject.none.fl_str_mv	Ships
topic	Ships Data mining Learning algorithms Learning systems Natural language processing systems Agglutinative language Annotation tool Computational tools Corpus annotations Linguistic annotations https://purl.org/pe-repo/ocde/ford#6.02.06
dc.subject.es_PE.fl_str_mv	Data mining Learning algorithms Learning systems Natural language processing systems Agglutinative language Annotation tool Computational tools Corpus annotations Linguistic annotations
dc.subject.ocde.none.fl_str_mv	https://purl.org/pe-repo/ocde/ford#6.02.06
description	Linguistic corpus annotation is one of the most important phases for solving Natural Language Processing (NLP) tasks, as these methods are deeply involved with corpus-based techniques. However, meta-data annotation is a highly laborious manual task. A supportive alternative requires the use of computational tools. They are likely to simplify some of these operations, while can be adjusted appropriately to the needs of particular language features at the same time. Therefore, this paper presents ChAnot, a web-based annotation tool developed for Peruvian indigenous and highly agglutinative languages, where Shipibo-Konibo was the case study. This new tool is able to support a diverse set of linguistic annotation tasks, such as word segmentation, POS-tag markup, among others. Also, it includes a suggestion engine based on historic and machine learning models, and a set of statistics about previous annotations.
publishDate	2019
dc.date.accessioned.none.fl_str_mv	2024-05-30T23:13:38Z
dc.date.available.none.fl_str_mv	2024-05-30T23:13:38Z
dc.date.issued.fl_str_mv	2019
dc.type.none.fl_str_mv	info:eu-repo/semantics/conferenceObject
format	conferenceObject
dc.identifier.isbn.none.fl_str_mv	urn:isbn:9791095546009
dc.identifier.uri.none.fl_str_mv	https://hdl.handle.net/20.500.12390/547
dc.identifier.scopus.none.fl_str_mv	2-s2.0-85059897933
identifier_str_mv	urn:isbn:9791095546009 2-s2.0-85059897933
url	https://hdl.handle.net/20.500.12390/547
dc.language.iso.none.fl_str_mv	eng
language	eng
dc.relation.ispartof.none.fl_str_mv	LREC 2018 - 11th International Conference on Language Resources and Evaluation
dc.rights.none.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.publisher.none.fl_str_mv	European Language Resources Association (ELRA)
publisher.none.fl_str_mv	European Language Resources Association (ELRA)
dc.source.none.fl_str_mv	reponame:CONCYTEC-Institucional instname:Consejo Nacional de Ciencia Tecnología e Innovación instacron:CONCYTEC
instname_str	Consejo Nacional de Ciencia Tecnología e Innovación
instacron_str	CONCYTEC
institution	CONCYTEC
reponame_str	CONCYTEC-Institucional
collection	CONCYTEC-Institucional
repository.name.fl_str_mv	Repositorio Institucional CONCYTEC
repository.mail.fl_str_mv	repositorio@concytec.gob.pe
_version_	1870084316882534400
spelling	Publicationrp00954600rp00955600rp00953600rp00952600Mercado-Gonzales R.Pereira-Noriega J.Sobrevilla M.Oncevay A.2024-05-30T23:13:38Z2024-05-30T23:13:38Z2019urn:isbn:9791095546009https://hdl.handle.net/20.500.12390/5472-s2.0-85059897933Linguistic corpus annotation is one of the most important phases for solving Natural Language Processing (NLP) tasks, as these methods are deeply involved with corpus-based techniques. However, meta-data annotation is a highly laborious manual task. A supportive alternative requires the use of computational tools. They are likely to simplify some of these operations, while can be adjusted appropriately to the needs of particular language features at the same time. Therefore, this paper presents ChAnot, a web-based annotation tool developed for Peruvian indigenous and highly agglutinative languages, where Shipibo-Konibo was the case study. This new tool is able to support a diverse set of linguistic annotation tasks, such as word segmentation, POS-tag markup, among others. Also, it includes a suggestion engine based on historic and machine learning models, and a set of statistics about previous annotations.Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica - ConcytecengEuropean Language Resources Association (ELRA)LREC 2018 - 11th International Conference on Language Resources and Evaluationinfo:eu-repo/semantics/openAccessShipsData mining-1Learning algorithms-1Learning systems-1Natural language processing systems-1Agglutinative language-1Annotation tool-1Computational tools-1Corpus annotations-1Linguistic annotations-1https://purl.org/pe-repo/ocde/ford#6.02.06-1Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peruinfo:eu-repo/semantics/conferenceObjectreponame:CONCYTEC-Institucionalinstname:Consejo Nacional de Ciencia Tecnología e Innovacióninstacron:CONCYTEC20.500.12390/547oai:repositorio.concytec.gob.pe:20.500.12390/5472024-05-30 15:57:54.523http://purl.org/coar/access_right/c_14cbinfo:eu-repo/semantics/closedAccessmetadata only accesshttps://repositorio.concytec.gob.peRepositorio Institucional CONCYTECrepositorio@concytec.gob.pe#PLACEHOLDER_PARENT_METADATA_VALUE##PLACEHOLDER_PARENT_METADATA_VALUE##PLACEHOLDER_PARENT_METADATA_VALUE##PLACEHOLDER_PARENT_METADATA_VALUE#<Publication xmlns="https://www.openaire.eu/cerif-profile/1.1/" id="d982418d-268c-433b-8175-e6574299e93d"> <Type xmlns="https://www.openaire.eu/cerif-profile/vocab/COAR_Publication_Types">http://purl.org/coar/resource_type/c_1843</Type> <Language>eng</Language> <Title>Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru</Title> <PublishedIn> <Publication> <Title>LREC 2018 - 11th International Conference on Language Resources and Evaluation</Title> </Publication> </PublishedIn> <PublicationDate>2019</PublicationDate> <SCP-Number>2-s2.0-85059897933</SCP-Number> <ISBN>urn:isbn:9791095546009</ISBN> <Authors> <Author> <DisplayName>Mercado-Gonzales R.</DisplayName> <Person id="rp00954" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> <Author> <DisplayName>Pereira-Noriega J.</DisplayName> <Person id="rp00955" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> <Author> <DisplayName>Sobrevilla M.</DisplayName> <Person id="rp00953" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> <Author> <DisplayName>Oncevay A.</DisplayName> <Person id="rp00952" /> <Affiliation> <OrgUnit> </OrgUnit> </Affiliation> </Author> </Authors> <Editors> </Editors> <Publishers> <Publisher> <DisplayName>European Language Resources Association (ELRA)</DisplayName> <OrgUnit /> </Publisher> </Publishers> <Keyword>Ships</Keyword> <Keyword>Data mining</Keyword> <Keyword>Learning algorithms</Keyword> <Keyword>Learning systems</Keyword> <Keyword>Natural language processing systems</Keyword> <Keyword>Agglutinative language</Keyword> <Keyword>Annotation tool</Keyword> <Keyword>Computational tools</Keyword> <Keyword>Corpus annotations</Keyword> <Keyword>Linguistic annotations</Keyword> <Abstract>Linguistic corpus annotation is one of the most important phases for solving Natural Language Processing (NLP) tasks, as these methods are deeply involved with corpus-based techniques. However, meta-data annotation is a highly laborious manual task. A supportive alternative requires the use of computational tools. They are likely to simplify some of these operations, while can be adjusted appropriately to the needs of particular language features at the same time. Therefore, this paper presents ChAnot, a web-based annotation tool developed for Peruvian indigenous and highly agglutinative languages, where Shipibo-Konibo was the case study. This new tool is able to support a diverse set of linguistic annotation tasks, such as word segmentation, POS-tag markup, among others. Also, it includes a suggestion engine based on historic and machine learning models, and a set of statistics about previous annotations.</Abstract> <Access xmlns="http://purl.org/coar/access_right" > </Access> </Publication> -1
score	13.411838

Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru

Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).

Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru

Descripción del Articulo

Ejemplares Similares