A comparison of classification models to detect cyberbullying in the peruvian spanish language on Twitter
Article description
Cyberbullying is a social problem in which bullies’ actions are more harmful than in traditional forms of bullying as they have the power to repeatedly humiliate the victim in front of an entire community through social media. Nowadays, multiple works aim at detecting acts of cyberbullying via the a...
| Author: | Cuzcano Chavez, Ximena Marianne |
|---|---|
| Format: | undergraduate thesis |
| Publication date: | 2020 |
| Institution: | Universidad de Lima |
| Repository: | ULIMA-Institucional |
| Language: | English |
| OAI Identifier: | oai:repositorio.ulima.edu.pe:20.500.12724/12718 |
| Resource link: | https://hdl.handle.net/20.500.12724/12718 |
| Access level: | open access |
| Subject: | Ciberacoso; Blogs; Acoso moral; Cyberbullying; Bullying; https://purl.org/pe-repo/ocde/ford#2.02.04 |
| Field | Value |
|---|---|
| id | RULI_ad01a9c2ac187a8712f32a65d1adcac9 |
| oai_identifier_str | oai:repositorio.ulima.edu.pe:20.500.12724/12718 |
| network_acronym_str | RULI |
| network_name_str | ULIMA-Institucional |
| repository_id_str | 3883 |
| dc.title.es_PE.fl_str_mv | A comparison of classification models to detect cyberbullying in the peruvian spanish language on Twitter |
| title | A comparison of classification models to detect cyberbullying in the peruvian spanish language on Twitter |
| spellingShingle | A comparison of classification models to detect cyberbullying in the peruvian spanish language on Twitter; Cuzcano Chavez, Ximena Marianne; Ciberacoso; Blogs; Acoso moral; Cyberbullying; Bullying; https://purl.org/pe-repo/ocde/ford#2.02.04 |
| title_short | A comparison of classification models to detect cyberbullying in the peruvian spanish language on Twitter |
| title_full | A comparison of classification models to detect cyberbullying in the peruvian spanish language on Twitter |
| title_fullStr | A comparison of classification models to detect cyberbullying in the peruvian spanish language on Twitter |
| title_full_unstemmed | A comparison of classification models to detect cyberbullying in the peruvian spanish language on Twitter |
| title_sort | A comparison of classification models to detect cyberbullying in the peruvian spanish language on Twitter |
| author | Cuzcano Chavez, Ximena Marianne |
| author_facet | Cuzcano Chavez, Ximena Marianne |
| author_role | author |
| dc.contributor.student.none.fl_str_mv | 1, OA, S |
| dc.contributor.advisor.fl_str_mv | Ayma Quirita, Víctor Hugo |
| dc.contributor.author.fl_str_mv | Cuzcano Chavez, Ximena Marianne |
| dc.subject.es_PE.fl_str_mv | Ciberacoso; Blogs; Acoso moral |
| topic | Ciberacoso; Blogs; Acoso moral; Cyberbullying; Bullying; https://purl.org/pe-repo/ocde/ford#2.02.04 |
| dc.subject.en_EN.fl_str_mv | Cyberbullying; Bullying |
| dc.subject.ocde.none.fl_str_mv | https://purl.org/pe-repo/ocde/ford#2.02.04 |
| description | Cyberbullying is a social problem in which bullies’ actions are more harmful than in traditional forms of bullying, as they have the power to repeatedly humiliate the victim in front of an entire community through social media. Nowadays, multiple works aim at detecting acts of cyberbullying via the analysis of texts in social media publications written in one or more languages; however, few investigations target cyberbullying detection in the Spanish language. In this work, we aim to compare the performance of four traditional supervised machine learning methods in detecting cyberbullying via the identification of four cyberbullying-related categories in Twitter posts written in Peruvian Spanish. Specifically, we trained and tested the Naive Bayes, Multinomial Logistic Regression, Support Vector Machine, and Random Forest classifiers on a dataset manually annotated with the help of human participants. The results indicate that the best-performing classifier for the cyberbullying detection task was the Support Vector Machine. |
| publishDate | 2020 |
| dc.date.accessioned.none.fl_str_mv | 2021-03-16T22:42:34Z |
| dc.date.available.none.fl_str_mv | 2021-03-16T22:42:34Z |
| dc.date.issued.fl_str_mv | 2020 |
| dc.type.none.fl_str_mv | info:eu-repo/semantics/bachelorThesis |
| dc.type.other.none.fl_str_mv | Tesis |
| format | bachelorThesis |
| dc.identifier.citation.es_PE.fl_str_mv | Cuzcano Chavez, X. M. (2020). A comparison of classification models to detect cyberbullying in the peruvian spanish language on Twitter [Tesis para optar el Título Profesional de Ingeniero de Sistemas, Universidad de Lima]. Repositorio institucional de la Universidad de Lima. https://hdl.handle.net/20.500.12724/12718 |
| dc.identifier.uri.none.fl_str_mv | https://hdl.handle.net/20.500.12724/12718 |
| identifier_str_mv | Cuzcano Chavez, X. M. (2020). A comparison of classification models to detect cyberbullying in the peruvian spanish language on Twitter [Tesis para optar el Título Profesional de Ingeniero de Sistemas, Universidad de Lima]. Repositorio institucional de la Universidad de Lima. https://hdl.handle.net/20.500.12724/12718 |
| url | https://hdl.handle.net/20.500.12724/12718 |
| dc.language.iso.none.fl_str_mv | eng |
| language | eng |
| dc.relation.ispartof.fl_str_mv | SUNEDU |
| dc.rights.*.fl_str_mv | info:eu-repo/semantics/openAccess |
| dc.rights.uri.*.fl_str_mv | https://creativecommons.org/licenses/by-nc-sa/4.0/ |
| eu_rights_str_mv | openAccess |
| rights_invalid_str_mv | https://creativecommons.org/licenses/by-nc-sa/4.0/ |
| dc.format.none.fl_str_mv | application/pdf |
| dc.publisher.none.fl_str_mv | Universidad de Lima |
| dc.publisher.country.none.fl_str_mv | PE |
| publisher.none.fl_str_mv | Universidad de Lima |
| dc.source.none.fl_str_mv | Repositorio Institucional - Ulima Universidad de Lima reponame:ULIMA-Institucional instname:Universidad de Lima instacron:ULIMA |
| instname_str | Universidad de Lima |
| instacron_str | ULIMA |
| institution | ULIMA |
| reponame_str | ULIMA-Institucional |
| collection | ULIMA-Institucional |
| bitstream.url.fl_str_mv | https://repositorio.ulima.edu.pe/bitstream/20.500.12724/12718/2/license_rdf https://repositorio.ulima.edu.pe/bitstream/20.500.12724/12718/3/license.txt https://repositorio.ulima.edu.pe/bitstream/20.500.12724/12718/4/Cuzcano_Chavez_Ximena_Marianne.pdf.txt https://repositorio.ulima.edu.pe/bitstream/20.500.12724/12718/1/Cuzcano_Chavez_Ximena_Marianne.pdf https://repositorio.ulima.edu.pe/bitstream/20.500.12724/12718/5/Cuzcano_Chavez_Ximena_Marianne.pdf.jpg |
| bitstream.checksum.fl_str_mv | 8fc46f5e71650fd7adee84a69b9163c2 8a4605be74aa9ea9d79846c1fba20a33 55922830c07ebf43b624b05e428a83ed f055ed4ae71db76c256fdc90b93854de 97ad7eb5a01df78ee5e15c8d7114087a |
| bitstream.checksumAlgorithm.fl_str_mv | MD5 MD5 MD5 MD5 MD5 |
| repository.name.fl_str_mv | Repositorio Universidad de Lima |
| repository.mail.fl_str_mv | repositorio@ulima.edu.pe |
| _version_ | 1846611946286088192 |
| spelling | Título Profesional; Ingeniería de sistemas; Universidad de Lima. Facultad de Ingeniería y Arquitectura; Ingeniero de sistemas; https://orcid.org/0000-0002-0284-2610; 4502509561207676438232; https://purl.org/pe-repo/renati/level#tituloProfesional; Rodriguez-Rodriguez-Nadia-Katherine; Ramos-Ponce, Oscar-Efrai; Quintana-Cruz, Hernan-Alejandro; https://purl.org/pe-repo/renati/type#tesis; CC-LICENSE: license_rdf (application/rdf+xml; charset=utf-8, 1037); LICENSE: license.txt (text/plain; charset=utf-8, 1748); TEXT: Cuzcano_Chavez_Ximena_Marianne.pdf.txt (Extracted text, text/plain, 39283); ORIGINAL: Cuzcano_Chavez_Ximena_Marianne.pdf (application/pdf, 319520); THUMBNAIL: Cuzcano_Chavez_Ximena_Marianne.pdf.jpg (Generated Thumbnail, image/jpeg, 10320); 2025-09-17 13:54:53.794 |
| score | 13.0768795 |
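The description field in this record names Naive Bayes as one of the four classifiers compared across four cyberbullying-related categories. The following is a minimal sketch of that one baseline only, written in plain Python so that it runs without extra packages. It is not the thesis code: the record does not state the features, preprocessing, category names, or libraries used, so the whitespace tokenizer, the labels (`insult`, `threat`, `exclusion`, `neutral`), and the toy sentences below are all hypothetical and invented for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


class MultinomialNaiveBayes:
    """Bag-of-words multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # token counts per class
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        self.total_docs = len(labels)
        return self

    def predict(self, text):
        tokens = [t for t in tokenize(text) if t in self.vocab]  # drop unseen words
        vocab_size = len(self.vocab)
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / self.total_docs)
            total_words = sum(self.word_counts[label].values())
            for token in tokens:
                count = self.word_counts[label][token]
                score += math.log((count + 1) / (total_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def accuracy(model, examples):
    """Fraction of (text, label) pairs the model classifies correctly."""
    correct = sum(model.predict(text) == label for text, label in examples)
    return correct / len(examples)


# Hypothetical toy data; the real dataset and its categories are not in this record.
TRAIN = [
    ("eres un tonto", "insult"),
    ("que tonto eres siempre", "insult"),
    ("te voy a encontrar", "threat"),
    ("ya veras te voy a buscar", "threat"),
    ("nadie te quiere en el grupo", "exclusion"),
    ("fuera del grupo nadie te invito", "exclusion"),
    ("buen partido hoy en lima", "neutral"),
    ("hoy hace buen clima en lima", "neutral"),
]
TEST = [
    ("eres muy tonto", "insult"),
    ("te voy a buscar luego", "threat"),
    ("nadie te invito al grupo", "exclusion"),
    ("hoy juega lima", "neutral"),
]

model = MultinomialNaiveBayes().fit(*zip(*TRAIN))
print("held-out accuracy:", accuracy(model, TEST))
```

A comparison like the one the description reports would train the other three classifiers on the same split and score them with the same metric; which metric the thesis used is not stated in this record.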
Important note:
The information contained in this record is the sole responsibility of the institution that manages the institutional repository where this document or dataset is held. CONCYTEC is not responsible for the content (publications and/or data) accessible through the National Digital Repository of Science, Technology and Innovation of Open Access (ALICIA).