Data Extraction, Visualization, and Prediction Through Natural Language Processing

Descripción del Articulo

This study presents Datalyzer, a system designed for data extraction, visualization, and prediction in the mining sector using advanced NLP and machine learning, specifically GPT-3.S Turbo. The system enhances operational efficiency through rigorous data preprocessing and specialized fine-tuning, va...

Descripción completa

Detalles Bibliográficos
Autores: Alvarado, Carlos, Velásquez, Gabriel, Mauricio, David
Formato: artículo
Fecha de Publicación:2024
Institución:Universidad Peruana de Ciencias Aplicadas
Repositorio:UPC-Institucional
Lenguaje:inglés
OAI Identifier:oai:repositorioacademico.upc.edu.pe:10757/676028
Enlace del recurso:http://hdl.handle.net/10757/676028
Nivel de acceso:acceso embargado
Materia:Artificial Intelligence
Data Visualization
NLP
Predictive Analytics
id UUPC_6f6654111c7a7b54caf247045d93ee43
oai_identifier_str oai:repositorioacademico.upc.edu.pe:10757/676028
network_acronym_str UUPC
network_name_str UPC-Institucional
repository_id_str 2670
dc.title.es_PE.fl_str_mv Data Extraction, Visualization, and Prediction Through Natural Language Processing
title Data Extraction, Visualization, and Prediction Through Natural Language Processing
spellingShingle Data Extraction, Visualization, and Prediction Through Natural Language Processing
Alvarado, Carlos
Artificial Intelligence
Data Visualization
NLP
Predictive Analytics
title_short Data Extraction, Visualization, and Prediction Through Natural Language Processing
title_full Data Extraction, Visualization, and Prediction Through Natural Language Processing
title_fullStr Data Extraction, Visualization, and Prediction Through Natural Language Processing
title_full_unstemmed Data Extraction, Visualization, and Prediction Through Natural Language Processing
title_sort Data Extraction, Visualization, and Prediction Through Natural Language Processing
author Alvarado, Carlos
author_facet Alvarado, Carlos
Velásquez, Gabriel
Mauricio, David
author_role author
author2 Velásquez, Gabriel
Mauricio, David
author2_role author
author
dc.contributor.author.fl_str_mv Alvarado, Carlos
Velásquez, Gabriel
Mauricio, David
dc.subject.es_PE.fl_str_mv Artificial Intelligence
Data Visualization
NLP
Predictive Analytics
topic Artificial Intelligence
Data Visualization
NLP
Predictive Analytics
description This study presents Datalyzer, a system designed for data extraction, visualization, and prediction in the mining sector using advanced NLP and machine learning, specifically GPT-3.S Turbo. The system enhances operational efficiency through rigorous data preprocessing and specialized fine-tuning, validated on a simulated mining dataset. Results show significant improvements: data extraction time reduced by 94 % and visualization time by 97.6%. These improvements indicate a transformation in efficiency, usability, and user satisfaction. Despite limitations in data variability and complexity, this pioneering approach highlights the potential of NLP and machine learning in modernizing the mining industry and supporting data-driven decision-making.
publishDate 2024
dc.date.accessioned.none.fl_str_mv 2024-10-06T11:26:29Z
dc.date.available.none.fl_str_mv 2024-10-06T11:26:29Z
dc.date.issued.fl_str_mv 2024-01-01
dc.type.es_PE.fl_str_mv info:eu-repo/semantics/article
format article
dc.identifier.doi.none.fl_str_mv 10.1109/COINS61597.2024.10622130
dc.identifier.uri.none.fl_str_mv http://hdl.handle.net/10757/676028
dc.identifier.journal.es_PE.fl_str_mv 2024 IEEE International Conference on Omni-Layer Intelligent Systems, COINS 2024
dc.identifier.eid.none.fl_str_mv 2-s2.0-85202558415
dc.identifier.scopusid.none.fl_str_mv SCOPUS_ID:85202558415
identifier_str_mv 10.1109/COINS61597.2024.10622130
2024 IEEE International Conference on Omni-Layer Intelligent Systems, COINS 2024
2-s2.0-85202558415
SCOPUS_ID:85202558415
url http://hdl.handle.net/10757/676028
dc.language.iso.es_PE.fl_str_mv eng
language eng
dc.rights.es_PE.fl_str_mv info:eu-repo/semantics/embargoedAccess
eu_rights_str_mv embargoedAccess
dc.format.es_PE.fl_str_mv application/html
dc.publisher.es_PE.fl_str_mv Institute of Electrical and Electronics Engineers Inc.
dc.source.none.fl_str_mv reponame:UPC-Institucional
instname:Universidad Peruana de Ciencias Aplicadas
instacron:UPC
instname_str Universidad Peruana de Ciencias Aplicadas
instacron_str UPC
institution UPC
reponame_str UPC-Institucional
collection UPC-Institucional
dc.source.journaltitle.none.fl_str_mv 2024 IEEE International Conference on Omni-Layer Intelligent Systems, COINS 2024
bitstream.url.fl_str_mv https://repositorioacademico.upc.edu.pe/bitstream/10757/676028/1/license.txt
bitstream.checksum.fl_str_mv 8a4605be74aa9ea9d79846c1fba20a33
bitstream.checksumAlgorithm.fl_str_mv MD5
repository.name.fl_str_mv Repositorio académico upc
repository.mail.fl_str_mv upc@openrepository.com
_version_ 1846066052408016896
spelling 937dd46d050aac267b6115b86c212c5375bfcc73c7b0845615effc9e0697b68ac63d9ae1b7e9e0a8b9f5fffff3be59c0Alvarado, CarlosVelásquez, GabrielMauricio, David2024-10-06T11:26:29Z2024-10-06T11:26:29Z2024-01-0110.1109/COINS61597.2024.10622130http://hdl.handle.net/10757/6760282024 IEEE International Conference on Omni-Layer Intelligent Systems, COINS 20242-s2.0-85202558415SCOPUS_ID:85202558415This study presents Datalyzer, a system designed for data extraction, visualization, and prediction in the mining sector using advanced NLP and machine learning, specifically GPT-3.S Turbo. The system enhances operational efficiency through rigorous data preprocessing and specialized fine-tuning, validated on a simulated mining dataset. Results show significant improvements: data extraction time reduced by 94 % and visualization time by 97.6%. These improvements indicate a transformation in efficiency, usability, and user satisfaction. Despite limitations in data variability and complexity, this pioneering approach highlights the potential of NLP and machine learning in modernizing the mining industry and supporting data-driven decision-making.application/htmlengInstitute of Electrical and Electronics Engineers Inc.info:eu-repo/semantics/embargoedAccessArtificial IntelligenceData VisualizationNLPPredictive AnalyticsData Extraction, Visualization, and Prediction Through Natural Language Processinginfo:eu-repo/semantics/article2024 IEEE International Conference on Omni-Layer Intelligent Systems, COINS 2024reponame:UPC-Institucionalinstname:Universidad Peruana de Ciencias Aplicadasinstacron:UPCLICENSElicense.txtlicense.txttext/plain; charset=utf-81748https://repositorioacademico.upc.edu.pe/bitstream/10757/676028/1/license.txt8a4605be74aa9ea9d79846c1fba20a33MD51false10757/676028oai:repositorioacademico.upc.edu.pe:10757/6760282024-10-06 11:26:31.991Repositorio académico upcupc@openrepository.comTk9URTogUExBQ0UgWU9VUiBPV04gTElDRU5TRSBIRVJFClRoaXMgc2FtcGxlIGxpY2Vuc2UgaXMgcHJvdmlkZWQgZm9yIGluZm9ybWF0aW9uYWwgcHVycG9zZXMgb25seS4KCk5PTi1FWENMVVNJVkUgRElTVFJJQlVUSU9OIExJQ0VOU0UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0Cm93bmVyKSBncmFudHMgdG8gRFNwYWNlIFVuaXZlcnNpdHkgKERTVSkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLAp0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZwp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBEU1UgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlCnN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgRFNVIG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yCnB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZQp0aGUgcmlnaHQgdG8gZ3JhbnQgdGhlIHJpZ2h0cyBjb250YWluZWQgaW4gdGhpcyBsaWNlbnNlLiBZb3UgYWxzbyByZXByZXNlbnQKdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCBpbmZyaW5nZSB1cG9uCmFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LAp5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZQpjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgRFNVIHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdApzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkCndpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLgoKSUYgVEhFIFNVQk1JU1NJT04gSVMgQkFTRUQgVVBPTiBXT1JLIFRIQVQgSEFTIEJFRU4gU1BPTlNPUkVEIE9SIFNVUFBPUlRFRApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gRFNVLCBZT1UgUkVQUkVTRU5UIFRIQVQgWU9VIEhBVkUKRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgQlkgU1VDSApDT05UUkFDVCBPUiBBR1JFRU1FTlQuCgpEU1Ugd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZShzKSBhcyB0aGUgYXV0aG9yKHMpIG9yIG93bmVyKHMpIG9mIHRoZQpzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMKbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgo=
score 13.932913
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).