Data Extraction, Visualization, and Prediction Through Natural Language Processing
Article Description
This study presents Datalyzer, a system designed for data extraction, visualization, and prediction in the mining sector using advanced NLP and machine learning, specifically GPT-3.5 Turbo. The system enhances operational efficiency through rigorous data preprocessing and specialized fine-tuning, va...
| Authors: | Alvarado, Carlos; Velásquez, Gabriel; Mauricio, David |
|---|---|
| Format: | article |
| Publication date: | 2024 |
| Institution: | Universidad Peruana de Ciencias Aplicadas |
| Repository: | UPC-Institucional |
| Language: | English |
| OAI Identifier: | oai:repositorioacademico.upc.edu.pe:10757/676028 |
| Resource link: | http://hdl.handle.net/10757/676028 |
| Access level: | embargoed access |
| Subject: | Artificial Intelligence; Data Visualization; NLP; Predictive Analytics |
| id | UUPC_6f6654111c7a7b54caf247045d93ee43 |
|---|---|
| oai_identifier_str | oai:repositorioacademico.upc.edu.pe:10757/676028 |
| network_acronym_str | UUPC |
| network_name_str | UPC-Institucional |
| repository_id_str | 2670 |
| dc.title.es_PE.fl_str_mv | Data Extraction, Visualization, and Prediction Through Natural Language Processing |
| title | Data Extraction, Visualization, and Prediction Through Natural Language Processing |
| spellingShingle | Data Extraction, Visualization, and Prediction Through Natural Language Processing Alvarado, Carlos Artificial Intelligence Data Visualization NLP Predictive Analytics |
| title_short | Data Extraction, Visualization, and Prediction Through Natural Language Processing |
| title_full | Data Extraction, Visualization, and Prediction Through Natural Language Processing |
| title_fullStr | Data Extraction, Visualization, and Prediction Through Natural Language Processing |
| title_full_unstemmed | Data Extraction, Visualization, and Prediction Through Natural Language Processing |
| title_sort | Data Extraction, Visualization, and Prediction Through Natural Language Processing |
| author | Alvarado, Carlos |
| author_facet | Alvarado, Carlos; Velásquez, Gabriel; Mauricio, David |
| author_role | author |
| author2 | Velásquez, Gabriel; Mauricio, David |
| author2_role | author; author |
| dc.contributor.author.fl_str_mv | Alvarado, Carlos; Velásquez, Gabriel; Mauricio, David |
| dc.subject.es_PE.fl_str_mv | Artificial Intelligence; Data Visualization; NLP; Predictive Analytics |
| topic | Artificial Intelligence; Data Visualization; NLP; Predictive Analytics |
| description | This study presents Datalyzer, a system designed for data extraction, visualization, and prediction in the mining sector using advanced NLP and machine learning, specifically GPT-3.5 Turbo. The system enhances operational efficiency through rigorous data preprocessing and specialized fine-tuning, validated on a simulated mining dataset. Results show significant improvements: data extraction time reduced by 94% and visualization time by 97.6%. These improvements indicate a transformation in efficiency, usability, and user satisfaction. Despite limitations in data variability and complexity, this pioneering approach highlights the potential of NLP and machine learning in modernizing the mining industry and supporting data-driven decision-making. |
| publishDate | 2024 |
| dc.date.accessioned.none.fl_str_mv | 2024-10-06T11:26:29Z |
| dc.date.available.none.fl_str_mv | 2024-10-06T11:26:29Z |
| dc.date.issued.fl_str_mv | 2024-01-01 |
| dc.type.es_PE.fl_str_mv | info:eu-repo/semantics/article |
| format | article |
| dc.identifier.doi.none.fl_str_mv | 10.1109/COINS61597.2024.10622130 |
| dc.identifier.uri.none.fl_str_mv | http://hdl.handle.net/10757/676028 |
| dc.identifier.journal.es_PE.fl_str_mv | 2024 IEEE International Conference on Omni-Layer Intelligent Systems, COINS 2024 |
| dc.identifier.eid.none.fl_str_mv | 2-s2.0-85202558415 |
| dc.identifier.scopusid.none.fl_str_mv | SCOPUS_ID:85202558415 |
| identifier_str_mv | 10.1109/COINS61597.2024.10622130; 2024 IEEE International Conference on Omni-Layer Intelligent Systems, COINS 2024; 2-s2.0-85202558415; SCOPUS_ID:85202558415 |
| url | http://hdl.handle.net/10757/676028 |
| dc.language.iso.es_PE.fl_str_mv | eng |
| language | eng |
| dc.rights.es_PE.fl_str_mv | info:eu-repo/semantics/embargoedAccess |
| eu_rights_str_mv | embargoedAccess |
| dc.format.es_PE.fl_str_mv | application/html |
| dc.publisher.es_PE.fl_str_mv | Institute of Electrical and Electronics Engineers Inc. |
| dc.source.none.fl_str_mv | reponame:UPC-Institucional instname:Universidad Peruana de Ciencias Aplicadas instacron:UPC |
| instname_str | Universidad Peruana de Ciencias Aplicadas |
| instacron_str | UPC |
| institution | UPC |
| reponame_str | UPC-Institucional |
| collection | UPC-Institucional |
| dc.source.journaltitle.none.fl_str_mv | 2024 IEEE International Conference on Omni-Layer Intelligent Systems, COINS 2024 |
| bitstream.url.fl_str_mv | https://repositorioacademico.upc.edu.pe/bitstream/10757/676028/1/license.txt |
| bitstream.checksum.fl_str_mv | 8a4605be74aa9ea9d79846c1fba20a33 |
| bitstream.checksumAlgorithm.fl_str_mv | MD5 |
| repository.name.fl_str_mv | Repositorio académico upc |
| repository.mail.fl_str_mv | upc@openrepository.com |
| _version_ | 1846066052408016896 |
| score | 13.932913 |
Important note:
The information contained in this record is the sole responsibility of the institution that manages the institutional repository where this document or dataset is held. CONCYTEC is not responsible for the content (publications and/or data) accessible through the Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).