Evolución de la IA en la resolución de problemas contables: Comparación entre GPT4 y GPT4o en el Examen de Competencia Contable

Descripción del Articulo

The research aimed to evaluate the performance of the GPT-4o model compared to the GPT-4 model in solving questions on the Accounting Proficiency Exam. This study is grounded in the concept of natural language processing (PLN), as discussed by Brown et al. (2020). The research was conducted using th...

Descripción completa

Detalles Bibliográficos
Autores: Machado de Freitas, Marcelo, Dutra Sallaberry, Jonatas, Bruno de Jesus Silva, Thiago, Silva da Rosa, Fabricia
Formato: artículo
Fecha de Publicación:2025
Institución:Pontificia Universidad Católica del Perú
Repositorio:PUCP-Institucional
Lenguaje:portugués
OAI Identifier:oai:repositorio.pucp.edu.pe:20.500.14657/205111
Enlace del recurso:https://revistas.pucp.edu.pe/index.php/contabilidadyNegocios/article/view/31895/28006
http://hdl.handle.net/20.500.14657/205111
https://doi.org/10.18800/contabilidad.2025ESP.005
Nivel de acceso:acceso abierto
Materia:Use of technologies
ChatGPT4o
Large language models (LLMs)
Uso de tecnologías
Modelos de lenguaje extensos (LLMs)
Uso de tecnologias
Large Language Models
https://purl.org/pe-repo/ocde/ford#5.02.04
Descripción
Sumario:The research aimed to evaluate the performance of the GPT-4o model compared to the GPT-4 model in solving questions on the Accounting Proficiency Exam. This study is grounded in the concept of natural language processing (PLN), as discussed by Brown et al. (2020). The research was conducted using the design science methodology, which aims to build and/or evaluate different technological artifacts by applying the proficiency exam questions to GPT-4o, using OpenAI's ChatGPT. While actual statistics for the Proficiency Exam show that only a portion of accountants pass, the results of artificial intelligence (AI) showed that all four editions evaluated passed with at least a 64% success rate. Overall, across the sample analyzed, the GPT-4o AI model achieved 77% accuracy compared to 71% for the GPT-4 model, achieving 84% accuracy in the last two exams using the most recent model. However, on some questions that the GPT-4 model had previously answered correctly, the more recent model ended up answering incorrectly or differently than expected by the question developers. These results contribute to the literature studying the use of AI in accounting, particularly the application of natural language processing models and large language models (LLMs).
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).