Evolución de la IA en la resolución de problemas contables: Comparación entre GPT4 y GPT4o en el Examen de Competencia Contable
Descripción del Articulo
The research aimed to evaluate the performance of the GPT-4o model compared to the GPT-4 model in solving questions on the Accounting Proficiency Exam. This study is grounded in the concept of natural language processing (PLN), as discussed by Brown et al. (2020). The research was conducted using th...
| Autores: | , , , |
|---|---|
| Formato: | artículo |
| Fecha de Publicación: | 2025 |
| Institución: | Pontificia Universidad Católica del Perú |
| Repositorio: | PUCP-Institucional |
| Lenguaje: | portugués |
| OAI Identifier: | oai:repositorio.pucp.edu.pe:20.500.14657/205111 |
| Enlace del recurso: | https://revistas.pucp.edu.pe/index.php/contabilidadyNegocios/article/view/31895/28006 http://hdl.handle.net/20.500.14657/205111 https://doi.org/10.18800/contabilidad.2025ESP.005 |
| Nivel de acceso: | acceso abierto |
| Materia: | Use of technologies ChatGPT4o Large language models (LLMs) Uso de tecnologías Modelos de lenguaje extensos (LLMs) Uso de tecnologias Large Language Models https://purl.org/pe-repo/ocde/ford#5.02.04 |
| Sumario: | The research aimed to evaluate the performance of the GPT-4o model compared to the GPT-4 model in solving questions on the Accounting Proficiency Exam. This study is grounded in the concept of natural language processing (PLN), as discussed by Brown et al. (2020). The research was conducted using the design science methodology, which aims to build and/or evaluate different technological artifacts by applying the proficiency exam questions to GPT-4o, using OpenAI's ChatGPT. While actual statistics for the Proficiency Exam show that only a portion of accountants pass, the results of artificial intelligence (AI) showed that all four editions evaluated passed with at least a 64% success rate. Overall, across the sample analyzed, the GPT-4o AI model achieved 77% accuracy compared to 71% for the GPT-4 model, achieving 84% accuracy in the last two exams using the most recent model. However, on some questions that the GPT-4 model had previously answered correctly, the more recent model ended up answering incorrectly or differently than expected by the question developers. These results contribute to the literature studying the use of AI in accounting, particularly the application of natural language processing models and large language models (LLMs). |
|---|
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).