Application of KNN algorithm for predicting celiac disease using clinical and serological variables
Descripción del Articulo
Celiac disease is an autoimmune condition with a global prevalence close to 1%, often underdiagnosed due to low clinical suspicion, which increases both morbidity and mortality. In this context, the application of the K-Nearest Neighbors (KNN) algorithm emerged as a predictive model to support the d...
| Autores: | , , , , , |
|---|---|
| Formato: | artículo |
| Fecha de Publicación: | 2025 |
| Institución: | Universidad La Salle |
| Repositorio: | Revistas - Universidad La Salle |
| Lenguaje: | español |
| OAI Identifier: | oai:ojs.revistas.ulasalle.edu.pe:article/311 |
| Enlace del recurso: | https://revistas.ulasalle.edu.pe/innosoft/article/view/311 https://doi.org/10.48168/innosoft.s24.a311 https://purl.org/42411/s24/a311 https://n2t.net/ark:/42411/s24/a311 |
| Nivel de acceso: | acceso abierto |
| Materia: | autoimmune Django desease KNN prediction autoinmune enfermedad predicción |
| id |
REVUSALLE_6f5e76517f30cc1bb4e375dc2f70fcca |
|---|---|
| oai_identifier_str |
oai:ojs.revistas.ulasalle.edu.pe:article/311 |
| network_acronym_str |
REVUSALLE |
| network_name_str |
Revistas - Universidad La Salle |
| repository_id_str |
|
| spelling |
Application of KNN algorithm for predicting celiac disease using clinical and serological variablesAplicación del algoritmo KNN para la predicción de enfermedad celíaca utilizando variables clínicas y serológicas Levano, DanielCerdán León, Flor ElizabethSalazar Giraldo, Cesar RolandoVasquez Castro, Jadira DinaCarbajal Bazán, Marita Abigail Zea Mendoza, Aldana CamilaautoimmuneDjangodeseaseKNNpredictionautoinmuneDjangoenfermedadKNNpredicciónCeliac disease is an autoimmune condition with a global prevalence close to 1%, often underdiagnosed due to low clinical suspicion, which increases both morbidity and mortality. In this context, the application of the K-Nearest Neighbors (KNN) algorithm emerged as a predictive model to support the detection of this disease using clinical and serological variables. A supervised model was developed using the KNN algorithm and clinical and serological data extracted from an academic dataset containing 2,206 records. To address class imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) was applied. The data were split for training and validation, optimizing the classification parameter through cross-validation. In addition, a web platform was developed to support data input, analysis, and output, allowing the uploading, processing, and generation of medical reports with role-based access and diagnostic probability estimation. The model achieved 94% accuracy, 97% precision, and 91% sensitivity. The algorithm proved to be effective for predicting celiac disease based on clinical and serological data, and its web-based implementation enables practical integration in clinical environments.La enfermedad celíaca corresponde a una condición autoinmune con una prevalencia cercana al 1% a nivel global, frecuentemente subdiagnosticada debido a la escasa sospecha clínica, lo que incrementa su morbilidad y mortalidad. En este contexto, la aplicacion del algoritmo K-Nearest Neighbors (KNN) surgió como un modelo predictivo para contribuir a la detección de esta enfermedad mediante variables clínicas y serológicas. Se diseñó un modelo supervisado con el algoritmo KNN utilizando variables clínicas y serológicas extraídas de una base de datos académica de 2,206 registros. Para balancear las clases, se aplicó la técnica de sobremuestreo sintético (SMOTE). Los datos fueron segmentados para entrenamiento y validación, optimizando el parámetro de clasificación mediante validación cruzada. Además, se desarrolló una plataforma web diseñada para admitir el ingreso, análisis y emisión que permite la carga, procesamiento y generación de reportes médicos con acceso por roles y estimación de probabilidad diagnóstica. Este modelo alcanzó una exactitud del 94%, una precisión del 97 % y una sensibilidad del 91 %. El algoritmo demostró ser útil para la predicción de la enfermedad celíaca a partir de datos clínicos y serológicos, y su implementación en la web permite su integración práctica en entornos clínicos.Universidad La Salle2025-09-30info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionJournal paperArtículos originalesapplication/pdftext/htmlhttps://revistas.ulasalle.edu.pe/innosoft/article/view/311https://doi.org/10.48168/innosoft.s24.a311https://purl.org/42411/s24/a311https://n2t.net/ark:/42411/s24/a311Innovation and Software; Vol 6 No 2 (2025): September - February; 74-89Innovación y Software; Vol. 6 Núm. 2 (2025): Septiembre - Febrero; 74-892708-09352708-0927https://doi.org/10.48168/innosoft.s24https://purl.org/42411/s24https://n2t.net/ark:/42411/s24reponame:Revistas - Universidad La Salleinstname:Universidad La Salleinstacron:USALLEspahttps://revistas.ulasalle.edu.pe/innosoft/article/view/311/405https://revistas.ulasalle.edu.pe/innosoft/article/view/311/406Derechos de autor 2026 Innovación y Softwarehttps://creativecommons.org/licenses/by/4.0info:eu-repo/semantics/openAccessoai:ojs.revistas.ulasalle.edu.pe:article/3112026-03-09T08:00:15Z |
| dc.title.none.fl_str_mv |
Application of KNN algorithm for predicting celiac disease using clinical and serological variables Aplicación del algoritmo KNN para la predicción de enfermedad celíaca utilizando variables clínicas y serológicas |
| title |
Application of KNN algorithm for predicting celiac disease using clinical and serological variables |
| spellingShingle |
Application of KNN algorithm for predicting celiac disease using clinical and serological variables Levano, Daniel autoimmune Django desease KNN prediction autoinmune Django enfermedad KNN predicción |
| title_short |
Application of KNN algorithm for predicting celiac disease using clinical and serological variables |
| title_full |
Application of KNN algorithm for predicting celiac disease using clinical and serological variables |
| title_fullStr |
Application of KNN algorithm for predicting celiac disease using clinical and serological variables |
| title_full_unstemmed |
Application of KNN algorithm for predicting celiac disease using clinical and serological variables |
| title_sort |
Application of KNN algorithm for predicting celiac disease using clinical and serological variables |
| dc.creator.none.fl_str_mv |
Levano, Daniel Cerdán León, Flor Elizabeth Salazar Giraldo, Cesar Rolando Vasquez Castro, Jadira Dina Carbajal Bazán, Marita Abigail Zea Mendoza, Aldana Camila |
| author |
Levano, Daniel |
| author_facet |
Levano, Daniel Cerdán León, Flor Elizabeth Salazar Giraldo, Cesar Rolando Vasquez Castro, Jadira Dina Carbajal Bazán, Marita Abigail Zea Mendoza, Aldana Camila |
| author_role |
author |
| author2 |
Cerdán León, Flor Elizabeth Salazar Giraldo, Cesar Rolando Vasquez Castro, Jadira Dina Carbajal Bazán, Marita Abigail Zea Mendoza, Aldana Camila |
| author2_role |
author author author author author |
| dc.subject.none.fl_str_mv |
autoimmune Django desease KNN prediction autoinmune Django enfermedad KNN predicción |
| topic |
autoimmune Django desease KNN prediction autoinmune Django enfermedad KNN predicción |
| description |
Celiac disease is an autoimmune condition with a global prevalence close to 1%, often underdiagnosed due to low clinical suspicion, which increases both morbidity and mortality. In this context, the application of the K-Nearest Neighbors (KNN) algorithm emerged as a predictive model to support the detection of this disease using clinical and serological variables. A supervised model was developed using the KNN algorithm and clinical and serological data extracted from an academic dataset containing 2,206 records. To address class imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) was applied. The data were split for training and validation, optimizing the classification parameter through cross-validation. In addition, a web platform was developed to support data input, analysis, and output, allowing the uploading, processing, and generation of medical reports with role-based access and diagnostic probability estimation. The model achieved 94% accuracy, 97% precision, and 91% sensitivity. The algorithm proved to be effective for predicting celiac disease based on clinical and serological data, and its web-based implementation enables practical integration in clinical environments. |
| publishDate |
2025 |
| dc.date.none.fl_str_mv |
2025-09-30 |
| dc.type.none.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion Journal paper Artículos originales |
| format |
article |
| status_str |
publishedVersion |
| dc.identifier.none.fl_str_mv |
https://revistas.ulasalle.edu.pe/innosoft/article/view/311 https://doi.org/10.48168/innosoft.s24.a311 https://purl.org/42411/s24/a311 https://n2t.net/ark:/42411/s24/a311 |
| url |
https://revistas.ulasalle.edu.pe/innosoft/article/view/311 https://doi.org/10.48168/innosoft.s24.a311 https://purl.org/42411/s24/a311 https://n2t.net/ark:/42411/s24/a311 |
| dc.language.none.fl_str_mv |
spa |
| language |
spa |
| dc.relation.none.fl_str_mv |
https://revistas.ulasalle.edu.pe/innosoft/article/view/311/405 https://revistas.ulasalle.edu.pe/innosoft/article/view/311/406 |
| dc.rights.none.fl_str_mv |
Derechos de autor 2026 Innovación y Software https://creativecommons.org/licenses/by/4.0 info:eu-repo/semantics/openAccess |
| rights_invalid_str_mv |
Derechos de autor 2026 Innovación y Software https://creativecommons.org/licenses/by/4.0 |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
application/pdf text/html |
| dc.publisher.none.fl_str_mv |
Universidad La Salle |
| publisher.none.fl_str_mv |
Universidad La Salle |
| dc.source.none.fl_str_mv |
Innovation and Software; Vol 6 No 2 (2025): September - February; 74-89 Innovación y Software; Vol. 6 Núm. 2 (2025): Septiembre - Febrero; 74-89 2708-0935 2708-0927 https://doi.org/10.48168/innosoft.s24 https://purl.org/42411/s24 https://n2t.net/ark:/42411/s24 reponame:Revistas - Universidad La Salle instname:Universidad La Salle instacron:USALLE |
| instname_str |
Universidad La Salle |
| instacron_str |
USALLE |
| institution |
USALLE |
| reponame_str |
Revistas - Universidad La Salle |
| collection |
Revistas - Universidad La Salle |
| repository.name.fl_str_mv |
|
| repository.mail.fl_str_mv |
|
| _version_ |
1864286165097512960 |
| score |
13.118823 |
Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).