Random Forests as an extension of the classification trees with the R and Python programs

Medina-Merino, Rosa Fátima; Ñique-Chacón, Carmen Ismelda

Random Forests as an extension of the classification trees with the R and Python programs

Descripción del Articulo

This article presents the application of the non-parametric Random Forest method through supervised learning, as an extension of classification trees. The Random Forest algorithm arises as the grouping of several classification trees. Basically it randomly selects a number of variables with which ea...

Descripción completa

Detalles Bibliográficos
Autores:	Medina-Merino, Rosa Fátima, Ñique-Chacón, Carmen Ismelda
Formato:	artículo
Fecha de Publicación:	2017
Institución:	Universidad de Lima
Repositorio:	Revistas - Universidad de Lima
Lenguaje:	español
OAI Identifier:	oai:revistas.ulima.edu.pe:article/1775
Enlace del recurso:	https://revistas.ulima.edu.pe/index.php/Interfases/article/view/1775
Nivel de acceso:	acceso abierto
Materia:	Random Forest classification trees non-parametric classification models supervised learning R language Python language Bosques aleatorios árboles de clasificación modelos no paramétricos de clasificación aprendizaje supervisado lenguaje R lenguaje Python

id	REVULIMA_4f0f45ea7055e90694c45502e512f957
oai_identifier_str	oai:revistas.ulima.edu.pe:article/1775
network_acronym_str	REVULIMA
network_name_str	Revistas - Universidad de Lima
repository_id_str
spelling	Random Forests as an extension of the classification trees with the R and Python programsBosques aleatorios como extensión de los árboles de clasificación con los programas R y PythonMedina-Merino, Rosa FátimaÑique-Chacón, Carmen IsmeldaRandom Forestclassification treesnon-parametric classification modelssupervised learningR languagePython languageBosques aleatoriosárboles de clasificaciónmodelos no paramétricos de clasificaciónaprendizaje supervisadolenguaje Rlenguaje PythonThis article presents the application of the non-parametric Random Forest method through supervised learning, as an extension of classification trees. The Random Forest algorithm arises as the grouping of several classification trees. Basically it randomly selects a number of variables with which each individual tree is constructed and predictions are made with these variables that will later be weighted through the calculation of the most voted class of these trees that were generated, to finally do the prediction by Random Forest. For the application, we worked with 3168 recorded voices, for which the results of an acoustic analysis are presented, registering variables such as frequency, spectrum, modulation, among others, seeking to obtain a pattern of identification and classification according to gender through a voice identifier. The data record used is in open access and can be downloaded from the Kaggle web platform via <https://www.kaggle.com/primaryobjects/voicegende>r. For the development of the algorithm’s model, the statistical program R was used. Additionally, applications were made with Python by the development of classification algorithms.El presente artículo presenta la aplicación del método no paramétrico Random Forest mediante el aprendizajesupervisado, como una extensión de los árboles de clasificación. El algoritmo de Random Forest surge como la agrupación de varios árboles de clasificación; básicamente selecciona de manera aleatoria una cantidad de variablescon las cuales se construye cada uno de los árboles individuales, y se realizan predicciones con estas variables que posteriormente serán ponderadas a través del cálculo de la clase más votada de los árboles que se generaron, para finalmente hacer la predicción por Random Forest. Para la aplicación se trabajó con 3168 registros de voz grabados, para los cuales se presentan los resultados de un análisis acústico, registrándose variables tales como frecuencia, espectro, modulación, entre otras, con lo cual se busca obtener un patrón de identificación y clasificación según género a través de un identificador de voz. El registro de datos utilizado es de acceso libre y puede ser descargado desde la plataforma web de Kaggle a través del enlace <https://www.kaggle.com/primaryobjects/voicegender>. Para el desarrollo del algoritmo del modelo, se recurrió al programa estadístico R. Adicionalmente, se realizaron aplicaciones con Python mediante el desarrollo de algoritmos de clasificación.Universidad de Lima2017-12-18info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://revistas.ulima.edu.pe/index.php/Interfases/article/view/177510.26439/interfases2017.n10.1775Interfases; No. 010 (2017); 165-189Interfases; Núm. 010 (2017); 165-189Interfases; n. 010 (2017); 165-1891993-491210.26439/interfases2017.n10reponame:Revistas - Universidad de Limainstname:Universidad de Limainstacron:ULIMAspahttps://revistas.ulima.edu.pe/index.php/Interfases/article/view/1775/1828Derechos de autor 2018 Interfasesinfo:eu-repo/semantics/openAccessoai:revistas.ulima.edu.pe:article/17752023-07-24T13:32:20Z
dc.title.none.fl_str_mv	Random Forests as an extension of the classification trees with the R and Python programs Bosques aleatorios como extensión de los árboles de clasificación con los programas R y Python
title	Random Forests as an extension of the classification trees with the R and Python programs
spellingShingle	Random Forests as an extension of the classification trees with the R and Python programs Medina-Merino, Rosa Fátima Random Forest classification trees non-parametric classification models supervised learning R language Python language Bosques aleatorios árboles de clasificación modelos no paramétricos de clasificación aprendizaje supervisado lenguaje R lenguaje Python
title_short	Random Forests as an extension of the classification trees with the R and Python programs
title_full	Random Forests as an extension of the classification trees with the R and Python programs
title_fullStr	Random Forests as an extension of the classification trees with the R and Python programs
title_full_unstemmed	Random Forests as an extension of the classification trees with the R and Python programs
title_sort	Random Forests as an extension of the classification trees with the R and Python programs
dc.creator.none.fl_str_mv	Medina-Merino, Rosa Fátima Ñique-Chacón, Carmen Ismelda
author	Medina-Merino, Rosa Fátima
author_facet	Medina-Merino, Rosa Fátima Ñique-Chacón, Carmen Ismelda
author_role	author
author2	Ñique-Chacón, Carmen Ismelda
author2_role	author
dc.subject.none.fl_str_mv	Random Forest classification trees non-parametric classification models supervised learning R language Python language Bosques aleatorios árboles de clasificación modelos no paramétricos de clasificación aprendizaje supervisado lenguaje R lenguaje Python
topic	Random Forest classification trees non-parametric classification models supervised learning R language Python language Bosques aleatorios árboles de clasificación modelos no paramétricos de clasificación aprendizaje supervisado lenguaje R lenguaje Python
description	This article presents the application of the non-parametric Random Forest method through supervised learning, as an extension of classification trees. The Random Forest algorithm arises as the grouping of several classification trees. Basically it randomly selects a number of variables with which each individual tree is constructed and predictions are made with these variables that will later be weighted through the calculation of the most voted class of these trees that were generated, to finally do the prediction by Random Forest. For the application, we worked with 3168 recorded voices, for which the results of an acoustic analysis are presented, registering variables such as frequency, spectrum, modulation, among others, seeking to obtain a pattern of identification and classification according to gender through a voice identifier. The data record used is in open access and can be downloaded from the Kaggle web platform via <https://www.kaggle.com/primaryobjects/voicegende>r. For the development of the algorithm’s model, the statistical program R was used. Additionally, applications were made with Python by the development of classification algorithms.
publishDate	2017
dc.date.none.fl_str_mv	2017-12-18
dc.type.none.fl_str_mv	info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion
format	article
status_str	publishedVersion
dc.identifier.none.fl_str_mv	https://revistas.ulima.edu.pe/index.php/Interfases/article/view/1775 10.26439/interfases2017.n10.1775
url	https://revistas.ulima.edu.pe/index.php/Interfases/article/view/1775
identifier_str_mv	10.26439/interfases2017.n10.1775
dc.language.none.fl_str_mv	spa
language	spa
dc.relation.none.fl_str_mv	https://revistas.ulima.edu.pe/index.php/Interfases/article/view/1775/1828
dc.rights.none.fl_str_mv	Derechos de autor 2018 Interfases info:eu-repo/semantics/openAccess
rights_invalid_str_mv	Derechos de autor 2018 Interfases
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidad de Lima
publisher.none.fl_str_mv	Universidad de Lima
dc.source.none.fl_str_mv	Interfases; No. 010 (2017); 165-189 Interfases; Núm. 010 (2017); 165-189 Interfases; n. 010 (2017); 165-189 1993-4912 10.26439/interfases2017.n10 reponame:Revistas - Universidad de Lima instname:Universidad de Lima instacron:ULIMA
instname_str	Universidad de Lima
instacron_str	ULIMA
institution	ULIMA
reponame_str	Revistas - Universidad de Lima
collection	Revistas - Universidad de Lima
repository.name.fl_str_mv
repository.mail.fl_str_mv
_version_	1844893189938348032
score	13.469024

Random Forests as an extension of the classification trees with the R and Python programs

Nota importante:
La información contenida en este registro es de entera responsabilidad de la institución que gestiona el repositorio institucional donde esta contenido este documento o set de datos. El CONCYTEC no se hace responsable por los contenidos (publicaciones y/o datos) accesibles a través del Repositorio Nacional Digital de Ciencia, Tecnología e Innovación de Acceso Abierto (ALICIA).

Random Forests as an extension of the classification trees with the R and Python programs

Descripción del Articulo

Ejemplares Similares