Tópicos Sugeridos dentro de su búsqueda.
Tópicos Sugeridos dentro de su búsqueda.
Buscar alternativas:
classification trees » classification models (Expander búsqueda), classification _ (Expander búsqueda), classification using (Expander búsqueda)
data classification » abc classification (Expander búsqueda), a classification (Expander búsqueda), image classification (Expander búsqueda)
classification trees » classification models (Expander búsqueda), classification _ (Expander búsqueda), classification using (Expander búsqueda)
data classification » abc classification (Expander búsqueda), a classification (Expander búsqueda), image classification (Expander búsqueda)
1
artículo
Publicado 2010
Enlace
Enlace
This article proposes a classification for socio-cultural and linguistic data bases, especially those that document society, culture and language of Amazonian indigenous or rural mestizo people. The proposal was elaborated in the context of a DOBES language documentation project about the language use of the “People of the Center” (Bora, Witoto, Ocaina, Nonuya, and Resigaro). The basic principles of this proposal are derived from Bakhtin's/Vološinov's theory of the proposition.
2
artículo
Publicado 2017
Enlace
Enlace
This article presents the application of the non-parametric Random Forest method through supervised learning, as an extension of classification trees. The Random Forest algorithm arises as the grouping of several classification trees. Basically it randomly selects a number of variables with which each individual tree is constructed and predictions are made with these variables that will later be weighted through the calculation of the most voted class of these trees that were generated, to finally do the prediction by Random Forest. For the application, we worked with 3168 recorded voices, for which the results of an acoustic analysis are presented, registering variables such as frequency, spectrum, modulation, among others, seeking to obtain a pattern of identification and classification according to gender through a voice identifier. The data record used is in open access and can be do...
3
artículo
Publicado 2017
Enlace
Enlace
This article presents the application of the non-parametric Random Forest method through supervised learning, as an extension of classification trees. The Random Forest algorithm arises as the grouping of several classification trees. Basically it randomly selects a number of variables with which each individual tree is constructed and predictions are made with these variables that will later be weighted through the calculation of the most voted class of these trees that were generated, to finally do the prediction by Random Forest. For the application, we worked with 3168 recorded voices, for which the results of an acoustic analysis are presented, registering variables such as frequency, spectrum, modulation, among others, seeking to obtain a pattern of identification and classification according to gender through a voice identifier. The data record used is in open access and can be do...
4
artículo
Publicado 2005
Enlace
Enlace
This study has the aim to find a pattern in delayed payments from the information obtained at the moment of requesting credit in a specific creditable product: At the same time, we show a very useful new statistical technique for this area, that is the classification tree (CART) that is applied in situations where we have independent predictor variables of classification or criterion that define the group to which every individual belongs. The paper also tries to find a set of decision rules that allow an explanation of the actual classification and the use of these rules to classify any new individual.
5
artículo
Publicado 2005
Enlace
Enlace
This study has the aim to find a pattern in delayed payments from the information obtained at the moment of requesting credit in a specific creditable product: At the same time, we show a very useful new statistical technique for this area, that is the classification tree (CART) that is applied in situations where we have independent predictor variables of classification or criterion that define the group to which every individual belongs. The paper also tries to find a set of decision rules that allow an explanation of the actual classification and the use of these rules to classify any new individual.
6
artículo
Publicado 2020
Enlace
Enlace
This study describes a model of explanations in natural language for classification decision trees. The explanations include global aspects of the classifier and local aspects of the classification of a particular instance. The proposal is implemented in the ExpliClas open source Web service [1], which in its current version operates on trees built with Weka and data sets with numerical attributes. The feasibility of the proposal is illustrated with two example cases, where the detailed explanation of the respective classification trees is shown.
7
artículo
Publicado 2018
Enlace
Enlace
Based on the study of students’ perception of the undergraduate teaching activity of the Faculty of Administrative Sciences of the National University of San Marcos in the second semester of 2017, which focused on topics such as: knowledge of the subject, attendance and punctuality, ethics, didactic capacity and compliance of the syllabus among others; the presentation of a personalized report to the teacher is proposed, where each one of his students is located in different scenarios, which have been determined through the use of classification and regression trees of data mining. In order to identify patterns through which the teacher is valued (virtues and weaknesses) and thus collaborate with the improvement of the teaching quality of the Faculty.
8
artículo
Publicado 2018
Enlace
Enlace
Based on the study of students’ perception of the undergraduate teaching activity of the Faculty of Administrative Sciences of the National University of San Marcos in the second semester of 2017, which focused on topics such as: knowledge of the subject, attendance and punctuality, ethics, didactic capacity and compliance of the syllabus among others; the presentation of a personalized report to the teacher is proposed, where each one of his students is located in different scenarios, which have been determined through the use of classification and regression trees of data mining. In order to identify patterns through which the teacher is valued (virtues and weaknesses) and thus collaborate with the improvement of the teaching quality of the Faculty.
9
artículo
Publicado 2023
Enlace
Enlace
This article details the process that was carried out for the salary forecast in a database of a census given in 1996, where the Python programming language was used, for the analysis of the data of the dataset the Google Colab server was used to execute the algorithms in the cloud, since the team considered that the speed of data analysis in Google Colab is faster. One of the data mining techniques was also used to classify the variables using decision trees that have the ability to graphically represent several alternative solutions in order to determine the most effective courses/routes of action for the classification of the obtainment. of a person's salary.
10
artículo
Publicado 2010
Enlace
Enlace
This article proposes a classification for socio-cultural and linguistic data bases, especially those that document society, culture and language of Amazonian indigenous or rural mestizo people. The proposal was elaborated in the context of a DOBES language documentation project about the language use of the “People of the Center” (Bora, Witoto, Ocaina, Nonuya, and Resigaro). The basic principles of this proposal are derived from Bakhtin's/Vološinov's theory of the proposition.
11
artículo
Publicado 2010
Enlace
Enlace
The aim of this study is the determination, from an empirical perspective, of the accounting and financial features which could condition financial profitability of real estate companies, to identify the performances that guarantee its permanency in the current marketplace, characterized by the world economic crisis, specially in Spain, whose housing sector represents an important contributor to the economic growth. Although at a theoretical level the DuPont Model establishes the relationships between a group of accounting ratios and financial profitability. This paper uses a sample of 5,484 Spanish real estate companies to quantify these relationships and to extract the most relevant ones and to obtain the patterns of the most profitable companies. We use ROE to measure profitability and we analyze various independent variables about solvency, liquidity, activity, turnover, financial eq...
12
artículo
Application of decision trees for the identification of adaptability of students in online education
Publicado 2023
Enlace
Enlace
Due to the global pandemic by Covid-19, online education was established in student learning. However, the effectiveness of this modality, as well as the adaptability of the students, is something that may depend on some factors. In this sense, this research article presents a description of the use of decision trees to determine the adaptability of students in online education, using a dataset of 1205 records with data such as the type of connection and internet, device, condition. financial, among other important data. Likewise, tools such as Google Colab, Python and popular libraries were used in similar works of Artificial Intelligence and Machine Learning.
13
artículo
Various studies have proven that the levels of violence in video games can negatively influence the development of children, especially in adolescence and that is why care must be taken that the classification is appropriate according to the content present. For the analysis, the ESRB classification was used, which contains 7 different categories, together with the implementation of a decision tree model, which is a data mining technique capable of graphically representing the relationship between the variables. The results showed that the precision level for a level 6 tree does not exceed the minimum required.
14
artículo
Publicado 2022
Enlace
Enlace
With the increasing popularity of online social networking platforms, the amount of social data has grown exponentially. Social data analysis is essential as spamming activities and spammers are escalating over online social networking platforms. This paper focuses on spammer detection on the Twitter social networking platform. Although existing researchers have developed numerous machine learning methods to detect spammers, these methods are inefficient for appropriately detecting spammers on Twitter due to the imbalance of spam and nonspam data distribution, the involvement of diverse features and the applicability of data mechanisms by spammers to avoid their detection. This research work proposes a novel hybrid approach of the gravitational search algorithm and the decision tree (HGSDT) for detecting Twitter spammers. The individual decision tree (DT) algorithm is not able to address...
15
artículo
Publicado 2024
Enlace
Enlace
Author contribution: All authors made an equal contribution to the development and planning of the study. Conflict of Interest: The authors have no potential conflicts of interest, or such divergences linked with this research study. Data Availability Statement: Data are available from the authors upon request.
16
artículo
Publicado 2010
Enlace
Enlace
The aim of this study is the determination, from an empirical perspective, of the accounting and financial features which could condition financial profitability of real estate companies, to identify the performances that guarantee its permanency in the current marketplace, characterized by the world economic crisis, specially in Spain, whose housing sector represents an important contributor to the economic growth. Although at a theoretical level the DuPont Model establishes the relationships between a group of accounting ratios and financial profitability. This paper uses a sample of 5,484 Spanish real estate com-panies to quantify these relationships and to extract the most relevant ones and to obtain the patterns of the most profitable companies. We use ROE to measure profitability and we analyze various independent variables about solvency, liquidity, activity, turnover, financial e...
17
artículo
Publicado 2023
Enlace
Enlace
Abstract—In recent years, computer science has advanced exponentially, helping significantly to identify and classify text extracted from social networks, specifically Twitter. This work identifies, classifies, and analyzes tweets related to real natural disasters through tweets with the hashtag #Nat-uralDisasters, using Machine learning (ML) algorithms, such as Bernoulli Naive Bayes (BNB), Multinomial Naive Bayes (MNB), Logistic Regression (LR), K-Nearest Neighbors (KNN), Decision Tree (DT), Random Forest (RF). First, tweets related to natural disasters were identified, creating a dataset of 122k geo-located tweets for training. Secondly, the data-cleaning process was carried out by applying stemming and lemmatization techniques. Third, exploratory data analysis (EDA) was performed to gain an initial understanding of the data. Fourth, the training and testing process of the BNB, MNB, ...
18
artículo
Forest census allows getting precise data for logging planning and elaboration of the forest management plan. Species identification blunders carry inadequate forest management plans and high risks inside forest concessions. Hence, an identification protocol prevents the exploitation of non-commercial or endangered timber species. The current Peruvian legislation allows the incorporation of non-technical experts, called “materos”, during the identification. Materos use common names given by the folklore and traditions of their communities instead of formal ones, which generally lead to misclassifications. In the real world, logging companies hire materos instead of botanists due to cost/time limitations. Given such a motivation, we explore an end-to-end software solution to automatize the species identification. This paper introduces the Peruvian Amazon Forestry Dataset, which includ...
19
artículo
Publicado 2023
Enlace
Enlace
Identifying and classifying text extracted from social networks, following the traditional method, is very complex. In recent years, computer science has advanced exponentially, helping significantly to identify and classify text extracted from social networks, specifically Twitter. This work aims to identify, classify and analyze tweets related to real natural disasters through tweets with the hashtag #NaturalDisasters, using Machine learning (ML) algorithms, such as Bernoulli Naive Bayes (BNB), Multinomial Naive Bayes (MNB), Logistic Regression (LR), K-Nearest Neighbors (KNN), Decision Tree (DT), Random Forest (RF). First, tweets related to natural disasters were identified, creating a dataset of 122k geolocated tweets for training. Secondly, the data-cleaning process was carried out by applying stemming and lemmatization techniques. Third, exploratory data analysis (EDA) was performed...
20
artículo
Publicado 2018
Enlace
Enlace
Abstract: This article summarizes the main contributions of the thesis with the title “K-Nearest neighbor in a classification and prediction application in the Judicial Power of Peru". In this thesis a model is constructed using the method of the nearest k-neighbors that allows classifying and predicting the Superior Courts of Justice of Peru. Through a descriptive data analysis, the Lima Court is excluded from the study. With the remaining 30 Superior Courts, a three-group model based on unsupervised classification is generated, for which the Euclidean distance matrix that originates the classification tree is deduced. The classification model of three nearest neighbors is constructed, with partition and random cross-validation folds, which indicates; the predictor space model, the quadratic error or error index that validates the op-timal value of k = 3 neighbors, the model error and...