COMPARATIVE STUDY OF LEARNING FROM IMBALANCED DATA

Comparative Study Of Learning From Imbalanced Data

Request Complete PDF Copy on WhatsApp

ABSRACT -- [Total Page(s) 1]

Page 1 of 1

- The automation of most of our activities has led to the continuous production of data that arrive in the form of fast-arriving streams. In a supervised learning setting, instances in these streams are labeled as belonging to a particular class. When the number of classes in the data stream is more than two, such a data stream is referred to as a multi-class data stream. Multi-class imbalanced data stream describes the situation where the instance distribution of the classes is skewed, such that instances of some classes occur more frequently than others. Classes with the frequently occurring instances are referred to as the majority classes, while the classes with instances that occur less frequently are denoted as the minority classes.
  Classification algorithms, or supervised learning techniques, use historic instances to build models, which are then used to predict the classes of unseen instances. Multi-class imbalanced data stream classification poses a great challenge to classical classification algorithms. This is due to the fact that traditional algorithms are usually biased towards the majority classes, since they have more examples of the majority classes when building the model.
  The research conducted in this thesis aims to address this research gap by proposing a novel online learning methodology that combines oversampling of the minority classes with cluster-based majority class under-sampling, without decomposing the data stream into multiple binary sets. Sampling involves continuously selecting a balanced number of instances across all classes for model building. Our focus is on improving the rate of correctly predicting instances of the minority classes in multi-class imbalanced data streams, through the introduction of the Synthetic Minority Over-sampling Technique (SMOTE) and Cluster-based Under-sampling Technique - Data Streams (CUT-DS) methodologies. In this work, we dynamically balance the classes by utilizing a windowing mechanism during the incremental sampling process. Our CUT-DS algorithms are evaluated using six different types of classification techniques, followed by comparing their results against a state-of-the-art algorithm. Our contributions are tested using both synthetic and real data sets. The experimental results show that the approaches developed in this thesis yield high prediction rates of minority instances as contained in the multiple minority classes within a non-evolving stream.

ABSRACT -- [Total Page(s) 1]

Page 1 of 1

- CHAPTER ONE - [ Total Page(s): 2 ]1.5 Scope of the studyThe study is restricted to the nature of Imbalanced data, providing comparative study of learning schemes for learning from imbalanced data. The scope of the study in broad terms of other than learning from imbalanced data. Few among them are;Machine Learning algorithmic approach to learning from imbalanced data such as decision Trees (The Naïve Bayes Tree), and Artificial Neural network (The Multilayer Perceptron ), Machine learning performance evaluation measures, Perfor ... Continue reading---
Request Complete PDF Copy on WhatsApp

Research Topics and Full Project Work.

ProjectWaka.com is a bank of full project works, students' final year project ideas, free project topics and materials pdf, project work samples and complete project pdf. We have made provision of all project contents and in rare cases did require a service support fee; our database has grown to about one million [1,000,000] research project materials, we are committed to serving you research topics in education, project topics on accounting, project topics for mass communication, project topics in computer science, project topics in economics, project topics in business administration, project topics for public administration and on every courses you may ever need.

The quantity and quality of student projects, the satisfying academic solutions together with the simplicity of our platform and our free offering with just a service support fee requirement from users makes us the best and largest research project website.
You can subscribe for our updates on the following handle: Facebook, Whatsapp, Twitter, Instagram, and linkedin.
If you have any suggestion or complain email hello@projectwaka.com or WhatsApp +234818 764 4224. or Call +234807 177 5447.

ABSRACT -- [Total Page(s) 1]

ABSRACT -- [Total Page(s) 1]