Title
Class confidence weighted kNN algorithms for imbalanced data sets
Abstract
In this paper, a novel k-nearest neighbors (kNN) weighting strategy is proposed for handling the problem of class imbalance. When dealing with highly imbalanced data, a salient drawback of existing kNN algorithms is that the class with more frequent samples tends to dominate the neighborhood of a test instance in spite of distance measurements, which leads to suboptimal classification performance on the minority class. To solve this problem, we propose CCW (class confidence weights) that uses the probability of attribute values given class labels to weight prototypes in kNN. The main advantage of CCW is that it is able to correct the inherent bias to majority class in existing kNN algorithms on any distance measurement. Theoretical analysis and comprehensive experiments confirm our claims.
Year
DOI
Venue
2011
10.1007/978-3-642-20847-8_29
PAKDD (2)
Keywords
Field
DocType
frequent sample,comprehensive experiment,minority class,class confidence weighted knn,knn algorithm,majority class,class label,class imbalance,distance measurement,imbalanced data set,classification performance,class confidence weight
Drawback,Data mining,Data set,Weighting,Computer science,Artificial intelligence,Spite,Distance measurement,Pattern recognition,Algorithm,Bayesian network,Machine learning,Salient
Conference
Volume
ISSN
Citations 
6635
0302-9743
37
PageRank 
References 
Authors
1.19
18
2
Name
Order
Citations
PageRank
Wei Liu146837.36
Sanjay Chawla21372105.09