Title
Online multi-label stream feature selection based on neighborhood rough set with missing labels
Abstract
Multi-label feature selection is essential in many big data applications and plays a significant role in processing high-dimensional data. However, existing online stream feature selection methods ignore the existence of missing labels. Inspired by the neighborhood rough set, which does not require prior knowledge of the feature space, we propose a novel online multi-label stream feature selection algorithm called OFS-Mean. We define a neighborhood relationship that can automatically select an appropriate number of neighbors. Without requiring prior knowledge of the feature space or preset parameters, the algorithm's performance is improved by real-time online prediction of missing labels based on the similarity between an instance and its neighbors. The proposed OFS-Mean divides the feature selection process into two stages, online feature importance evaluation and online redundancy update, to screen important features. With the support of the neighborhood rough set, OFS-Mean can adapt to various types of datasets, improving the algorithm's generalization ability. In the experiments, a similarity test is used to verify the prediction results, and a comparison with traditional semi-supervised feature selection methods, under the condition of selecting the same number of features, yields favorable results.
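As a rough illustration of the idea of predicting missing labels from the similarity between an instance and its neighbors, the following is a minimal sketch and not the OFS-Mean procedure itself. The function name `impute_missing_labels`, the fixed neighbor count `k`, the Euclidean distance, the `1 / (1 + distance)` similarity, and the 0.5 threshold are all assumptions made for this example; the abstract states that the actual neighborhood relationship chooses the number of neighbors automatically.

```python
import numpy as np

def impute_missing_labels(X, Y, k=3):
    """Fill NaN entries of Y with a similarity-weighted vote of the k nearest neighbors.

    X: (n_instances, n_features) feature matrix.
    Y: (n_instances, n_labels) binary label matrix, NaN marks a missing label.
    k: fixed neighbor count (an assumption; the paper selects it automatically).
    """
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=float)
    filled = Y.copy()

    # Pairwise Euclidean distances between all instances.
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    np.fill_diagonal(dist, np.inf)  # an instance is not its own neighbor

    for i, j in zip(*np.where(np.isnan(Y))):
        neighbors = np.argsort(dist[i])[:k]
        # Keep only neighbors whose label j is actually observed.
        known = neighbors[~np.isnan(Y[neighbors, j])]
        if known.size == 0:
            continue  # no evidence available; leave the label missing
        weights = 1.0 / (1.0 + dist[i, known])  # closer neighbors weigh more
        score = np.dot(weights, Y[known, j]) / weights.sum()
        filled[i, j] = 1.0 if score >= 0.5 else 0.0
    return filled
```

Calling `impute_missing_labels(X, Y, k=1)` copies each missing label from the single closest instance that has it observed; larger `k` values average over more neighbors, weighted by similarity.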
Year
2022
DOI
10.1007/s10044-022-01067-2
Venue
Pattern Analysis and Applications
Keywords
Online feature selection, Neighborhood rough set, Missing labels, Stream feature, Multi-label
DocType
Journal
Volume
25
Issue
4
ISSN
1433-7541
Citations
0
PageRank
0.34
References
27
Authors
4
Name            Order  Citations  PageRank
Shunpan Liang   1      0          1.01
Ze Liu          2      0          0.34
Dianlong You    3      0          1.01
Weiwei Pan      4      0          0.34