Title | ||
---|---|---|
Online multi-label stream feature selection based on neighborhood rough set with missing labels |
Abstract | ||
---|---|---|
Multi-label feature selection has been essential in many big data applications and plays a significant role in processing high-dimensional data. However, the existing online stream feature selection methods ignore the existence of missing labels. Inspired by the neighborhood rough set that does not require prior knowledge of the feature space, we propose a novel online multi-label stream feature selection algorithm called OFS-Mean. We define a neighborhood relationship that can automatically select an appropriate number of neighbors. Without any prior space and parameters, the algorithm’s performance of the algorithm is improved by real-time online prediction of missing labels based on the similarity between the instance and its neighbors. The proposed OFS-Mean divides the feature selection process into two stages: online feature importance evaluation and online redundancy update to screen important features. With the support of neighborhood rough set, the proposed OFS-Mean can adapt to various types of datasets, improving the algorithm generalization ability. In the experiment, the similarity test is used to verify the prediction results; the comparison with the traditional semi-supervised feature selection method under the condition of selecting the same number of features has achieved ideal results. |
Year | DOI | Venue |
---|---|---|
2022 | 10.1007/s10044-022-01067-2 | Pattern Analysis and Applications |
Keywords | DocType | Volume |
Online feature selection, Neighborhood rough set, Missing labels, Stream feature, Multi-label | Journal | 25 |
Issue | ISSN | Citations |
4 | 1433-7541 | 0 |
PageRank | References | Authors |
0.34 | 27 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shunpan Liang | 1 | 0 | 1.01 |
Ze Liu | 2 | 0 | 0.34 |
Dianlong You | 3 | 0 | 1.01 |
Weiwei Pan | 4 | 0 | 0.34 |