Interactive reinforced feature selection with traverse strategy
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Pages (from-to) | 1935–1962 |
Journal / Publication | Knowledge and Information Systems |
Volume | 65 |
Issue number | 5 |
Online published | 21 Jan 2023 |
Publication status | Published - May 2023 |
Link(s)
Abstract
In this paper, we propose a single-agent Monte Carlo-based reinforced feature selection method, as well as two efficiency improvement strategies, i.e., early stopping strategy and reward-level interactive strategy. Feature selection is one of the most important technologies in data prepossessing, aiming to find the optimal feature subset for a given downstream machine learning task. Enormous research has been done to improve its effectiveness and efficiency. Recently, the multi-agent reinforced feature selection (MARFS) has achieved great success in improving the performance of feature selection. However, MARFS suffers from the heavy burden of computational cost, which greatly limits its application in real-world scenarios. In this paper, we propose an efficient reinforcement feature selection method, which uses one agent to traverse the whole feature set and decides to select or not select each feature one by one. Specifically, we first develop one behavior policy and use it to traverse the feature set and generate training data. And then, we evaluate the target policy based on the training data and improve the target policy by Bellman equation. Besides, we conduct the importance sampling in an incremental way and propose an early stopping strategy to improve the training efficiency by the removal of skew data. In the early stopping strategy, the behavior policy stops traversing with a probability inversely proportional to the importance sampling weight. In addition, we propose a reward-level and training-level interactive strategy to improve the training efficiency via external advice. What's more, we propose an incremental descriptive statistics method to represent the state with low computational cost. Finally, we design extensive experiments on real-world data to demonstrate the superiority of the proposed method. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2023.
Research Area(s)
- Feature selection, Reinforcement learning, Monte Carlo, ALGORITHM, MACHINE
Citation Format(s)
Interactive reinforced feature selection with traverse strategy. / Liu, Kunpeng; Wang, Dongjie; Du, Wan et al.
In: Knowledge and Information Systems, Vol. 65, No. 5, 05.2023, p. 1935–1962.
In: Knowledge and Information Systems, Vol. 65, No. 5, 05.2023, p. 1935–1962.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review