Abstract
This paper focuses on maximizing the precision in binary classification problems using the k-Nearest Neighbour (k-NN) algorithm by simultaneously selecting the variables and neighbourhood size (k). The inputs to k-NN include a set of variables, the neighbourhood size and the distance metric usually selected based on data characteristics. The first two are typically decided sequentially in many studies. The current simultaneous optimization problem is formulated by a mixed-integer linear fractional program and solved by parametric algorithm. The squared Euclidean distance metric is used but the model can be adapted for other distance metrics. The methodology is tested on ten publicly available datasets. Results showed that using at least half to all variables with appropriate k value can achieve better or equally good precision. An effective set of variables jointly determined with neighbourhood size can facilitate k-NN to perform classification with high precision. ©2023 IEEE.
| Original language | English |
|---|---|
| Title of host publication | HORA 2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications Proceedings |
| Publisher | IEEE |
| Number of pages | 6 |
| ISBN (Electronic) | 979-8-3503-3752-5 |
| ISBN (Print) | 979-8-3503-3753-2 |
| DOIs | |
| Publication status | Published - 2023 |
| Event | 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA 2023) - Hybrid, Istanbul, Türkiye Duration: 8 Jun 2023 → 10 Jun 2023 https://horacongress.com/ |
Conference
| Conference | 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA 2023) |
|---|---|
| Abbreviated title | HORA 2023 |
| Place | Türkiye |
| City | Istanbul |
| Period | 8/06/23 → 10/06/23 |
| Internet address |
Research Keywords
- Binary classification
- Precision
- k-NN
- Variable selection
- Determination of k
- Fractional programming