Optimizing variable selection and k in the k-NN classifier with precision objective

Ka Yuk Carrie LIN*

*Corresponding author for this work

    Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

    1 Citation (Scopus)

    Abstract

    This paper focuses on maximizing the precision in binary classification problems using the k-Nearest Neighbour (k-NN) algorithm by simultaneously selecting the variables and neighbourhood size (k). The inputs to k-NN include a set of variables, the neighbourhood size and the distance metric usually selected based on data characteristics. The first two are typically decided sequentially in many studies. The current simultaneous optimization problem is formulated by a mixed-integer linear fractional program and solved by parametric algorithm. The squared Euclidean distance metric is used but the model can be adapted for other distance metrics. The methodology is tested on ten publicly available datasets. Results showed that using at least half to all variables with appropriate k value can achieve better or equally good precision. An effective set of variables jointly determined with neighbourhood size can facilitate k-NN to perform classification with high precision. ©2023 IEEE.
    Original languageEnglish
    Title of host publicationHORA 2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications Proceedings
    PublisherIEEE
    Number of pages6
    ISBN (Electronic)979-8-3503-3752-5
    ISBN (Print)979-8-3503-3753-2
    DOIs
    Publication statusPublished - 2023
    Event5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA 2023) - Hybrid, Istanbul, Türkiye
    Duration: 8 Jun 202310 Jun 2023
    https://horacongress.com/

    Conference

    Conference5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA 2023)
    Abbreviated titleHORA 2023
    PlaceTürkiye
    CityIstanbul
    Period8/06/2310/06/23
    Internet address

    Research Keywords

    • Binary classification
    • Precision
    • k-NN
    • Variable selection
    • Determination of k
    • Fractional programming

    Fingerprint

    Dive into the research topics of 'Optimizing variable selection and k in the k-NN classifier with precision objective'. Together they form a unique fingerprint.

    Cite this