An Optimization-Based Sample Selection Method Considering Sample Redundancy and Usefulness

Feng Zhu*, Jianshe Feng, Zicheng Su, Min Xie

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Data pre-processing is a crucial step for data management, communication, and modeling in Prognostics and Health Management (PHM) framework. However, most existing studies have primarily focused on feature selection while disregarding the significance of sample selection. In practice, effective sample selection not only decreases data redundancy but also enhances computational efficiency and model performance. This research introduces a new offline sample selection method that employs an optimization algorithm based on simultaneous sparse recovery. We formulate the sample selection problem as a linear programming model that is solvable using most standard convex solvers. The solution of the model not only identifies the important samples but also establishes the mapping relationships between each sample and the important ones. The method can identify a small subset of samples that are crucial by considering their usefulness and freshness. We demonstrate the effectiveness of our proposed method in a case study that employs the dataset from the 2016 PHM data challenge. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
Original languageEnglish
Title of host publicationProceedings of the UNIfied Conference of DAMAS, IncoME and TEPEN Conferences (UNIfied 2023)
EditorsAndrew D. Ball, Huajiang Ouyang, Jyoti K. Sinha, Zuolu Wang
Place of PublicationCham
PublisherSpringer 
Pages1119-1129
Volume1
ISBN (Electronic)978-3-031-49413-0
ISBN (Print)9783031494123
DOIs
Publication statusPublished - 2024
EventUNIfied Conference of International Workshop on Defence Applications of Multi-Agent Systems (DAMAS 2023), International Conference on Maintenance Engineering (IncoME-V 2023), International conference on the Efficiency and Performance Engineering Network (TEPEN 2023) - University of Huddersfield, Huddersfield, United Kingdom
Duration: 29 Aug 20231 Sept 2023
https://tepen.net/tepen-conferences/

Publication series

NameMechanisms and Machine Science
Volume151
ISSN (Print)2211-0984
ISSN (Electronic)2211-0992

Conference

ConferenceUNIfied Conference of International Workshop on Defence Applications of Multi-Agent Systems (DAMAS 2023), International Conference on Maintenance Engineering (IncoME-V 2023), International conference on the Efficiency and Performance Engineering Network (TEPEN 2023)
Country/TerritoryUnited Kingdom
CityHuddersfield
Period29/08/231/09/23
Internet address

Research Keywords

  • Data pre-processing
  • Optimization
  • Prognostics and health management
  • Sample selection
  • Virtual metrology

Fingerprint

Dive into the research topics of 'An Optimization-Based Sample Selection Method Considering Sample Redundancy and Usefulness'. Together they form a unique fingerprint.

Cite this