Projects per year
Abstract
Motivation : Drawing peaks in a data window of an MS dataset happens at all time in MS data visualization applications. This asks to retrieve from an MS dataset some selected peaks in a data window whose image in a display window reflects the visual feature of all peaks in the data window. If an algorithm for this purpose is asked to output high-quality solutions in real time, then the most fundamental dependence of it is on the storage format of the MS dataset.
Results : We present mzMD, a new storage format of MS datasets and an algorithm to query this format of a storage system for a summary (a set of selected representative peaks) of a given data window. We propose a criterion Q-score to examine the quality of data window summaries. Experimental statistics on real MS datasets verified the high speed of mzMD in retrieving high-quality data window summaries. mzMD reported summaries of data windows whose Q-score outperforms those mzTree reported. The query speed of mzMD is the same as that of mzTree whereas its query speed stability is better than that of mzTree.
Results : We present mzMD, a new storage format of MS datasets and an algorithm to query this format of a storage system for a summary (a set of selected representative peaks) of a given data window. We propose a criterion Q-score to examine the quality of data window summaries. Experimental statistics on real MS datasets verified the high speed of mzMD in retrieving high-quality data window summaries. mzMD reported summaries of data windows whose Q-score outperforms those mzTree reported. The query speed of mzMD is the same as that of mzTree whereas its query speed stability is better than that of mzTree.
Original language | English |
---|---|
Article number | btac098 |
Pages (from-to) | 2333–2340 |
Journal | Bioinformatics |
Volume | 38 |
Issue number | 5 |
Online published | 16 Feb 2022 |
DOIs | |
Publication status | Published - 15 Apr 2022 |
Funding
This work was supported by Natural Science Foundation of China [No. 61732009]. Dr L.W. was supported by Natural Science Foundation of China [No. 61972329] and GRF grants for Hong Kong Special Administrative Region, P. R. China (CityU 11210119 and CityU 11206120).
Research Keywords
- MASS-SPECTROMETRY
- LC-MS/MS
- PROTEOMICS
- PLATFORM
- FORMAT
Fingerprint
Dive into the research topics of 'mzMD: visualization-oriented MS data storage and retrieval'. Together they form a unique fingerprint.Projects
- 2 Finished
-
GRF: Algorithms for Searching MS Spectra against Protein Databases and Protein Sequencing Using Combined Top-down and Bottom-up Approach for Monoclonal Antibodies
WANG, L. (Principal Investigator / Project Coordinator)
1/01/21 → 13/06/25
Project: Research
-
GRF: Efficient Algorithms for Identification of Modified Proteoforms Using Top-down Mass Spectra
WANG, L. (Principal Investigator / Project Coordinator) & Liu, X. (Co-Investigator)
1/01/20 → 5/06/24
Project: Research