Development of a Machine Learning Algorithm for Drug Screening Analysis on High-Resolution UPLC-MSE/QTOF Mass Spectrometry.

TitleDevelopment of a Machine Learning Algorithm for Drug Screening Analysis on High-Resolution UPLC-MSE/QTOF Mass Spectrometry.
Publication TypeJournal Article
Year of Publication2023
AuthorsHao Y, Lynch K, Fan P, Jurtschenko C, Cid M, Zhao Z, Yang HS
JournalJ Appl Lab Med
Volume8
Issue1
Pagination53-66
Date Published2023 Jan 04
ISSN2576-9456
KeywordsChromatography, High Pressure Liquid, Chromatography, Liquid, Drug Evaluation, Preclinical, Humans, Ions, Mass Spectrometry
Abstract

BACKGROUND: Ultra-performance liquid chromatography (UPLC)-MSE/quadrupole time-of-flight (QTOF) high-resolution mass spectrometry employs untargeted, data-independent acquisition in a dual mode that simultaneously collects precursor ions and product ions at low and ramped collision energies, respectively. However, algorithmic analysis of large-scale multivariate data of comprehensive drug screening as well as the positivity criteria of drug identification have not been systematically investigated. It is also unclear whether ion ratio (IR), the intensity ratio of a defined product ion divided by the precursor ion, is a stable parameter that can be incorporated into the MSE/QTOF data analysis algorithm.

METHODS: IR of 91 drugs were experimentally determined and variation of IR was investigated across 5 concentrations measured on 3 different days. A data-driven machine learning approach was employed to develop multivariate linear regression (MLR) models incorporating mass error, retention time, number of detected fragment ions and IR, accuracy of isotope abundance, and peak response using drug-supplemented urine samples. Performance of the models was evaluated in an independent data set of unknown clinical urine samples in comparison with the results of manual analysis.

RESULTS: IR of most compounds acquired by MSE/QTOF were low and concentration-dependent (i.e., IR increased at higher concentrations). We developed an MLR model with composite score outputs incorporating 7 parameters to predict positive drug identification. The model achieved a mean accuracy of 89.38% in the validation set and 87.92% agreement in the test set.

CONCLUSIONS: The MLR model incorporating all contributing parameters can serve as a decision-support tool to facilitate objective drug identification using UPLC-MSE/QTOF.

DOI10.1093/jalm/jfac100
Alternate JournalJ Appl Lab Med
PubMed ID36610415
Related Faculty: 
He Sarina Yang, M.D., Ph.D. Zhen Zhao, Ph.D.

Pathology & Laboratory Medicine 1300 York Avenue New York, NY 10065 Phone: (212) 746-6464
Surgical Pathology: (212) 746-2700