A retrospective study differentiating nontuberculous mycobacterial pulmonary disease from pulmonary tuberculosis on computed tomography using radiomics and machine learning algorithms.

A retrospective study differentiating nontuberculous mycobacterial pulmonary disease from pulmonary tuberculosis on computed tomography using radiomics and machine learning algorithms.

Publication date: Dec 01, 2024

To evaluate the effectiveness of a machine learning based on computed tomography (CT) radiomics to distinguish nontuberculous mycobacterial pulmonary disease (NTM-PD) from pulmonary tuberculosis (PTB). In this retrospective analysis, medical records of 99 individuals afflicted with NTM-PD and 285 individuals with PTB in Zhejiang Chinese and Western Medicine Integrated Hospital were examined. Random numbers generated by a computer were utilized to stratify the study cohort, with 80% designated as the training cohort and 20% as the validation cohort. A total of 2153 radiomics features were extracted using Python (Pyradiomics package) to analyse the CT characteristics of the large disease areas. The identification of significant factors was conducted through the least absolute shrinkage and selection operator (LASSO) regression. The following four supervised learning classifier models were developed: random forest (RF), support vector machine (SVM), logistic regression (LR), and extreme gradient boosting (XGBoost). For assessment and comparison of the predictive performance among these models, receiver-operating characteristic (ROC) curves and the areas under the ROC curves (AUCs) were employed. The Student’s t-test, Levene test, and LASSO algorithm collectively selected 23 optimal features. ROC analysis was then conducted, with the respective AUC values of the XGBoost, LR, SVM, and RF models recorded to be 1, 0. 9044, 0. 8868, and 0. 7982 in the training cohort. In the validation cohort, the respective AUC values of the XGBoost, LR, SVM, and RF models were 0. 8358, 0. 8085, 0. 87739, and 0. 7759. The DeLong test results noted the lack of remarkable variation across the models. The CT radiomics features can help distinguish between NTM-PD and PTB. Among the four classifiers, SVM showed a stable performance in effectively identifying these two diseases.

Concepts Keywords
Chinese Adult
Ct Aged
Mycobacterial Algorithms
Receiver CT
Diagnosis, Differential
Female
Humans
Machine Learning
machine learning
Male
Middle Aged
Mycobacterium Infections, Nontuberculous
nontuberculous mycobacteria
pulmonary tuberculosis
Radiomics
radiomics
Retrospective Studies
ROC Curve
Support Vector Machine
Tomography, X-Ray Computed
Tuberculosis, Pulmonary

Semantics

Type Source Name
disease MESH pulmonary disease
disease MESH pulmonary tuberculosis
drug DRUGBANK Flunarizine
drug DRUGBANK Saquinavir
disease IDO algorithm
disease MESH Mycobacterium Infections Nontuberculous

Original Article

Leave a Comment

Your email address will not be published. Required fields are marked *