⮝ Full datasets listing

PXD041337

PXD041337 is an original dataset announced via ProteomeXchange.

Dataset Summary
TitleMachine Learning Identifies Plasma Proteomic Signatures of Descending Thoracic Aortic Disease
DescriptionDescending thoracic aortic aneurysms and dissections can go undetected until severe and catastrophic, and few clinical indices exist to screen for aneurysms or predict their risk of dissection or rupture. This study generated a plasma proteomic dataset from 150 patients with descending thoracic aortic disease and 52 controls to identify proteomic signatures capable of differentiating descending thoracic aortic disease from non-disease controls, as well as between cases with aneurysm versus descending ‘type B’ dissection. Of the 1,468 peptides and 195 proteins quantified across all samples, 853 peptides and 99 proteins were quantitatively different between disease and control patients (BH adjusted p-value < 0.01 from t-tests). Supervised machine learning (ML) methods were used to classify disease from control and aneurysm from descending dissection cases. The highest precision-recall area under the curve (PR AUC) was achieved on the held-out test set using significantly different proteins between disease and control patients (PR AUC 0.99), followed by input of significant peptides (PR AUC 0.96). Despite no statistically significant proteins between aneurysm and dissection cases, use of all proteins was able to modestly classify between the two disease states (PR AUC 0.77). To overcome correlation in the proteins and enable biological pathway analysis, a disease versus control classifier was optimized using only seven unique protein clusters, which achieved comparable performance to models trained on all/significant proteins (accuracy 0.88, F1-score 0.78, PR AUC 0.90). Model interpretation with permutation importance revealed that proteins in the most important clusters for differentiating disease and control function in coagulation, protein-lipid complex remodeling, and acute inflammatory response.
HostingRepositoryPRIDE
AnnounceDate2024-06-16
AnnouncementXMLSubmission_2024-06-16_01:16:02.253.xml
DigitalObjectIdentifier
ReviewLevelPeer-reviewed dataset
DatasetOriginOriginal dataset
RepositorySupportUnsupported dataset by repository
PrimarySubmitterSarah Parker
SpeciesList scientific name: Homo sapiens (Human); NCBI TaxID: 9606;
ModificationListNo PTMs are included in the dataset
InstrumentOrbitrap Exploris 480
Dataset History
RevisionDatetimeStatusChangeLog Entry
02023-04-04 18:53:06ID requested
12024-06-16 01:16:03announced
Publication List
10.1186/s12014-024-09487-4;
Momenzadeh A, Kreimer S, Guo D, Ayres M, Berman D, Chyu KY, Shah PK, Milewicz D, Azizzadeh A, Meyer JG, Parker S, Differentiation between descending thoracic aortic diseases using machine learning and plasma proteomic signatures. Clin Proteomics, 21(1):38(2024) [pubmed]
Keyword List
submitter keyword: Plasma Proteomics, Machine Learning, Type B Dissection,Descending Aortic Disease, Thoracic Aortic Aneurysm
Contact List
Sarah Parker
contact affiliationDepartment of Cardiology
contact emailsarah.parker@cshs.org
lab head
Sarah Parker
contact affiliationCedars-Sinai Medical Center
contact emailsarah.parker@cshs.org
dataset submitter
Full Dataset Link List
Dataset FTP location
NOTE: Most web browsers have now discontinued native support for FTP access within the browser window. But you can usually install another FTP app (we recommend FileZilla) and configure your browser to launch the external application when you click on this FTP link. Or otherwise, launch an app that supports FTP (like FileZilla) and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2024/06/PXD041337
PRIDE project URI
Repository Record List
[ + ]