PXD002052 is an
original dataset announced via ProteomeXchange.
Dataset Summary
Title | Machine Learning Based Classification of Diffuse Large B-cell Lymphoma Patients by their Protein Expression Profiles |
Description | Characterization of tumors at the molecular level has improved our knowledge of cancer causation and progression. Proteomic analysis of their signaling pathways promises to enhance our understanding of cancer aberrations at the functional level, but this requires accurate and robust tools. Here, we develop a state of the art quantitative mass spectrometric pipeline to characterize formalin-fixed paraffin-embedded (FFPE) tissues of patients with closely related subtypes of diffuse large B-cell lymphoma (DLBCL). We combined a super-SILAC approach with label-free quantification (hybrid LFQ), to address situations where the protein is absent in the super-SILAC standard yet present in the patient samples. Shotgun proteomic analysis on a quadrupole Orbitrap quantified almost 9000 tumor proteins in 20 patients. The quantitative accuracy of our approach allowed the segregation of DLBCL patients according to their cell-of-origin, using both their global protein expression patterns and the 55-protein signature obtained previously from patient-derived cell lines (Deeb et al. MCP 2012 PMID 22442255). Expression levels of individual segregation-driving proteins as well as categories such as extracellular matrix proteins behaved consistent with known trends between the subtypes. We employed machine learning (support vector machines) to extract candidate proteins with the highest segregating power. A panel of four proteins (PALD1, MME, TNFAIP8 and TBC1D4) classified the patients with very low error rates. Highly ranked proteins from the support vector analysis revealed differential expression of core signaling molecules between the subtypes, elucidating aspects of their pathobiology. |
HostingRepository | PRIDE |
AnnounceDate | 2015-09-08 |
AnnouncementXML | Submission_2015-09-09_01:17:02.xml |
DigitalObjectIdentifier | |
ReviewLevel | Peer-reviewed dataset |
DatasetOrigin | Original dataset |
RepositorySupport | Unsupported dataset by repository |
PrimarySubmitter | Sally Deeb |
SpeciesList | scientific name: Homo sapiens (Human); NCBI TaxID: 9606; |
ModificationList | No PTMs are included in the dataset |
Instrument | Q Exactive |
Dataset History
Revision | Datetime | Status | ChangeLog Entry |
0 | 2015-04-13 01:37:04 | ID requested | |
⏵ 1 | 2015-09-09 01:17:03 | announced | |
Publication List
Deeb SJ, Tyanova S, Hummel M, Schmidt-Supprian M, Cox J, Mann M, Machine Learning-based Classification of Diffuse Large B-cell Lymphoma Patients by Their Protein Expression Profiles. Mol Cell Proteomics, 14(11):2947-60(2015) [pubmed] |
Keyword List
curator keyword: Biomedical |
submitter keyword: Lymphoma, proteomics, classification, SVM, hybrid LFQ |
Contact List
Matthias Mann |
contact affiliation | Max Planck Institute of Biochemistry |
contact email | mmann@biochem.mpg.de |
lab head | |
Sally Deeb |
contact affiliation | Max Planck Institute |
contact email | deeb@biochem.mpg.de |
dataset submitter | |
Full Dataset Link List
Dataset FTP location
NOTE: Most web browsers have now discontinued native support for FTP access within the browser window. But you can usually install another FTP app (we recommend FileZilla) and configure your browser to launch the external application when you click on this FTP link. Or otherwise, launch an app that supports FTP (like FileZilla) and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2015/09/PXD002052 |
PRIDE project URI |
Repository Record List
[ + ]
[ - ]
- PRIDE
- PXD002052
- Label: PRIDE project
- Name: Machine Learning Based Classification of Diffuse Large B-cell Lymphoma Patients by their Protein Expression Profiles