
PXD002052

PXD002052 is an original dataset announced via ProteomeXchange.

Dataset Summary
Title: Machine Learning Based Classification of Diffuse Large B-cell Lymphoma Patients by their Protein Expression Profiles
Description: Characterization of tumors at the molecular level has improved our knowledge of cancer causation and progression. Proteomic analysis of their signaling pathways promises to enhance our understanding of cancer aberrations at the functional level, but this requires accurate and robust tools. Here, we develop a state-of-the-art quantitative mass spectrometric pipeline to characterize formalin-fixed paraffin-embedded (FFPE) tissues of patients with closely related subtypes of diffuse large B-cell lymphoma (DLBCL). We combined a super-SILAC approach with label-free quantification (hybrid LFQ) to address situations where the protein is absent in the super-SILAC standard yet present in the patient samples. Shotgun proteomic analysis on a quadrupole Orbitrap quantified almost 9000 tumor proteins in 20 patients. The quantitative accuracy of our approach allowed the segregation of DLBCL patients according to their cell-of-origin, using both their global protein expression patterns and the 55-protein signature obtained previously from patient-derived cell lines (Deeb et al. MCP 2012 PMID 22442255). Expression levels of individual segregation-driving proteins, as well as categories such as extracellular matrix proteins, behaved consistently with known trends between the subtypes. We employed machine learning (support vector machines) to extract candidate proteins with the highest segregating power. A panel of four proteins (PALD1, MME, TNFAIP8 and TBC1D4) classified the patients with very low error rates. Highly ranked proteins from the support vector analysis revealed differential expression of core signaling molecules between the subtypes, elucidating aspects of their pathobiology.
HostingRepository: PRIDE
AnnounceDate: 2015-09-08
AnnouncementXML: Submission_2015-09-09_01:17:02.xml
DigitalObjectIdentifier:
ReviewLevel: Peer-reviewed dataset
DatasetOrigin: Original dataset
RepositorySupport: Unsupported dataset by repository
PrimarySubmitter: Sally Deeb
SpeciesList: scientific name: Homo sapiens (Human); NCBI TaxID: 9606;
ModificationList: No PTMs are included in the dataset
Instrument: Q Exactive
Dataset History
Revision  Datetime             Status        ChangeLog Entry
0         2015-04-13 01:37:04  ID requested
1         2015-09-09 01:17:03  announced
Publication List
Deeb SJ, Tyanova S, Hummel M, Schmidt-Supprian M, Cox J, Mann M, Machine Learning-based Classification of Diffuse Large B-cell Lymphoma Patients by Their Protein Expression Profiles. Mol Cell Proteomics, 14(11):2947-60(2015) [pubmed]
Keyword List
curator keyword: Biomedical
submitter keyword: Lymphoma, proteomics, classification, SVM, hybrid LFQ
Contact List
Matthias Mann
contact affiliation: Max Planck Institute of Biochemistry
contact email: mmann@biochem.mpg.de
lab head
Sally Deeb
contact affiliation: Max Planck Institute
contact email: deeb@biochem.mpg.de
dataset submitter
Full Dataset Link List
Dataset FTP location
NOTE: Most web browsers have discontinued native support for FTP access within the browser window. You can usually install a separate FTP application (we recommend FileZilla) and configure your browser to launch it when you click on this FTP link. Alternatively, open an application that supports FTP (such as FileZilla) and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2015/09/PXD002052
PRIDE project URI
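For scripted access instead of a graphical FTP client, the dataset FTP address given above can also be read with Python's standard library. The following is a minimal sketch, not an official PRIDE client: the helper names `split_ftp_url` and `list_dataset_files` are hypothetical, and it assumes the server accepts anonymous logins and still serves this directory.

```python
from ftplib import FTP
from urllib.parse import urlparse

# FTP address as given in the dataset record.
DATASET_URL = "ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2015/09/PXD002052"


def split_ftp_url(url):
    """Split an ftp:// URL into (host, directory path)."""
    parts = urlparse(url)
    if parts.scheme != "ftp":
        raise ValueError(f"not an FTP URL: {url}")
    return parts.hostname, parts.path


def list_dataset_files(url=DATASET_URL, timeout=30):
    """Return the entry names in the dataset directory.

    Assumption: anonymous login is permitted on the server.
    """
    host, path = split_ftp_url(url)
    with FTP(host, timeout=timeout) as ftp:
        ftp.login()      # no arguments means an anonymous login
        ftp.cwd(path)    # change into the dataset directory
        return ftp.nlst()  # names only, no sizes or dates
```

Calling `list_dataset_files()` requires network access and returns whatever entries the server reports for that directory; individual files could then be fetched with `FTP.retrbinary`.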