<<< Full experiment listing

PXD050561

PXD050561 is an original dataset announced via ProteomeXchange.

Dataset Summary
TitleA multiscale functional map of somatic mutations in cancer integrating protein structure and network topology
DescriptionA major goal of cancer biology is to understand the mechanisms underlying tumorigenesis driven by somatically acquired mutations. Two distinct types of computational methodologies have emerged: one focuses on analyzing clustering of mutations within protein sequences and 3D structures, while the other characterizes mutations by leveraging the topology of protein-protein interaction network. Their insights are largely non-overlapping, offering complementary strengths. Here, we established a unified, end-to-end 3D structurally-informed protein interaction network propagation framework, NetFlow3D, that systematically maps the multiscale mechanistic effects of somatic mutations in cancer. The establishment of NetFlow3D hinges upon the Human Protein Structurome, a comprehensive repository we compiled that incorporates the 3D structures of every single protein as well as the binding interfaces of all known protein interactions in humans. NetFlow3D leverages the Structurome to integrate information across atomic, residue, protein and network levels: It conducts 3D clustering of mutations across atomic and residue levels on protein structures to identify potential driver mutations. It then anisotropically propagates their impacts across the protein interaction network, with propagation guided by the specific 3D structural interfaces involved, to identify significantly interconnected network "modules", thereby uncovering key biological processes underlying disease etiology. Applied to 1,038,899 somatic protein-altering mutations in 9,946 TCGA tumors across 33 cancer types, NetFlow3D identified 12,378 significant 3D clusters throughout the Human Protein Structurome, of which ~54% would not have been found if using only experimentally-determined structures. It then identified 28 significantly interconnected modules that encompass ~8-fold more proteins than applying standard network analyses.
HostingRepositoryMassIVE
AnnounceDate2024-08-07
AnnouncementXMLSubmission_2024-08-07_21:00:32.972.xml
DigitalObjectIdentifier
ReviewLevelNon peer-reviewed dataset
DatasetOriginOriginal dataset
RepositorySupportUnsupported dataset by repository
PrimarySubmitterYingying Zhang
SpeciesList scientific name: Human 293T cells;
ModificationListNo PTMs are included in the dataset
InstrumentOrbitrap Fusion Lumos
Dataset History
RevisionDatetimeStatusChangeLog Entry
02024-03-12 15:26:59ID requested
12024-08-07 21:00:33announced
Publication List
no publication
Keyword List
submitter keyword: Cancer Genomics, 3D Protein Structure, Interactome, Protein-Protein Interaction Network, TMT, IP-MS
Contact List
Haiyuan Yu
contact affiliationCornell University
contact emailhaiyuan.yu@cornell.edu
lab head
Yingying Zhang
contact affiliationCornell University
contact emailyz2296@cornell.edu
dataset submitter
Full Dataset Link List
MassIVE dataset URI
Dataset FTP location
NOTE: Most web browsers have now discontinued native support for FTP access within the browser window. But you can usually install another FTP app (we recommend FileZilla) and configure your browser to launch the external application when you click on this FTP link. Or otherwise, launch an app that supports FTP (like FileZilla) and use this address: ftp://massive.ucsd.edu/v07/MSV000094298/