PXD037601-2

PXD037601 is an original dataset announced via ProteomeXchange.

Dataset Summary
Title: Functional annotation of proteins for signaling network inference in non-model species
Description: Molecular biology aims to understand the molecular basis of cellular responses, unravel dynamic regulatory networks, and model complex biological systems. However, these studies remain challenging in non-model species as a result of poor functional annotation of regulatory proteins, like kinases or phosphatases. To overcome this limitation, we developed a multi-layer neural network that annotates proteins by determining functionality directly from the protein sequence. We annotated the kinases and phosphatases in the non-model species, Glycine max (soybean), achieving a prediction sensitivity of up to 97%. To demonstrate the applicability, we used our functional annotations in combination with Bayesian network principles to predict signaling cascades using time series phosphoproteomics. We shed light on phosphorylation cascades in soybean seedlings upon cold treatment and identified Glyma.10G173000 (TOI5) and Glyma.19G007300 (TOT3) as key temperature response regulators in soybean. Importantly, the signaling cascade predictions do not rely upon known upstream kinases, kinase motifs, or protein interaction data, enabling de novo identification of kinase-substrate interactions. In addition to high accuracy and strong generalization, we showed that our functional prediction neural network is scalable to other model and non-model species, including Oryza sativa (rice), Zea mays (maize), Sorghum bicolor (sorghum), and Triticum aestivum (wheat). Overall, we demonstrated a data-driven systems biology approach for non-model species leveraging our predicted upstream kinases and phosphatases.
HostingRepository: PRIDE
AnnounceDate: 2024-10-22
AnnouncementXML: Submission_2024-10-22_05:47:56.471.xml
DigitalObjectIdentifier:
ReviewLevel: Peer-reviewed dataset
DatasetOrigin: Original dataset
RepositorySupport: Unsupported dataset by repository
PrimarySubmitter: Cassio Flavio Fonseca de Lima
SpeciesList: scientific name: Glycine max; NCBI TaxID: 3847
ModificationList: phosphorylated residue; acetylated residue; monohydroxylated residue; iodoacetamide derivatized residue
Instrument: Q Exactive
Dataset History
Revision  Datetime             Status        ChangeLog Entry
0         2022-10-20 02:23:48  ID requested
1         2023-06-13 02:15:50  announced
2         2024-10-22 05:47:56  announced     2024-10-22: Updated project metadata.
Publication List
Dataset with its publication pending
Keyword List
submitter keyword: Phosphoproteomics, Protein family, Soybean, Temperature
Contact List
Ive De Smet
contact affiliation: (1) Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium; (2) VIB Center for Plant Systems Biology, B-9052 Ghent, Belgium.
contact email: ivsme@psb.ugent.be
lab head
Cassio Flavio Fonseca de Lima
contact affiliation: PhD student
contact email: cafon@psb.ugent.be
dataset submitter
Full Dataset Link List
Dataset FTP location
NOTE: Most web browsers have discontinued native support for FTP access within the browser window. You can usually install a separate FTP app (we recommend FileZilla) and configure your browser to launch it when you click on this FTP link. Alternatively, launch an app that supports FTP (like FileZilla) and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2023/06/PXD037601
PRIDE project URI
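For scripted access, the FTP address given in the note above can also be read without a browser or FileZilla. The following is a minimal sketch using Python's standard `ftplib`; the helper names `split_ftp_url` and `list_dataset_files` are hypothetical, and it assumes the server accepts anonymous logins and that the directory is still available at this path.

```python
from ftplib import FTP
from urllib.parse import urlparse

# Address copied from the dataset record above.
FTP_URL = "ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2023/06/PXD037601"


def split_ftp_url(url):
    """Split an ftp:// URL into (host, directory path)."""
    parts = urlparse(url)
    if parts.scheme != "ftp":
        raise ValueError(f"not an FTP URL: {url}")
    return parts.hostname, parts.path


def list_dataset_files(url=FTP_URL, timeout=60):
    """Return the entry names in the dataset directory.

    Assumption: the server permits anonymous login.
    """
    host, path = split_ftp_url(url)
    with FTP(host, timeout=timeout) as ftp:
        ftp.login()    # no arguments means anonymous login
        ftp.cwd(path)  # change into the dataset directory
        return ftp.nlst()
```

Calling `list_dataset_files()` opens a network connection and returns the directory listing; individual files could then be fetched with `ftplib`'s `retrbinary`.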