
PXD067277

PXD067277 is an original dataset announced via ProteomeXchange.

Dataset Summary
Title: Enhancing Peptide Identification in Metaproteomics through Curriculum Learning in Deep Learning
Description: Metaproteomics offers a powerful window into the active functions of microbial communities, but accurately identifying peptides remains challenging due to the size and incompleteness of protein databases derived from metagenomes. These databases often contain vastly more sequences than those from single organisms, creating a computational bottleneck in peptide-spectrum match (PSM) filtering. Here we present WinnowNet, a deep learning–based method for PSM filtering, available in two versions: one using transformers and the other using convolutional neural networks. Both variants are designed to handle the unordered nature of PSM data and are trained using a curriculum learning strategy that moves from simple to complex examples. WinnowNet consistently achieves more true identifications at equivalent false discovery rates than leading tools, including Percolator, MS$^2$Rescore, and DeepFilter, and outperforms filters integrated into popular analysis pipelines. It also uncovers more gut microbiome biomarkers related to diet and health, highlighting its potential to support advances in personalized medicine.
HostingRepository: PRIDE
AnnounceDate: 2025-10-20
AnnouncementXML: Submission_2025-10-19_16:21:18.573.xml
DigitalObjectIdentifier:
ReviewLevel: Peer-reviewed dataset
DatasetOrigin: Original dataset
RepositorySupport: Unsupported dataset by repository
PrimarySubmitter: Shichao Feng
SpeciesList: scientific name: marine metagenome; NCBI TaxID: NEWT:408172; scientific name: soil metagenome; NCBI TaxID: NEWT:410658; scientific name: human gut metagenome; NCBI TaxID: NEWT:408170
ModificationList: No PTMs are included in the dataset
Instrument: Orbitrap Fusion Lumos; LTQ Orbitrap Elite
Dataset History
Revision  Datetime             Status        ChangeLog Entry
0         2025-08-12 12:34:46  ID requested
1         2025-10-19 16:21:19  announced
Publication List
Feng S, Zhang B, Wang H, Xiong Y, Tian A, Yuan X, Pan C, Guo X. Enhancing peptide identification in metaproteomics through curriculum learning in deep learning. Nat Commun 16(1):8934 (2025). doi:10.1038/s41467-025-63977-z
Keyword List
submitter keyword: LC-MS/MS, marine, human gut, soil
Contact List
Xuan Guo
contact affiliation: Department of Computer Science and Engineering, University of North Texas
contact email: xuan.guo@unt.edu
role: lab head
Shichao Feng
contact affiliation: University of North Texas
contact email: fengfeng@my.unt.edu
role: dataset submitter
Full Dataset Link List
Dataset FTP location
NOTE: Most web browsers no longer support FTP natively. To follow this link, install an FTP client (we recommend FileZilla) and configure your browser to launch it when you click an FTP link, or open the client directly and enter this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2025/10/PXD067277
PRIDE project URI
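As an alternative to a graphical FTP client, the dataset directory can also be listed from a script. The following is a minimal sketch using Python's standard `ftplib`. It assumes the server accepts anonymous logins and that the directory is still at the address given in the note above. The function names `split_ftp_url` and `list_dataset_files` are hypothetical helpers introduced only for this illustration.

```python
from ftplib import FTP
from urllib.parse import urlparse

# Address taken from the FTP note in this record.
FTP_URL = "ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2025/10/PXD067277"


def split_ftp_url(url):
    """Split an ftp:// URL into (host, directory path)."""
    parts = urlparse(url)
    if parts.scheme != "ftp":
        raise ValueError(f"not an FTP URL: {url}")
    return parts.hostname, parts.path


def list_dataset_files(url=FTP_URL, timeout=30):
    """Return the entry names in the dataset directory (needs network access)."""
    host, path = split_ftp_url(url)
    with FTP(host, timeout=timeout) as ftp:
        ftp.login()    # assumption: anonymous login is accepted
        ftp.cwd(path)  # change into the dataset directory
        return ftp.nlst()

# Example call (performs a network request):
#   for name in list_dataset_files():
#       print(name)
```

Individual files could then be fetched with `FTP.retrbinary`; that step is omitted here because the file names in the directory are not listed in this record.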