PXD018851-1

PXD018851 is an original dataset announced via ProteomeXchange.

Dataset Summary

Title	CCPRD: A novel analytical framework for comprehensive proteomic reference database construction of non-model organisms
Description	Protein reference databases are a critical part of producing efficient proteomic analyses. However, a method for constructing clean, efficient, and comprehensive protein reference databases is lacking. Existing methods either do not have contamination control procedures, or rely on three-frame and/or six-frame translation, which sharply increases the search space and harms MS results. Herein we propose a framework for constructing a customized comprehensive proteomic reference database (CCPRD) from draft genomes and deep sequencing transcriptomes. Its effectiveness is demonstrated by incorporating the proteomes of nematocysts from endoparasitic cnidarians (myxozoans). By applying customized contamination removal procedures, contaminations in omic data were successfully identified and removed. The method is effective and does not result in over-decontamination, as shown by comparing the CCPRD MS results with those from an artificially contaminated database and from another database in which the contaminations removed from the genomes and transcriptomes were added back. CCPRD outperformed traditional frame-based methods by identifying 35.2%-50.7% more peptides and 35.8%-43.8% more proteins, with a maximum size reduction of 84.6%. A BUSCO analysis showed that the CCPRD maintained a relatively high level of completeness compared to traditional methods. These results confirm the superiority of the CCPRD over existing methods in peptide and protein identification numbers, database size, and completeness. By providing a general framework for generating the reference database, the CCPRD, which does not need a high-quality genome, can potentially be applied to any organism and contribute significantly to proteomic research.
HostingRepository	PRIDE
AnnounceDate	2020-07-16
AnnouncementXML	Submission_2020-07-16_02:42:20.xml
DigitalObjectIdentifier
ReviewLevel	Peer-reviewed dataset
DatasetOrigin	Original dataset
RepositorySupport	Unsupported dataset by repository
PrimarySubmitter	Qingxiang Guo
SpeciesList	scientific name: Thelohanellus kitauei; NCBI TaxID: 669202; scientific name: Myxobolus honghuensis; NCBI TaxID: 1085956; scientific name: Myxobolus wulii; NCBI TaxID: 649408;
ModificationList	acetylated residue; monohydroxylated residue
Instrument	Q Exactive HF

Dataset History

Revision	Datetime	Status	ChangeLog Entry
0	2020-04-28 02:59:55	ID requested
1	2020-07-16 02:42:21	announced
2	2024-10-22 05:09:17	announced	2024-10-22: Updated project metadata.

Publication List

Guo Q, Li D, Zhai Y, Gu Z, CCPRD: A Novel Analytical Framework for the Comprehensive Proteomic Reference Database Construction of NonModel Organisms. ACS Omega, 5(25):15370-15384(2020) [pubmed]

Keyword List

submitter keyword: CCPRD, proteomics, reference database, protein identification, myxozoans, nematocysts

Contact List

Zemao Gu
contact affiliation	College of Fisheries, Huazhong Agricultural University
contact email	guzemao@mail.hzau.edu.cn
lab head
Qingxiang Guo
contact affiliation	Huazhong Agricultural University
contact email	guoqing@webmail.hzau.edu.cn
dataset submitter

Full Dataset Link List

Dataset FTP location NOTE: Most web browsers have now discontinued native support for FTP access within the browser window. But you can usually install another FTP app (we recommend FileZilla) and configure your browser to launch the external application when you click on this FTP link. Or otherwise, launch an app that supports FTP (like FileZilla) and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2020/07/PXD018851
PRIDE project URI
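
As a scripted alternative to a graphical FTP client, the directory at the address above can be listed with Python's standard library. This is only a minimal sketch: the function names are hypothetical, and it assumes the server accepts anonymous logins and that the path is still valid.

```python
from ftplib import FTP
from urllib.parse import urlparse

# FTP address as given in the dataset record.
DATASET_URL = "ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2020/07/PXD018851"


def split_ftp_url(url):
    """Split an ftp:// URL into (host, directory path)."""
    parts = urlparse(url)
    if parts.scheme != "ftp":
        raise ValueError(f"not an FTP URL: {url}")
    return parts.hostname, parts.path


def list_dataset_files(url=DATASET_URL, timeout=30):
    """Return the entry names in the dataset directory (needs network access)."""
    host, path = split_ftp_url(url)
    with FTP(host, timeout=timeout) as ftp:
        ftp.login()    # anonymous login; assumed to be permitted by the server
        ftp.cwd(path)  # change into the dataset directory
        return ftp.nlst()


if __name__ == "__main__":
    for name in list_dataset_files():
        print(name)
```

Running the script prints one entry name per line; individual files could then be fetched with ftplib's retrbinary method.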
