⮝ Full datasets listing

PXD023921

PXD023921 is an original dataset announced via ProteomeXchange.

Dataset Summary
TitleMulti-protease approach for the improved identification and molecular characterization of small proteins and short open reading frame-encoded peptides
DescriptionThe identification of proteins below 70 amino acids in bottom-up proteomics is still a challenging task due to the limited number of peptides generated by proteolytic digestion. This includes the short open reading frame-encoded peptides (SEP), which are a subset of the small proteins that were not previously annotated or that are alternatively encoded. Here, we systematically investigated the use of multiple proteases (trypsin, chymotrypsin, LysC, LysArgiNase and GluC) in GeLC-MS/MS analysis to improve the sequence coverage and the number of identified peptides for small proteins (<70 amino acids), with a focus on SEP, in the archaeon Methanosarcina mazei. Combining the data of all proteases, we identified 63 small proteins and additional 28 SEP with at least two unique peptides, while only 55 small proteins and 22 SEP could be identified using trypsin only. For 27 small proteins and 12 SEP, a 100 % sequence coverage could be achieved. Moreover, for five SEP, incorrectly predicted translation start points were identified, confirming the data of a previous top-down proteomics study of this organism. The results show clearly that a multi-protease approach can improve the identification and molecular characterization of small proteins and SEP.
HostingRepositoryPRIDE
AnnounceDate2021-04-01
AnnouncementXMLSubmission_2021-04-01_02:48:37.189.xml
DigitalObjectIdentifier
ReviewLevelPeer-reviewed dataset
DatasetOriginOriginal dataset
RepositorySupportUnsupported dataset by repository
PrimarySubmitterAndreas Tholey
SpeciesList scientific name: Methanosarcina mazei Go1; NCBI TaxID: 192952;
ModificationListNo PTMs are included in the dataset
InstrumentQ Exactive
Dataset History
RevisionDatetimeStatusChangeLog Entry
02021-02-01 06:50:00ID requested
12021-04-01 02:48:37announced
Publication List
Kaulich PT, Cassidy L, Bartel J, Schmitz RA, Tholey A, Multi-protease Approach for the Improved Identification and Molecular Characterization of Small Proteins and Short Open Reading Frame-Encoded Peptides. J Proteome Res, 20(5):2895-2903(2021) [pubmed]
Keyword List
submitter keyword: SEP, sORF, Peptidomics, LC-MSMS
Contact List
Andreas Tholey
contact affiliationSystematische Proteomics & Bioanalytik, Institut für Experimentelle Medizin Christian-Albrechts-Universität zu Kiel 24105 Kiel, Germany
contact emaila.tholey@iem.uni-kiel.de
lab head
Andreas Tholey
contact affiliationSystematic Proteome Research & Bioanalytics, University of Kiel
contact emaila.tholey@iem.uni-kiel.de
dataset submitter
Full Dataset Link List
Dataset FTP location
NOTE: Most web browsers have now discontinued native support for FTP access within the browser window. But you can usually install another FTP app (we recommend FileZilla) and configure your browser to launch the external application when you click on this FTP link. Or otherwise, launch an app that supports FTP (like FileZilla) and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2021/04/PXD023921
PRIDE project URI
Repository Record List
[ + ]