
PXD025813

PXD025813 is an original dataset announced via ProteomeXchange.

Dataset Summary
Title: Identification and analysis of small proteins and short open reading frame encoded peptides in Hep3B cell
Description: Small proteins and short open reading frame-encoded peptides (SEPs) are of fundamental importance because of their essential roles in biological processes. However, annotating or identifying them is challenging, in part owing to the limitations of the traditional genome annotation pipeline and to their inherently low abundance and low molecular weight. To discover and characterize SEPs in the Hep3B cell line, we developed an optimized peptidomic assay by combining different peptide extraction and separation methods. Organic solvent precipitation improved the enrichment of low-molecular-weight proteins and peptides, and the data clearly showed a beneficial effect from the reduction of sample complexity, resulting in high-quality MS/MS spectra. Furthermore, the different strategies were complementary, increasing both the total number of small proteins identified and their sequence coverage. In total, 1192 proteins of fewer than 100 amino acids were identified, including 271 newly discovered SEPs that have been annotated in the OpenProt database, 147 of which are encoded by ncRNA or lincRNA. These results provide robust evidence that the human proteome is more complex than previously appreciated, which will benefit the discovery of proteins without functional annotation. SIGNIFICANCE: In this work, methods were optimized to identify SEPs in Hep3B cells. Organic solvent precipitation improved the enrichment of low-molecular-weight proteins and peptides, and the data clearly showed a beneficial effect from the reduction of sample complexity, resulting in high-quality MS/MS spectra. The different strategies were complementary, increasing both the total number of small proteins identified and their sequence coverage. In total, 1192 proteins of fewer than 100 amino acids were identified, including 271 newly discovered SEPs that have been annotated in the OpenProt database, 147 of which are encoded by ncRNA or lincRNA. Furthermore, 22 SEPs generated from uORFs may have a potential effect on translational control, and 149 newly identified SEPs have known functional domains or cross-species conservation. These results present robust evidence for the coding potential of overlooked regions of the human genome and may provide additional insights into tumor biology.
HostingRepository: iProX
AnnounceDate: 2021-05-05
AnnouncementXML: Submission_2021-05-05_21:06:01.943.xml
DigitalObjectIdentifier:
ReviewLevel: Peer-reviewed dataset
DatasetOrigin: Original dataset
RepositorySupport: Unsupported dataset by repository
PrimarySubmitter: Bing Wang
SpeciesList: scientific name: Homo sapiens; NCBI TaxID: 9606
ModificationList: No PTMs are included in the dataset
Instrument: Q Exactive
Dataset History
Revision  Datetime             Status        ChangeLog Entry
0         2021-05-05 21:04:57  ID requested
1         2021-05-05 21:06:03  announced
Publication List
Wang B, Hao J, Pan N, Wang Z, Chen Y, Wan C. Identification and analysis of small proteins and short open reading frame encoded peptides in Hep3B cell. J Proteomics, 230:103965 (2021)
Keyword List
submitter keyword: Acetonitrile precipitation, Hep3B cell line, Peptidomic, SEP enrichment, Short open reading frames, sORF-encoded peptides
Contact List
Cuihong Wan
contact affiliation: Central China Normal University
contact email: ch_wan@mail.ccnu.edu.cn
lab head
Bing Wang
contact affiliation: Central China Normal University
contact email: 635819601@qq.com
dataset submitter
Full Dataset Link List
iProX dataset URI