
PXD040265

PXD040265 is an original dataset announced via ProteomeXchange.

Dataset Summary
Title: Using long-read CAGE sequencing to profile cryptic-promoter derived transcripts and their contribution to the immunopeptidome
Description: Recent studies have demonstrated that the non-coding genome can produce unannotated proteins as antigens that induce an immune response. One major source of this activity is the aberrant epigenetic reactivation of transposable elements (TEs). In tumors, TEs often provide cryptic or alternate promoters, which can generate transcripts that encode tumor-specific unannotated proteins. Thus, TE-derived transcripts have the potential to produce tumor-specific, but recurrent, antigens shared among many tumors. Identification of TE-derived tumor antigens holds the promise to improve cancer immunotherapy approaches; however, current genomics and computational tools are not optimized for their detection. Here we combined CAGE technology with full-length long-read transcriptome sequencing (Long-Read CAGE, or LRCAGE) and developed a suite of computational tools to significantly improve immunopeptidome detection by incorporating TE-derived and other tumor transcripts into the proteome database. By applying our methods to human lung cancer cell line H1299 data, we demonstrated that long-read technology significantly improves mapping of promoters with low mappability scores and LRCAGE guarantees accurate construction of uncharacterized 5’ transcript structure. Unannotated peptides predicted from newly characterized transcripts were readily detectable in whole cell lysate mass-spectrometry data. Incorporating unannotated peptides into the proteome database enabled us to detect non-canonical antigens in HLA-pulldown LC-MS/MS data. Finally, we showed that epigenetic treatment increased the number of non-canonical antigens, particularly those encoded by TE-derived transcripts, which might expand the pool of targetable antigens for cancers with low mutational burden.
HostingRepository: PRIDE
AnnounceDate: 2024-10-22
AnnouncementXML: Submission_2024-10-22_06:03:46.971.xml
DigitalObjectIdentifier:
ReviewLevel: Peer-reviewed dataset
DatasetOrigin: Original dataset
RepositorySupport: Unsupported dataset by repository
PrimarySubmitter: Ju Heon Maeng
SpeciesList: scientific name: Homo sapiens (Human); NCBI TaxID: 9606
ModificationList: No PTMs are included in the dataset
Instrument: Orbitrap Fusion Lumos
Dataset History
Revision   Datetime              Status         ChangeLog Entry
0          2023-02-20 05:00:22   ID requested
1          2023-09-23 08:45:07   announced
2          2024-10-22 06:03:47   announced      2024-10-22: Updated project metadata.
Publication List
Dataset with its publication pending
Keyword List
submitter keyword: antigens, immunotherapy, epigenetic treatment, transposable elements, long-read sequencing
Contact List
Ting Wang
contact affiliation: Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA; Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA; McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
contact email: twang@wustl.edu
lab head
Ju Heon Maeng
contact affiliation: Washington University in St. Louis
contact email: j.maeng@wustl.edu
dataset submitter
Full Dataset Link List
Dataset FTP location
NOTE: Most web browsers have discontinued native support for FTP access within the browser window. You can usually install a separate FTP application (we recommend FileZilla) and configure your browser to launch it when you click on this FTP link. Alternatively, open an application that supports FTP (such as FileZilla) and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2023/09/PXD040265
PRIDE project URI
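For scripted access, the FTP address given in the note above can also be opened without a browser or a graphical client. The following is a minimal sketch using Python's standard-library ftplib. It assumes the server accepts anonymous logins and that the directory path is still current; the function names split_ftp_url and list_dataset_files are illustrative and are not part of any PRIDE or ProteomeXchange tool.

```python
from ftplib import FTP
from urllib.parse import urlparse

# Address copied from the FTP note on this page.
DATASET_URL = "ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2023/09/PXD040265"


def split_ftp_url(url):
    """Split an ftp:// URL into (host, directory). Hypothetical helper."""
    parts = urlparse(url)
    if parts.scheme != "ftp" or not parts.hostname:
        raise ValueError(f"not an FTP URL: {url}")
    return parts.hostname, parts.path or "/"


def list_dataset_files(url=DATASET_URL, timeout=30):
    """Return the file names in the dataset directory.

    Assumes anonymous FTP login is permitted by the server.
    """
    host, directory = split_ftp_url(url)
    with FTP(host, timeout=timeout) as ftp:
        ftp.login()          # no arguments: anonymous login
        ftp.cwd(directory)   # move into the dataset directory
        return ftp.nlst()    # plain list of entry names


if __name__ == "__main__":
    for name in list_dataset_files():
        print(name)
```

Running the script prints one entry name per line. Individual files could then be fetched with ftplib's retrbinary, which is omitted here because raw mass-spectrometry files can be large.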
Repository Record List