PXD000131-3
PXD000131 is an original dataset announced via ProteomeXchange.
Dataset Summary
Title | Integrative genome, transcriptome and proteome analysis of rat livers from two different genetic backgrounds |
Description | In this project, we aim to analyze, pairwise, the genomes, transcriptomes and proteomes of inbred rats originating from two different genetic backgrounds. The two strains are Brown Norway (BN-Lx) and Spontaneously Hypertensive Rats (SHR). First, we re-sequenced the genomes of both BN and SHR rats, followed by RNA-seq and proteomics of their liver tissues. We then appended novel predicted gene models, non-synonymous SNPs and INDELs (derived from genome re-sequencing), as well as transcript variants such as RNA editing and alternative splicing (derived from RNA-seq) that can diversify existing protein sequences, to the ENSEMBL rat FASTA (Build 68) to build an enhanced database. For the proteomics studies, equal amounts of liver lysate were digested with trypsin, LysC, GluC, AspN and chymotrypsin, and each digest was fractionated individually by strong cation exchange chromatography. Doubly and triply charged fractions were analyzed on a TripleTOF 5600 with collision-activated dissociation (CAD), while electron-transfer dissociation (ETD) on an LTQ-Orbitrap Velos was applied to fractions containing triple charges and above. Data analysis: Peak list generation: For Wiff files generated on the TripleTOF 5600, tandem MS spectra were de-isotoped and charge-deconvoluted, and peak lists were converted to Mascot generic format (MGF) files using AB Sciex Data Converter (version 1.1). For data generated on the LTQ-Orbitrap Velos, Raw files were converted to MGF files using Proteome Discoverer (version 1.3). The non-fragment filter was used to simplify ETD spectra and the Top N filter for the HCD spectra. Three MGF files were generated (one for HCD, one for ETD IT and one for ETD FT). The files with an Orbitrap readout were de-isotoped and charge-deconvoluted. Database searching: All MGF files were queried with the Mascot search engine (version 2.3) via Proteome Discoverer version 1.3 (PD 1.3, Thermo Fisher) for submission. 
The spectra were searched against an in-house database (NGS_COMBINED). One of the five enzyme settings (Trypsin/P, LysC/P, Chymotrypsin, GluC-DE and AspN_ambic) was selected for each file, and up to 9 missed cleavages were allowed. Cysteine carbamidomethylation was set as a fixed modification, and oxidation of methionine and acetylation of the N-terminus as variable modifications. Peptide tolerance was initially set to 50 ppm, and the MS/MS tolerance was set to 0.1 Da (TOF readout), 0.02 Da (Orbitrap readout) or 0.5 Da (ion trap readout). All peptide-spectrum matches (PSMs) were evaluated with Percolator for validation. We classified each PSM based on its q value. For protein identification, we set a high-stringency filter of q = 0 (0% FDR). Peak lists that did not yield any peptide matches were exported with PD 1.3 for further analysis. De novo search with PEAKS: The exported unassigned peak lists were re-analyzed with another software suite, PEAKS Studio (version 6.0). The identification workflow was as follows. Peak lists were first filtered with a quality value of 0.65, as suggested by the manufacturer, followed by de novo spectrum interpretation. In this step, both peptide tolerance and MS/MS tolerance were set as in the Mascot search. To broaden the search space for these unassigned spectra, we additionally set deamidation of asparagine and glutamine, and pyro-Glu from glutamic acid and glutamine, as variable modifications, on top of the modifications indicated above. The maximum number of variable PTMs allowed per peptide was set to 3. Finally, de novo interpreted PSMs were submitted to PEAKS DB database matching, this time allowing semi-enzymatic specificity and a maximum of 2 cleavages per peptide. The database was set to NGS_COMBINED. FDR was estimated using decoy fusion. The genomics and transcriptomics data have already been deposited in the respective EBI repositories. Some of these data are derived from an already published manuscript. 
Genomics data (from "Genetic basis of transcriptome differences between the founder strains of the rat HXB/BXH recombinant inbred panel" by Simonis et al., PMID:22541052): DNA data in the Sequence Read Archive (SRA): BN-Lx genome: ERP001355, http://www.ebi.ac.uk/ena/data/view/ERP001355; SHR genome: ERP001371; BN reference genome: ERP000510, http://www.ebi.ac.uk/ena/data/view/ERP000510. RNA data in ArrayExpress: BN-Lx and SHR fragment RNA-seq data: E-MTAB-1029, http://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-1029; BN-Lx and SHR paired-end RNA-seq data: to be submitted. |
HostingRepository | PRIDE |
AnnounceDate | 2014-07-24 |
AnnouncementXML | Submission_2014-07-24_03:50:09.xml |
DigitalObjectIdentifier | |
ReviewLevel | Peer-reviewed dataset |
DatasetOrigin | Original dataset |
RepositorySupport | Unsupported dataset by repository |
PrimarySubmitter | Teck Yew Low |
SpeciesList | scientific name: Rattus norvegicus (Rat); NCBI TaxID: 10116; |
ModificationList | monohydroxylated residue; acetylated residue; iodoacetamide derivatized residue |
Instrument | LTQ Orbitrap Velos; TripleTOF 5600 |
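The Description quotes a precursor tolerance in ppm and fragment tolerances in Da that depend on the readout. As a rough illustration of what those settings mean, the sketch below checks whether an observed m/z falls within the stated windows. The function names and the readout keys are hypothetical, and this is not how Mascot or PEAKS implement matching internally; only the numeric tolerances come from the record.

```python
# Tolerances as quoted in the dataset Description.
PRECURSOR_TOL_PPM = 50.0
# Keys are illustrative labels for the three readouts (assumption).
FRAGMENT_TOL_DA = {"tof": 0.1, "orbitrap": 0.02, "ion_trap": 0.5}


def ppm_error(observed_mz: float, theoretical_mz: float) -> float:
    """Signed mass error of an observation, in parts per million."""
    return (observed_mz - theoretical_mz) / theoretical_mz * 1e6


def precursor_within_tolerance(observed_mz: float, theoretical_mz: float,
                               tol_ppm: float = PRECURSOR_TOL_PPM) -> bool:
    """True if the precursor m/z lies inside the relative (ppm) window."""
    return abs(ppm_error(observed_mz, theoretical_mz)) <= tol_ppm


def fragment_within_tolerance(observed_mz: float, theoretical_mz: float,
                              readout: str) -> bool:
    """True if the fragment m/z lies inside the absolute (Da) window
    for the given readout."""
    return abs(observed_mz - theoretical_mz) <= FRAGMENT_TOL_DA[readout]
```

For example, an observation 0.02 m/z units above a theoretical precursor at m/z 500 is off by about 40 ppm and passes the 50 ppm window, whereas a fragment that is 0.05 Da off passes the TOF and ion trap windows but not the Orbitrap one.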
Dataset History
Revision | Datetime | Status | ChangeLog Entry |
---|---|---|---|
0 | 2013-01-24 02:31:21 | ID requested | |
1 | 2013-11-29 08:25:07 | announced | |
2 | 2014-05-30 06:25:39 | announced | Updated publication reference for PubMed record(s): 24290761. |
⏵ 3 | 2014-07-24 03:50:10 | announced | Updated project metadata. |
Publication List
Low TY, van Heesch S, van den Toorn H, Giansanti P, Cristobal A, Toonen P, Schafer S, Hübner N, van Breukelen B, Mohammed S, Cuppen E, Heck AJ, Guryev V, Quantitative and qualitative proteome characteristics extracted from in-depth integrated genomics and proteomics analysis. Cell Rep, 5(5):1469-78(2013) [pubmed] |
Keyword List
ProteomeXchange project tag: PRIME-XS Project |
submitter keyword: Rat, Livers, LC-MSMS, Proteome |
Contact List
Teck Yew Low | |
---|---|
contact affiliation | Utrecht University |
contact email | lowteckyew@gmail.com |
dataset submitter |
Full Dataset Link List
Dataset FTP location: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2013/11/PXD000131 (NOTE: Most web browsers no longer support FTP natively. Open this address in an FTP client such as FileZilla, or configure your browser to launch one when you click an FTP link.) |
PRIDE project URI |
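Besides a graphical FTP client, the dataset directory can be listed from a script. The following is a minimal sketch using Python's standard `ftplib`. It assumes the server still accepts anonymous FTP logins and that the archive path is unchanged; the helper names `split_ftp_url` and `list_dataset_files` are hypothetical.

```python
from ftplib import FTP
from urllib.parse import urlparse

# FTP address as given in this record.
DATASET_URL = "ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2013/11/PXD000131"


def split_ftp_url(url: str) -> tuple[str, str]:
    """Split an ftp:// URL into (host, directory path)."""
    parsed = urlparse(url)
    if parsed.scheme != "ftp":
        raise ValueError(f"not an FTP URL: {url}")
    return parsed.hostname, parsed.path


def list_dataset_files(url: str = DATASET_URL, timeout: float = 30.0) -> list[str]:
    """Return the file names in the dataset directory.

    Assumes anonymous login is permitted (not verified here).
    """
    host, path = split_ftp_url(url)
    with FTP(host, timeout=timeout) as ftp:
        ftp.login()      # no arguments: anonymous login
        ftp.cwd(path)    # change into the dataset directory
        return ftp.nlst()
```

Calling `list_dataset_files()` needs network access; `split_ftp_url(DATASET_URL)` can be used offline to check how the address is decomposed.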