PXD000131-3

PXD000131 is an original dataset announced via ProteomeXchange.

Dataset Summary

Title	Integrative genome, transcriptome and proteome analysis of rat livers from two different genetic backgrounds
Description	In this project, we aim to pair-wise analyze the genomes, transcriptomes and proteomes of in-bred rats originating from two different genetic backgrounds. These two strains are Brown Norway (BN-Lx) and Spontaneously Hypertensive Rats (SHR). First, we re-sequenced the genomes for both BN and SHR rats, followed by RNA-seq and proteomics of their liver tissues. We then append novel predicted gene models, non-synonymous SNPs and INDELs (derived from genome re-sequencing), as well as transcript variants such as RNA-editing and alternative splicing (derived from RNA-seq) that can diversify existing protein sequences onto the ENSEMBL rat FASTA (Build 68) to build an enhanced database. For proteomics studies, equal amount of liver lysates were digested with trypsin, LysC, GluC, AspN and chymotrypsin and were individually fractionated with strong cationic exchange chromatography. Doubly- and triply-charged fractions were analyzed with an Triple-TOF 5600 with collision-activated dissociation (CAD); while electron-transfer dissociation (ETD) was applied for fractions containing triple charges and above with a LTQ-Orbitrap Velos. Data analysis: Peak List generation: For Wiff files generated from TripleTOF 5600, tandem MS spectra were de-isotoped, charge- deconvoluted and peak lists converted to Mascot generic format (MGF) files using AB Sciex Data Converter (version 1.1). For data generated from the LTQ-Orbitrap Velos, Raw files were converted to MGF files using Proteome Discoverer (version 1.3). The non-fragment filter was used to simplify ETD spectra and the Top N filter for the HCD spectra. Three MGF files were generated (one for HCD, one for ETD IT and one for ETD FT). The files with an orbitrap readout were deisotoped and charge de-convoluted. Database Searching: All MGF files were queried with Mascot search engine (version 2.3) via Proteome Discoverer version 1.3 (PD 1.3, Thermo Fisher) for submission. The spectra were searched against in-house database (NGS_COMBINED). One of the five different enzymes used (Trypsin/P, LysC/P, Chymotrypsin, GluC-DE and AspN_ambic) were selected for each file and up to 9 missed cleavages were allowed. Cysteine carbamidomethylation was set as fixed modification, and oxidation of methionine and acetylation of the N-term as variable modifications. Peptide tolerance was initially set to 50 ppm and the MS/MS tolerance was set to 0.1 Da (for TOF readout), 0.02 Da (orbitrap readout) and 0.5 Da (ion trap readout). All peptide-spectrum matches (PSMs) were evaluated with Percolator for validation. We classified each PSM based on their q value. For proteins identification, we used set a high stringency filter of q = 0 (0% FDR). For peaks lists that do not yield any peptide matches, we exported them with PD 1.3 for further analysis. De novo search with PEAKS: Unassigned peak lists that are exported were re-analyzed with another software suite i.e. PEAKS Studio (version 6.0). The identification workflows is as follows. Peak lists were first filtered with a quality value of 0.65 as suggested by the manufacturer followed by de novo spectra interpretation. In this step, both peptide tolerance and MS/MS tolerance were set according to MASCOT search. To broaden the search space for these unassigned spectra, we additionally set de-amidation of asparagine and glutamine, and pyro-glu from glutamic acid and glutamine as variable modifications, on top of the other modifications indicated above. Maximum allowed variable PTM per peptide was set to 3. Finally de novo interpreted PSMs were submitted to PEAKS DB database matching, this time allowing semi-enzymatic specificity and a maximum cleavages per peptide of 2. Database used was set to NGS_COMBINED. FDR was estimated using decoy-fusion. The genomics and transcriptomics data are already deposited in the respective EBI repositories. Some of these data are derived from an already published manuscript. For the genomics data (from: Genetic basis of transcriptome differences between the founder strains of the rat HXB/BXH recombinant inbred panel by Simonis et al PMID:22541052) DNA data in Sequence Read Archive (SRA): BN-Lx genome: ERP001355 http://www.ebi.ac.uk/ena/data/view/ERP001355, SHR genome: ERP001371, BN reference genome: ERP000510, http://www.ebi.ac.uk/ena/data/view/ERP000510. RNA data in ArrayExpress: BN-Lx and SHR fragment RNA-seq data: E-MTAB-1029 http://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-1029, BN-Lx and SHR paired-end RNA-seq data: to be submitted.
HostingRepository	PRIDE
AnnounceDate	2014-07-24
AnnouncementXML	Submission_2014-07-24_03:50:09.xml
DigitalObjectIdentifier
ReviewLevel	Peer-reviewed dataset
DatasetOrigin	Original dataset
RepositorySupport	Unsupported dataset by repository
PrimarySubmitter	Teck Yew Low
SpeciesList	scientific name: Rattus norvegicus (Rat); NCBI TaxID: 10116;
ModificationList	monohydroxylated residue; acetylated residue; iodoacetamide derivatized residue
Instrument	LTQ Orbitrap Velos; TripleTOF 5600

Dataset History

Revision	Datetime	Status	ChangeLog Entry
0	2013-01-24 02:31:21	ID requested
1	2013-11-29 08:25:07	announced
2	2014-05-30 06:25:39	announced	Updated publication reference for PubMed record(s): 24290761.
⏵ 3	2014-07-24 03:50:10	announced	Updated project metadata.

Publication List

Low TY, van Heesch S, van den Toorn H, Giansanti P, Cristobal A, Toonen P, Schafer S, H, ü, bner N, van Breukelen B, Mohammed S, Cuppen E, Heck AJ, Guryev V, Quantitative and qualitative proteome characteristics extracted from in-depth integrated genomics and proteomics analysis. Cell Rep, 5(5):1469-78(2013) [pubmed]

Low TY, van Heesch S, van den Toorn H, Giansanti P, Cristobal A, Toonen P, Schafer S, H, ü, bner N, van Breukelen B, Mohammed S, Cuppen E, Heck AJ, Guryev V, Quantitative and qualitative proteome characteristics extracted from in-depth integrated genomics and proteomics analysis. Cell Rep, 5(5):1469-78(2013) [pubmed]

Keyword List

ProteomeXchange project tag: PRIME-XS Project
submitter keyword: Rat, Livers, LC-MSMS, Proteome

ProteomeXchange project tag: PRIME-XS Project

submitter keyword: Rat, Livers, LC-MSMS, Proteome

Contact List

Teck Yew Low
contact affiliation	Utrecht Univeristy
contact email	lowteckyew@gmail.com
dataset submitter

Full Dataset Link List

Dataset FTP location NOTE: Most web browsers have now discontinued native support for FTP access within the browser window. But you can usually install another FTP app (we recommend FileZilla) and configure your browser to launch the external application when you click on this FTP link. Or otherwise, launch an app that supports FTP (like FileZilla) and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2013/11/PXD000131
PRIDE project URI

Dataset FTP location
NOTE: Most web browsers have now discontinued native support for FTP access within the browser window. But you can usually install another FTP app (we recommend FileZilla) and configure your browser to launch the external application when you click on this FTP link. Or otherwise, launch an app that supports FTP (like FileZilla) and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2013/11/PXD000131

PRIDE project URI

Repository Record List

[ + ]