
PXD000477

PXD000477 is an original dataset announced via ProteomeXchange.

Dataset Summary
Title: Mspire-Simulator: LC-MS shotgun proteomic simulator for creating realistic gold standard data - HEK cell lysate
Description: The most important step in any quantitative proteomic pipeline is feature detection (aka peak picking). However, generating quality hand-annotated data sets to validate the algorithms, especially for lower abundance peaks, is nearly impossible. An alternative for creating gold standard data is to simulate it with features closely mimicking real data. We present Mspire-Simulator, a free, open-source shotgun proteomic simulator that goes beyond previous simulation attempts by generating LC-MS features with realistic m/z and intensity variance along with other noise components. It also includes machine-learned models for retention time and peak intensity prediction and a genetic algorithm to custom fit model parameters for experimental data sets. We show that these methods are applicable to data from three different mass spectrometers, including two fundamentally different types, and show visually and analytically that simulated peaks are nearly indistinguishable from actual data. Researchers can use simulated data to rigorously test quantitation software, and proteomic researchers may benefit from overlaying simulated data on actual data sets. While not directly relevant in this case, a search was conducted in Proteome Discoverer v1.4 using both Mascot and Sequest HT. The parameters were up to 2 missed cleavages with trypsin, carbamidomethylation of cysteine residues, phosphorylation of S/T/Y residues, oxidation of H/W residues, and a precursor mass tolerance of 10 ppm.
HostingRepository: PRIDE
AnnounceDate: 2020-01-24
AnnouncementXML: Submission_2020-01-24_05:21:16.xml
DigitalObjectIdentifier:
ReviewLevel: Peer-reviewed dataset
DatasetOrigin: Original dataset
RepositorySupport: Unsupported dataset by repository
PrimarySubmitter: Ryan Taylor
SpeciesList: scientific name: Homo sapiens (Human); NCBI TaxID: 9606
ModificationList: monohydroxylated residue; iodoacetamide derivatized residue
Instrument: LTQ Orbitrap
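The description above quotes a precursor mass tolerance of 10 ppm. As a rough illustration of what that means in m/z units, here is a minimal Python sketch. It assumes the tolerance is symmetric (plus or minus 10 ppm) and that the error is measured relative to the theoretical m/z, which is a common convention but is not stated in this record. The function names `ppm_window` and `within_tolerance` are hypothetical and are not part of Proteome Discoverer, Mascot, or Sequest HT.

```python
def ppm_window(theoretical_mz, tolerance_ppm=10.0):
    """Return the (low, high) m/z bounds for a symmetric ppm tolerance.

    Assumption: the tolerance applies as +/- tolerance_ppm around the
    theoretical m/z; the dataset record does not spell this out.
    """
    delta = theoretical_mz * tolerance_ppm / 1e6
    return theoretical_mz - delta, theoretical_mz + delta


def within_tolerance(observed_mz, theoretical_mz, tolerance_ppm=10.0):
    """Check whether an observed m/z lies within the ppm tolerance."""
    # Relative mass error expressed in parts per million.
    error_ppm = (observed_mz - theoretical_mz) / theoretical_mz * 1e6
    return abs(error_ppm) <= tolerance_ppm
```

For example, at a theoretical m/z of 1000 a 10 ppm tolerance corresponds to about 0.01 m/z units on either side, so the window narrows in absolute terms for lighter precursors and widens for heavier ones.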
Dataset History
Revision  Datetime             Status        ChangeLog Entry
0         2013-09-23 01:55:39  ID requested
1         2020-01-24 05:21:17  announced
Publication List
Noyce AB, Smith R, Dalgleish J, Taylor RM, Erb KC, Okuda N, Prince JT, Mspire-Simulator: LC-MS shotgun proteomic simulator for creating realistic gold standard data. J Proteome Res, 12(12):5742-9(2013) [pubmed]
Keyword List
submitter keyword: Human, HEK-293T, Ms-Simulator, FASP
Contact List
John T. Prince
contact affiliation: Department of Biochemistry, Brigham Young University, 701 East University Parkway, BNSN C100, Provo, Utah 84602, United States
contact email: jtprince@gmail.com
lab head
Ryan Taylor
contact affiliation: Chemistry and Biochemistry
contact email: ryanmt@byu.net
dataset submitter
Full Dataset Link List
Dataset FTP location
NOTE: Most web browsers no longer support FTP access natively. To follow this link, either install an FTP client (we recommend FileZilla) and configure your browser to launch it when you click an FTP link, or open the client directly and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2020/01/PXD000477
PRIDE project URI
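Because browsers no longer open FTP links, the dataset directory can also be reached from a script. The following is a minimal sketch using Python's standard `ftplib`. It assumes the server accepts anonymous logins and that the directory still exists at the address given in this record; neither has been verified here. The helper names `split_ftp_url` and `list_dataset_files` are hypothetical.

```python
from ftplib import FTP
from urllib.parse import urlparse

# FTP address copied from the dataset record above.
FTP_URL = "ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2020/01/PXD000477"


def split_ftp_url(url):
    """Split an ftp:// URL into (host, directory path)."""
    parsed = urlparse(url)
    if parsed.scheme != "ftp":
        raise ValueError(f"not an FTP URL: {url}")
    return parsed.hostname, parsed.path


def list_dataset_files(url, timeout=30):
    """Return the entry names in the dataset directory (needs network access)."""
    host, path = split_ftp_url(url)
    with FTP(host, timeout=timeout) as ftp:
        ftp.login()    # anonymous login; assumed to be accepted by the server
        ftp.cwd(path)  # change into the dataset directory
        return ftp.nlst()
```

Calling `list_dataset_files(FTP_URL)` should return the file names in the directory if the assumptions hold; individual files could then be fetched with `FTP.retrbinary`.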
Repository Record List