<<< Full experiment listing

PXD012700

PXD012700 is an original dataset announced via ProteomeXchange.

Dataset Summary
TitleBolt: A new age peptide search engine for comprehensive MS/MS sequencing through vast protein databases in minutes
DescriptionThe standard platform for proteomics experiments today is mass spectrometry, particularly for samples derived from complex matrices. Recent increases in mass spectrometry sequencing speed, sensitivity and resolution now permit comprehensive coverage of even the most precious and limited samples, particularly when coupled with improvements in protein extraction techniques and chromatographic separation. However, the results obtained from laborious sample extraction and expensive instrumentation are often hindered by a sub optimal data processing pipelines. One critical data processing piece is peptide sequencing which is most commonly done through database search engines. In almost all MS/MS search engines users must limit their search space due to time constraints and q-value considerations. In nearly all experiments, the search is limited to a canonical database that typically does not reflect the individual genetic variations of the organism being studied. Searching for posttranslational modifications can exponentially increase the search space thus careful consideration must be used during the selection process. In addition, engines will nearly always assume the presence of only fully tryptic peptides. Despite these stringent parameters, proteomic data searches may take hours or even days to complete and opening even one of these criteria to more realistic biological settings will lead to detrimental increases in search time on expensive and custom data processing towers. Even on high performance servers, these search engines are computationally expensive, and most users decide to dial back their search parameters. We present Bolt, a new search engine that can search more than nine hundred thousand protein sequences (canonical, isoform, mutations, and contaminants) with 31 post translation modifications and N-terminal and C-terminal partial tryptic search in a matter of minutes on a standard configuration laptop. Along with increases in speed, Bolt provides an additional benefit of improvement in high confidence identifications, as demonstrated by manual validation of unique peptides identified by Bolt that were missed with parallel searching using standard engines. When in disagreement, 67% of peptides identified by Bolt may be manually validated by strong fragmentation patterns, compared to 14% of peptides uniquely identified by SEQUEST. Bolt represents, to the best of our knowledge, the first fully scalable, cloud based quantitative proteomic solution that can be operated within a user-friendly GUI interface.
HostingRepositoryPRIDE
AnnounceDate2019-08-29
AnnouncementXMLSubmission_2019-08-29_06:16:36.xml
DigitalObjectIdentifier
ReviewLevelPeer-reviewed dataset
DatasetOriginOriginal dataset
RepositorySupportUnsupported dataset by repository
PrimarySubmitterAmol Prakash
SpeciesList scientific name: Homo sapiens (Human); NCBI TaxID: 9606;
ModificationListcarbamoylated residue; phosphorylated residue; acetylated residue; formylated residue; deamidated residue
InstrumentLTQ Orbitrap Elite
Dataset History
RevisionDatetimeStatusChangeLog Entry
02019-02-14 02:00:23ID requested
12019-08-29 06:16:38announced
Publication List
Prakash A, Ahmad S, Majumder S, Jenkins C, Orsburn B, Bolt: a New Age Peptide Search Engine for Comprehensive MS/MS Sequencing Through Vast Protein Databases in Minutes. J Am Soc Mass Spectrom, 30(11):2408-2418(2019) [pubmed]
Keyword List
submitter keyword: Bolt, Hela, Mutations, SNP
Contact List
Amol Prakash
contact affiliationOptys Tech Corporation
contact emailamol.prakash@optystech.com
lab head
Amol Prakash
contact affiliationOptys Tech Corporation
contact emailamol.prakash@optystech.com
dataset submitter
Full Dataset Link List
Dataset FTP location
PRIDE project URI
Repository Record List
[+]