<<< Full experiment listing

PXD025310

PXD025310 is an original dataset announced via ProteomeXchange.

Dataset Summary
TitleAird: A computation-oriented mass spectrometry data format enables a higher compression ratio and less decoding time
DescriptionWe describe "Aird", an opensource and computation-oriented format with controllable precision, flexible indexing strategies, and high compression rate. Aird provides a novel compressor called Zlib-Diff-PforDelta (ZDPD) for m/z data. Compared with Zlib only, m/z data size is about 55% lower in Aird on average. With the high-speed decoding and encoding performance brought by the Single Instruction Multiple Data(SIMD) technology used in the ZDPD, Aird merely takes 33% decoding time compared with Zlib. We used the open dataset HYE, which contains 48 raw files from SCIEX TripleTOF 5600 and TripleTOF6600. The total file size is 206GB as the vendor format. The total size increases to 854GB after converting to mzML with 32-bit encoding precision. While it takes only 189GB when using Aird. Aird uses JavaScript Object Notation (JSON) for metadata storage. Aird-SDK is written in Java and AirdPro is a GUI client for vendor file converting which is written in C#. They are freely available at https://github.com/CSi-Studio/Aird-SDK and https://github.com/CSi-Studio/AirdPro
HostingRepositoryiProX
AnnounceDate2021-04-12
AnnouncementXMLSubmission_2023-08-28_00:34:30.019.xml
DigitalObjectIdentifier
ReviewLevelPeer-reviewed dataset
DatasetOriginOriginal dataset
RepositorySupportUnsupported dataset by repository
PrimarySubmitterXie Cong
SpeciesList scientific name: Homo sapiens; NCBI TaxID: 9606;
ModificationListNo PTMs are included in the dataset
InstrumentQ Exactive HF
Dataset History
RevisionDatetimeStatusChangeLog Entry
02021-04-11 20:54:53ID requested
12021-04-11 20:55:33announced
22023-08-28 00:34:30announced2023-08-28: Update publication information.
Publication List
Lu M, An S, Wang R, Wang J, Yu C, Aird: a computation-oriented mass spectrometry data format enables a higher compression ratio and less decoding time. BMC Bioinformatics, 23(1):35(2022) [pubmed]
Keyword List
submitter keyword: Aird, DIA, DDA, PRM, Proteomics, Metabolomics
Contact List
Miaoshan Lu
contact affiliationWestlake University
contact emaillumiaoshan@westlake.edu.cn
lab head
Xie Cong
contact affiliationCSi Biotech Limited Liability Company
contact email569130520@qq.com
dataset submitter
Full Dataset Link List
iProX dataset URI