PXD025142 is an
original dataset announced via ProteomeXchange.
Dataset Summary
Title | Aird: A computation-oriented mass spectrometry data format enables a higher compression ratio and less decoding time |
Description | We describe "Aird", an opensource and computation-oriented format with controllable precision, flexible indexing strategies, and high compression rate. Aird provides a novel compressor called Zlib-Diff-PforDelta (ZDPD) for m/z data. Compared with Zlib only, m/z data size is about 55% lower in Aird on average. With the high-speed decoding and encoding performance brought by the Single Instruction Multiple Data(SIMD) technology used in the ZDPD, Aird merely takes 33% decoding time compared with Zlib. We used the open dataset HYE, which contains 48 raw files from SCIEX TripleTOF 5600 and TripleTOF6600. The total file size is 206GB as the vendor format. The total size increases to 854GB after converting to mzML with 32-bit encoding precision. While it takes only 189GB when using Aird. Aird uses JavaScript Object Notation (JSON) for metadata storage. Aird-SDK is written in Java and AirdPro is a GUI client for vendor file converting which is written in C#. They are freely available at https://github.com/CSi-Studio/Aird-SDK and https://github.com/CSi-Studio/AirdPro. |
HostingRepository | PRIDE |
AnnounceDate | 2021-04-12 |
AnnouncementXML | Submission_2021-04-12_07:59:57.765.xml |
DigitalObjectIdentifier | |
ReviewLevel | Peer-reviewed dataset |
DatasetOrigin | Original dataset |
RepositorySupport | Unsupported dataset by repository |
PrimarySubmitter | cong xie |
SpeciesList | scientific name: Homo sapiens (Human); NCBI TaxID: 9606; |
ModificationList | No PTMs are included in the dataset |
Instrument | Q Exactive HF; Q Exactive |
Dataset History
Revision | Datetime | Status | ChangeLog Entry |
0 | 2021-04-01 02:53:18 | ID requested | |
⏵ 1 | 2021-04-12 07:59:58 | announced | |
Publication List
Dataset with its publication pending |
Keyword List
submitter keyword: Aird, DIA, DDA, PRM, Proteomics, Metabolomics,Compressor |
Contact List
Miaoshan Lu |
contact affiliation | Westlake University |
contact email | lumiaoshan@westlake.edu.cn |
lab head | |
cong xie |
contact affiliation | CSi |
contact email | 569130520@qq.com |
dataset submitter | |
Full Dataset Link List
Dataset FTP location
NOTE: Most web browsers have now discontinued native support for FTP access within the browser window. But you can usually install another FTP app (we recommend FileZilla) and configure your browser to launch the external application when you click on this FTP link. Or otherwise, launch an app that supports FTP (like FileZilla) and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2021/04/PXD025142 |
PRIDE project URI |
Repository Record List
[ + ]
[ - ]
- PRIDE
- PXD025142
- Label: PRIDE project
- Name: Aird: A computation-oriented mass spectrometry data format enables a higher compression ratio and less decoding time