
PXD063526-1

PXD063526 is an original dataset announced via ProteomeXchange.

Dataset Summary
Title: De Novo Sequencing of Polyclonal Antibodies by Integrating Intact Mass, Top-Down and Bottom-Up Mass Spectrometry
Description: Antibodies are specialized proteins produced by the adaptive immune system to identify and neutralize harmful antigens. Antibody-based diagnostics and therapeutics have advanced rapidly, with recombinant antibodies derived from monoclonal antibodies (mAbs) becoming the preferred standard due to their high reproducibility and effectiveness. However, mAbs are limited to binding a single antigen epitope. Polyclonal antibodies (pAbs), on the other hand, are produced by multiple B cells and can target multiple epitopes, making them more robust against antigen variations. Despite their advantages, pAbs are notoriously challenging to sequence due to their high complexity. Current de novo sequencing methods based on mass spectrometry have shown limitations when applied to pAb mixtures, often requiring germline databases or yielding incomplete or inaccurate sequences. Here we propose PolySeq, a fully de novo sequencing workflow that integrates bottom-up, top-down, and intact mass spectrometry data to accurately sequence pAb samples without relying on external databases. Our workflow automates the entire sequencing process from peptides to full proteins, achieving complete and accurate antibody profiles. Evaluation results on a mixture of four known mAbs and a pAb sample derived from mouse myeloma demonstrate that our de novo antibody sequences achieved up to 100% coverage and accuracy, with strong supporting evidence from bottom-up, top-down, and intact mass data. Furthermore, antibodies recombinantly expressed using de novo sequencing results showed expected properties, such as binding affinity and antigen neutralization, thus confirming the accuracy and efficacy of our pAb de novo sequencing workflow.
HostingRepository: PRIDE
AnnounceDate: 2025-10-31
AnnouncementXML: Submission_2025-10-31_01:26:01.109.xml
DigitalObjectIdentifier:
ReviewLevel: Peer-reviewed dataset
DatasetOrigin: Original dataset
RepositorySupport: Unsupported dataset by repository
PrimarySubmitter: Ngoc Hieu Tran
SpeciesList: scientific name: Homo sapiens (Human); NCBI TaxID: NEWT:9606;
ModificationList: monohydroxylated residue; deamidated residue
Instrument: Orbitrap Eclipse
Dataset History
Revision  Datetime             Status        ChangeLog Entry
0         2025-05-01 01:49:56  ID requested
1         2025-10-31 01:26:01  announced
Publication List
10.1016/J.MCPRO.2025.101088;
Keyword List
submitter keyword: intact mass, de novo sequencing, top-down, bottom-up, polyclonal antibody sequencing
Contact List
Baozhen Shan (lab head)
contact affiliation: Bioinformatics Solutions Inc., Waterloo, Ontario, Canada
contact email: bshan@bioinfor.com
Ngoc Hieu Tran (dataset submitter)
contact affiliation: Bioinformatics Solutions Inc.
contact email: htran@bioinfor.com
Full Dataset Link List
Dataset FTP location
NOTE: Most web browsers no longer support FTP access within the browser window. You can install a separate FTP client (we recommend FileZilla) and configure your browser to launch it when you click this FTP link, or open the client directly and use this address: ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2025/10/PXD063526
PRIDE project URI
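For scripted access instead of a graphical FTP client, the FTP address above can also be read with Python's standard `ftplib` module. The following is a minimal sketch, not an official PRIDE tool: the helper names `split_ftp_url` and `list_dataset_files` are hypothetical, and it assumes the server accepts anonymous logins and that the dataset directory is still at the address given in the note.

```python
from ftplib import FTP
from urllib.parse import urlparse

# FTP address copied from the dataset record above.
DATASET_URL = "ftp://ftp.pride.ebi.ac.uk/pride/data/archive/2025/10/PXD063526"


def split_ftp_url(url):
    """Split an ftp:// URL into (host, directory path)."""
    parts = urlparse(url)
    if parts.scheme != "ftp":
        raise ValueError(f"not an FTP URL: {url}")
    return parts.hostname, parts.path


def list_dataset_files(url=DATASET_URL, timeout=30):
    """Return the entry names in the dataset directory (needs network access)."""
    host, path = split_ftp_url(url)
    with FTP(host, timeout=timeout) as ftp:
        ftp.login()    # anonymous login; assumed to be permitted by the server
        ftp.cwd(path)  # move into the dataset directory
        return ftp.nlst()
```

Calling `list_dataset_files()` with no arguments connects to the server and returns the directory listing as a list of names; individual files could then be fetched with `FTP.retrbinary`.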