Recent advances in mass spectrometry-based peptidomics has catalyzed the identification and quantification of thousands endogenous peptides across diverse biological systems. However, the large theoretical peptidomic landscape and high proportion of missing values poses several challenges for downstream analyses and limits the comparability of clinical samples. Here, we present a generalizable computational workflow that clusters peptides with overlapping sequences to reduce the dimensionality of peptidomic data, improve the definition of protease cut-sites, enhance inter-sample comparability, and enable the implementation of reliable and large-scale data analysis methods akin to those employed in other omics fields. We showcase the algorithm by performing large-scale quantitative analysis of wound fluid peptidomes of highly defined porcine wounds and human clinical non-healing wounds. The analysis revealed signature phenotype-specific peptide regions reflecting pathogen-specific proteolytic activity at the earliest stages of colonization, resulting in novel class of potential peptide cluster-based biomarkers.