Updated project metadata.
Abstract We have re-analysed publicly available mass spectrometry (MS) data sets enriched for phosphopeptides from Asian rice (Oryza sativa). In total we have identified, 15522 phosphosites on Serine, Threonine and Tyrosine residues on rice proteins. The data has been loaded into UniProtKB, enabling researchers to visualise the sites alongside other stored data on rice proteins, including structural models from AlphaFold2, and into PeptideAtlas, enabling visualisation of the source evidence for each site, including scores and source mass spectra. We identified sequence motifs for phosphosites, and link motifs to enrichment of different biological processes, indicating different downstream regulation caused by different kinase groups. We cross-referenced phosphosites against single amino acid variation (SAAV) data sourced from the rice 3000 genomes data, to identify SAAVs within or proximal to phosphosites that could cause loss of a particular site in a given rice variety. The data was further clustered to identify groups of sites with similar patterns across rice family groups, allowing us to identify sites highly conserved in Japonica, but mostly absent in, for example, Aus type rice varieties - known to have different responses to drought. These resources can assist rice researchers to discover alleles with significantly different functional effects across rice varieties.