Dataset supporting the identification of machine learning-lead validation of SET8 substrates using targeted mass spectrometry (PRM-MS).