Models
InstaNovo 1.1.0 includes two new models, instanovo-v1.1.0.ckpt and instanovoplus-v1.1.0.ckpt, trained
on a larger dataset with more post-translational modifications (PTMs).
Note: Most of the InstaNovo Extended 1.0.0 training data misrepresented Cysteine as unmodified. Please update to the latest version of the model.
Training Datasets
- ProteomeTools Part I (PXD004732), II (PXD010595), and III (PXD021013) (referred to as the all-confidence ProteomeTools (AC-PT) dataset in our paper)
- Additional PRIDE datasets with more modifications: PXD000666, PXD000867, PXD001839, PXD003155, PXD004364, PXD004612, PXD005230, PXD006692, PXD011360, PXD011536, PXD013543, PXD015928, PXD016793, PXD017671, PXD019431, PXD019852, PXD026910, PXD027772
- MassIVE-KB v1
- Additional phosphorylation dataset (not yet publicly released)
Acknowledgements
Big thanks to Pathmanaban Ramasamy, Tine Claeys, and Lennart Martens of the CompOmics research group for providing us with additional phosphorylation training data.