I need your help. I have more than 40000 proteins in fasta file format.
First I want to write a function:
- that is able to calculate the masses of the b- and y-ions
- that creates a peptide database from the target proteins (mat-file)
- that creates a peptide database coming from the decoy proteins (mat-file)
Then, I want to:
- load the observed data
- filter the peptide databases for candidate peptides given a certain ppm accuracy
- write a function that scores the candidate peptides against the observed data
- Come up with a thresholding scheme to discern bonafide peptide spectrum matches from the bogus ones