PepNovo

Live Search    Download   Publications
Contact: Ari Frank

De novo sequencing of low precision MS/MS data

The standard tool for high throughput sequencing of MS/MS data is the database search (tools like Sequest, Mascot, and InsPecT). However, there are cases where the traditional database search cannot be used. For instance, to be able to offer the relevant candidate peptides for a database search, the target organism's genome must be sequenced. Though the number of sequenced genomes is constantly growing, many organisms are still not sequenced, and do not even have a sequenced close homologue. In addition, even if a genome is sequenced, all its alternative splice variants of genes also need to be known to be able to identify all peptides in a sample.

The de novo sequencing method is useful in the situations described above since it does not require any knowledge of the sequenced genome, rather it performs all its sequencing effort using only the information present in the mass spectrum itself. In addition de novo sequencing can serve as an independent verification stage for the database search results.

We developed PepNovo to serve as a high throughput de novo peptide sequencing tool for tandem mass spectrometry data. PepNovo typically runs in less than 0.2 seconds per spectrum. PepNovo uses a probabilistic network to model the peptide fragmentation events in a mass spectrometer. In addition, it uses a likelihood ratio hypothesis test to determine if the peaks observed in the mass spectrum are more likely to have been produced under the fragmentation model, than under a probabilistic model that treats the appearance of peaks as random events. In benchmark experiments, PepNovo was found to outperform several of leading de novo algorithms such as SHERENGA, Peaks, and Lutefisk.

 

Download

New PepNovo v3 + MS-Filter

PepNovo v2 + MS-Filter

PepNovo for MS-Blast

PepNovo v1 (older version for low-precision ion-trap only):
- Windows binaries
- Source code

Publications

De Novo Peptide Sequencing and Identification with Precision Mass Spectrometry
Ari M. Frank, Mikhail M. Savitski, Michael L. Nielsen, Roman A. Zubarev, and Pavel A. Pevzner
J. Proteome Res. 6:114-123, 2007.
Paper Pubmed

PepNovo: De Novo Peptide Sequencing via Probabilistic Network Modeling
Frank, A. and Pevzner, P.
Analytical Chemistry 77:964-973, 2005.
Paper Pubmed

Peptide sequence tags for fast database search in mass-spectrometry.
Frank A, Tanner S, Bafna V, Pevzner P.
J Proteome Res. 2005 Jul-Aug;4(4):1287-95.
Paper Pubmed

Peptide Sequence Tags for Fast Database Search in Mass Spectrometry.
Ari Frank, Stephen Tanner, and Pavel Pevzner.
Conference on Research in Computational Molecular Biology (RECOMB) 2005.


Latest Releases

PepNovo

2008.07.08

MS-Clustering

2008.06.09

Inspect, MS-Alignment

2008.04.04

MS-Dictionary

2007.11.30

MS-GeneratingFunction

2007.11.28

Spectral Networks

Sept 2007

 

 

 

Media Coverage


A powerful tool for PTM discovery (Jan 2008, Journal of Proteome research, Vol 7. Issue 1)


From spectral networks to shotgun sequencing (June 2007, Nature Methods, Vol. 4 No. 6)

Identifying peptides without a database (May 2007, Journal of Proteome Research)

UCSD Computer Scientist Wins Young Investigator Award, Research on Snake Venom Proteins Highlighted (Nov 2006, UCSD)