Calculate (mean) sequence divergence for many sequences

Question

I have ~13K sequences of 120 bases each, and I want to compare them to find things like conserved regions, the mean divergence between them, or strongly diverging outliers.

The problem is that, with this many sequences, the approaches I have tried are not feasible.

Has anyone done something similar at this scale and can give me some hints on how to achieve it? Or maybe just some tips on where to look?

score 2 · Answer 1 · answered Sep 20 '16 at 14:36


Use the dnadist program from the PHYLIP package. The Biopython library provides some help for dealing with the PHYLIP alignment format.
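As a rough illustration of the input preparation step, here is a minimal sketch that writes equal-length sequences in strict sequential PHYLIP layout (a header with the number of sequences and sites, then one line per sequence with the name padded to 10 characters). It uses only the standard library as a stand-in for Biopython; the function name `to_phylip` and the sample records are hypothetical, and it assumes the sequences are already aligned and that names stay unique once truncated to 10 characters.

```python
def to_phylip(records):
    """Format (name, sequence) pairs as a strict sequential PHYLIP alignment.

    Assumes all sequences are already aligned, i.e. have the same length.
    """
    records = list(records)
    if not records:
        raise ValueError("no sequences given")

    length = len(records[0][1])
    # Header line: number of sequences, then number of sites per sequence.
    lines = [f" {len(records)} {length}"]

    seen = set()
    for name, seq in records:
        if len(seq) != length:
            raise ValueError(f"{name!r} has length {len(seq)}, expected {length}")
        # Strict PHYLIP names occupy exactly 10 characters.
        short = name[:10].ljust(10)
        if short in seen:
            raise ValueError(f"name {name!r} is not unique after truncation")
        seen.add(short)
        lines.append(short + seq)

    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical toy data; real input would be the ~13K aligned sequences.
    example = [("seq1", "ACGTACGT"), ("seq2", "ACGTACGA"), ("seq3", "ACGAACGA")]
    print(to_phylip(example), end="")
```

The resulting text can be saved to a file and given to dnadist as its input alignment. With ~13K sequences, short generated names (for example a running number) are probably the easiest way to keep the 10-character names unique.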


xbello
