How to use mafft to do large scale sequence alignment?


I wish to use mafft to do sequence alignment on a large protein sequence dataset which contains over 100,000 sequences with average sequence length being 1000 residues. I guess I need to use a supercomputer.

Does anyone know how many CPU cores and how large memory does it need to run the alignment smoothly?

Can mafft estimate the time it needs to finish an alignment? by itself And what will be the estimated time to finish above alignment if enough computational resources is input?



Source link