I would like to align reads to the genome of the Macaca fascicularis monkey.
Until now, the genomes I worked with appeared in the Illumina iGenomes page:

emea.support.illumina.com/sequencing/sequencing_software/igenome.html

And so I had the WholeGenomeFasta/genome.fa file to work with (from which I created the STAR index etc).
However, there is no iGenome for Macaca fascicularis and so I downloaded the fasta files from the Ensembl site, under "Download DNA Sequences":

www.ensembl.org/Macaca_fascicularis/Info/Index

Which are all the files here:

ftp.ensembl.org/pub/release-104/fasta/macaca_fascicularis/dna/

I'd like to create a single genome.fa file that is similar to the one from iGenomes. My question is: should this be done simply by concatenating the *dna.primary_assembly fasta files?

Thanks a lot! 🙂

Daniella



Source link