I want to download gene sequences for a list of genes, including UTR annotations. For some reason I was sure that this information was provided with each genome at the time of release, as an output of some gene-model identification step.
I therefore assumed I had also extracted the UTR regions when I bulk-downloaded the gene sequences for my list using the Unspliced (Gene) option in BioMart/Ensembl. It turned out that those sequences were not annotated. It also turned out that I had no idea about this, as I have no experience with genome assembly/annotation 🙂
Can somebody tell me how this annotation is performed, and why it is released for some genomes but not for others? Is there a way to perform this annotation de novo, for example for a list of genes of interest?
Thanks in advance.