Hello all,

I am new to bioinformatics and I need help doing the following:

1) Extract all gene, isoform, coding exons, 5'UTR and 3'UTR coordinates in BED format from the following GTF file:

ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_33/gencode.v33.annotation.gtf.gz

For 5' and 3' UTR regions there should be two versions

1) Gene level

2) Isoform level

For gene level data include all UTRs of all the isoforms and merge any overlapping isoform UTRs.



Source link