FeatureCounts dont read all features


Dear all,

I think I may have a problem importing GTF file into featureCounts cause it only import 121 features out of 3K+ features. I got my GTF files from genbank assembly (www.ncbi.nlm.nih.gov/assembly/GCF_000021625.1) without any modification;

> fc <- featureCounts(bam.files, isGTFAnnotationFile=TRUE, annot.ext="/Users/RNAseq/GCF_009646135.1_ASM964613v1_feature_table.gtf")

Load annotation file GCA_000016245.1_ASM1624v1_genomic.gtf ...             ||
||    Features : 121                                                          ||
||    Meta-features : 121                                                     ||
||    Chromosomes/contigs : 2    

Any idea what the problem is?




updated 1 hour ago by


written 2 hours ago by


By default, featureCounts considers only exons as features.

 -t <string>         Specify feature type in GTF annotation. `exon' by 
                      default. Features used for read counting will be 
                      extracted from annotation using the provided value.

In this annotation, there is only 127 exons... so it considers only those.

 cut -f 3 GCF_000021625.1_ASM2162v1_genomic.gtf | sort | uniq -c
3733 CDS
 127 exon
3854 gene
3727 start_codon
3727 stop_codon

To correct this behaviour, use option -t 'gene" which make more sense with prokaryotes anyway.

before adding your answer.

Traffic: 2260 users visited in the last hour

Source link