I have kind of fastq files with multiple records:
1. >ID some information
2. --A-TGTGAC
3. 0100111100
etc.
Where the 2nd line is a consensus sequence (gap or nucleotide), and 3rd is (now binary) a conservative.
How to parse this file and extract position with the "1" score?
Pure Python code is too complicated. Biopython works with only Phred score.
Source link