Hello,
I used samtools depth to get a textfile of coverage across my genome for ~600 samples. In order to inform coverage cutoffs in the next step I'd like to make a summary/histogram of some sort that summarizes coverage for all of these genomes together.
For example, eventually I'd like to say, filter out reads below x coverage and above y coverage. This would be based on the distribution of coverage for all samples to filter out likely duplicates etc.
Any help is appreciated!