gravatar for hodiv99219

2 hours ago by

Sorry for the beginner question but I did map my dataset (8 individuals modern and ancient DNA) to hs37d5 and did the pseudohaploid call on all sites; I end up with eigenstrat output with the .snp having 38M snps.
After that I wanted to merge my dataset with the 1240K eigenstrat dataset, i do so using the build in "mergeit" function from the package.

However while I merge the two dataset I have this error displaying:

Local.snp: genetic distance set from physical distance

On top of that, the .geno and .snp file are empty, while the .ind were merged successfully.

I guessed it was because I don't use the same reference as they do ?

The pipeline I used worked on previous dataset, the only stuff that changed is that I don't specify SNPs to do the calling (i also noticed that the output is myoutput.snp.txt instead of myoutput.snp usually.. but after renaming it, everything is fine on that end)

I also thought it was due that my dataset wasn't in Eigenstrat format but I did specified my calling as follow:

samtools mpileup -R -B -q 1 -f hs37d5.fa -b BAMLIST.txt | awk '$3=="R" {$3="N"; print} {print}' |  pileupCaller --sampleNameFile SAMPLELIST.txt --samplePopName TESTOUT  -o EigenStrat -e Call/Local

Does anyone have an idea why can't i merge those two datasets ?


modified 2 hours ago

2 hours ago


Source link