Plink error: Variant names are limited to 16000 characters.

0

Hi plink experts,

I'm trying to convert *.dose.vcf(.gz) files (product of the Michigan Imputation Server) to plink binary ones and got error since some snps has extremely long IDs. Does anyone know how to tackle this?

Commands I've tried in plink1.9 and 2:

--set-missing-var-ids @:#[b37] 
--set-all-var-ids  @:#[b37] 
--new-id-max-allele-len 
--vcf *.dose.vcf dosage=HDS 
--snps-only 
--biallelic-only 

I also tried DosageConvertor which has a function --trimNames but it resulted in dosage.gz, map, fam format files, which I don't know how to use...

I found very limited info online about this (see groups.google.com/g/plink2-users/c/u6nvaI1Olyg), it suggested to trim ID before converting in plink but did not exactly say how to do it.

Any suggestion will be appreciated! Many thanks in advance!


plink


imputation

• 39 views



Source link