Format of GISTIC2 all_data_by_genes.txt for cBioPortal

The cBioPortal file format documentation, in its section on continuous copy number data, states that the GISTIC2 output file <prefix>_all_data_by_genes.txt can be used directly as the cBioPortal data file (after changing column names). cBioPortal expects this data to be in LOG2 format.

I have a file all_data_by_genes.txt (NOTE: not <prefix>_all_data_by_genes.txt) generated by a run of GISTIC2 against an amalgamated segment (*.seg) file. However, when I try to use it as the documentation describes, cBioPortal errors out, saying that there are negative numbers in the data fields of the file (and there are). This makes me assume the file is not actually LOG2 data.
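For reference, here is a minimal sketch of how the value range in the file can be checked. It assumes the file is tab-delimited with one header row and that the first three columns are annotation (Gene Symbol, Locus ID, Cytoband), which may not match every GISTIC2 run. The function name summarize_values and the example rows are made up for illustration.

```python
import csv
import io


def summarize_values(handle, n_annotation_cols=3):
    """Return (min, max, negative_count, total_count) over the sample columns.

    Assumption: the first n_annotation_cols columns hold gene annotation
    and every remaining column holds one numeric value per sample.
    """
    reader = csv.reader(handle, delimiter="\t")
    next(reader)  # skip the header row
    values = []
    for row in reader:
        for cell in row[n_annotation_cols:]:
            try:
                values.append(float(cell))
            except ValueError:
                continue  # skip blanks or markers such as "NA"
    negatives = sum(1 for v in values if v < 0)
    return min(values), max(values), negatives, len(values)


# Made-up rows standing in for the real file contents.
example = (
    "Gene Symbol\tLocus ID\tCytoband\tS1\tS2\n"
    "GENE_A\t1\t1p36.33\t-0.25\t0.10\n"
    "GENE_B\t2\t1p36.32\t1.50\t-1.20\n"
)
print(summarize_values(io.StringIO(example)))
```

On the real file, the same function can be called as summarize_values(open("all_data_by_genes.txt", newline="")).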

Does anyone know ...

  1. What is the data type/format of the data in this file?
  2. Should I be using a different output file instead of all_data_by_genes.txt?
  3. If I SHOULD be using all_data_by_genes.txt, do I need to convert the data?

Thanks!
Mike