Hi everyone, I'm new to sequencing and trying to get my bearings around the open-source tools that are available.

Currently, I'm trying to figure out what the SRA ID is for this paper: [https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7863352/][1]. The linked [NCBI BioProject][2] page gives the ID as 517527, but when I use the SRA tools to print the sequences with `fastq-dump --stdout SRR517527`, I get ~5000 entries in the output. This doesn't make sense to me, because the BioProject says that there are only 318 data samples.

I'd really appreciate any advice on this issue, especially an explanation of the difference between SRR/ERR/DRR accessions and SRA. A link to where I can read about the difference would also help, since I couldn't find a good guide on the NCBI website.
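
For context, here is a minimal sketch of how the accession prefixes are usually read, assuming the standard naming convention shared by NCBI, EBI, and DDBJ: `PRJ...` for BioProjects, and `SRP`/`SRX`/`SRS`/`SRR` for SRA studies, experiments, samples, and runs, with a leading `S`, `E`, or `D` marking the archive that issued the accession. The function name `classify_accession` and the lookup table are illustrative only and are not part of the SRA tools. If this reading is right, `PRJNA517527` and `SRR517527` are different kinds of records that happen to share the same digits.

```python
import re

# Illustrative lookup table (an assumption, not an official API).
# The first letter of SRA-style accessions marks the issuing archive:
# S = NCBI SRA, E = EBI ENA, D = DDBJ.
ACCESSION_KINDS = [
    (re.compile(r"^PRJ(NA|EB|DB)\d+$"), "BioProject"),
    (re.compile(r"^[SED]RP\d+$"), "SRA study"),
    (re.compile(r"^[SED]RX\d+$"), "SRA experiment"),
    (re.compile(r"^[SED]RS\d+$"), "SRA sample"),
    (re.compile(r"^[SED]RR\d+$"), "SRA run"),
]


def classify_accession(accession: str) -> str:
    """Return the record type implied by an accession's prefix."""
    acc = accession.strip().upper()
    for pattern, kind in ACCESSION_KINDS:
        if pattern.match(acc):
            return kind
    return "unknown"


if __name__ == "__main__":
    for acc in ["PRJNA517527", "SRR517527", "517527"]:
        print(acc, "is:", classify_accession(acc))
```

Under this convention a single BioProject can contain many runs, so the run accessions for the paper would have to be looked up from the BioProject page rather than built from the project number.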

  [1]: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7863352/
  [2]: https://www.ncbi.nlm.nih.gov/bioproject/PRJNA517527


