Tutorial
FASTQ vs gVCF vs .genome
Understand raw reads, variant calls, confident reference regions, and why .genome is the AI-ready layer built for exploration.
What you will learn:
You will know what each file type preserves, when to use it, and how .genome fits on top of sequencing data.
Genome files exist at different distances from the sequencing machine. FASTQ is close to the instrument. gVCF is closer to variant analysis. .genome is closer to human and AI exploration. None of these is the only real genome file. They do different jobs.
Genome Computer's view is that users should not be handed raw files and left alone. Raw data matters, but raw data is not the same thing as understanding. A .genome bundle is designed to turn sequencing outputs into a structured, AI-ready context you can actually explore.
The three layers
- FASTQ: raw sequencing reads with bases and quality scores as they come off the sequencing machine.
- gVCF: a genome variant call file that records variants and confidently sequenced reference regions.
.genome: an AI-ready bundle that organizes genome data, annotations, sources, and prompts for self-exploration.
The difference between VCF and gVCF is important. A VCF is mostly about where you differ from the reference. A gVCF can also preserve confidence about regions where you match the reference. That makes it more useful for future analysis because absence of a variant is only meaningful when you know the region was confidently sequenced.
When each file is useful
- Use FASTQ when you need raw reads or want future reprocessing from the most primary sequencing output.
- Use gVCF when you want variant calls plus confidence about non-variant regions.
- Use
.genomewhen you want an AI tool to reason across annotations, sources, and prompts without starting from raw bioinformatics files. - Keep the rawer files for archival and technical review. Use
.genomefor everyday exploration.
Ask AI to explain your own files
- Upload or open your
.genomebundle. - Ask the AI to identify any FASTQ, VCF, gVCF, annotation, source, and prompt files.
- Ask what each file preserves.
- Ask which file is best for your current question.
- Ask what evidence would require a rawer file.
- Ask what can be explored directly from the
.genomelayer.
File type prompt
Inspect my genome files and classify each as FASTQ, VCF, gVCF, .genome annotation, source, prompt, or other supporting file. For each file, explain what it preserves, what it is useful for, and what kinds of questions it can and cannot answer.
Practical comparison prompt
Explain the practical difference between FASTQ, gVCF, and .genome using my files as examples. Focus on what I would use each file for when exploring my genome with AI.
The point is not to memorize file formats. The point is to know which layer to use for which job. .genome is the exploration layer. FASTQ and gVCF are the deeper technical layers you may want to keep, request, or use later.