
Genome Variant Analysis


java -jar GenomeAnalysisTK.jar -T UnifiedGenotyper -R reference.fasta -I input.bam -o raw_variants.vcf --output_mode EMIT_ALL_SITES
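
When the invocation above has to be repeated over many BAM files, it can help to assemble it programmatically. The following is a minimal Python sketch, not part of GATK: the function name `build_unified_genotyper_command` is hypothetical, it uses only the flags that appear in the command above, and its default for the type of calls to output is the EMIT_VARIANTS_ONLY value listed in the argument table. It only builds the argument list and does not launch Java.

```python
def build_unified_genotyper_command(reference, bam, out_vcf,
                                    output_mode="EMIT_VARIANTS_ONLY"):
    """Return the argument list for a UnifiedGenotyper run (hypothetical helper).

    Only flags shown in the example command are used; nothing is executed.
    """
    return [
        "java", "-jar", "GenomeAnalysisTK.jar",
        "-T", "UnifiedGenotyper",
        "-R", reference,        # reference FASTA
        "-I", bam,              # input BAM
        "-o", out_vcf,          # output VCF
        "--output_mode", output_mode,
    ]


cmd = build_unified_genotyper_command(
    "reference.fasta", "input.bam", "raw_variants.vcf",
    output_mode="EMIT_ALL_SITES",
)
print(" ".join(cmd))
```

The resulting list can be handed to `subprocess.run(cmd, check=True)` from the Python standard library if the jar and input files are present in the working directory.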


Default value           Summary
Optional Inputs
none                    Set of alleles to use in genotyping
[]                      Comparison VCF file
none                    dbSNP file
Optional Outputs
stdout                  File to which variants should be written
Optional Parameters
[]                      One or more specific annotations to apply to variant calls
0.0                     Fraction of contamination to aggressively remove
[]                      One or more specific annotations to exclude
SNP                     Genotype likelihoods calculation model to employ: SNP (the default), INDEL for calling indels, or BOTH for calling both together
DISCOVERY               Specifies how to determine the alternate alleles to use for genotyping
[Standard, StandardUG]  One or more classes/groups of annotations to apply to variant calls. The single value 'none' removes the default group
0.001                   Heterozygosity value used to compute prior likelihoods for any locus
0.01                    Standard deviation of heterozygosity for SNP and indel calling
1.25E-4                 Heterozygosity for indel calling
0.05                    Maximum fraction of reads with deletions spanning this locus for it to be callable
17                      Minimum base quality required to consider a base for calling
5                       Minimum number of consensus indels required to trigger genotyping run
0.25                    Minimum fraction of all reads at a locus that must contain an indel (of any allele) for that sample to contribute to the indel count for alleles
LOGLESS_CACHING         The PairHMM implementation to use for -glm INDEL genotype likelihood calculations
1.0E-4                  The PCR error rate to be used for computing fragment-based likelihoods
2                       Ploidy per sample. For pooled data, set to (Number of samples in each pool * Sample Ploidy)
10.0                    The minimum phred-scaled confidence threshold at which variants should be called
Optional Flags
false                   Annotate number of alleles observed
false                   If provided, we will calculate the SLOD (SB annotation)
false                   Use new AF model instead of the so-called exact model
Advanced Parameters
NA                      Contamination per sample
10                      Indel gap continuation penalty, as Phred-scaled probability. I.e., 30 => 10^(-30/10)
45                      Indel gap open penalty, as Phred-scaled probability. I.e., 30 => 10^(-30/10)
[]                      Input prior for calls
6                       Maximum number of alternate alleles to genotype
1024                    Maximum number of genotypes to consider at any site
100                     Maximum number of PL values to output
[]                      If provided, only these samples will be emitted into the VCF, regardless of which samples are present in the BAM file
EMIT_VARIANTS_ONLY      Which type of calls we should output
Advanced Flags
false                   Annotate all sites with PLs
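
Two entries in the table involve small calculations: the Phred-scaled values (a value Q corresponds to a probability of 10^(-Q/10)) and the ploidy setting for pooled data (number of samples in each pool times sample ploidy). The sketch below works through both in Python. The function names are illustrative only and are not part of GATK.

```python
def phred_to_probability(q):
    """Convert a Phred-scaled value Q to a probability, 10^(-Q/10)."""
    return 10 ** (-q / 10)


def pooled_ploidy(samples_per_pool, sample_ploidy=2):
    """Ploidy to pass for pooled data: samples in each pool * sample ploidy."""
    return samples_per_pool * sample_ploidy


# Default values taken from the table above.
gap_open = phred_to_probability(45)          # indel gap open penalty
gap_continuation = phred_to_probability(10)  # indel gap continuation penalty
call_threshold = phred_to_probability(10.0)  # minimum calling confidence

print(f"gap open probability:         {gap_open:.3g}")
print(f"gap continuation probability: {gap_continuation:.3g}")
print(f"error probability at the calling threshold: {call_threshold:.3g}")

# Example: pools of 10 diploid samples each.
print("ploidy for 10 diploid samples per pool:", pooled_ploidy(10, 2))
```

A higher Phred value means a smaller probability, so the default gap open penalty of 45 makes opening an indel far less likely than extending one at the default continuation penalty of 10.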
