Category

Genome Variant Analysis


Usage

java -jar GenomeAnalysisTK.jar -T ContEst -R reference.fasta -I:eval tumor.bam -I:genotype normal.bam --popFile populationAlleleFrequencies.vcf -L populationSites.interval_list [-L targets.interval_list] -isr INTERSECTION -o output.txt


Manual

Argument name(s)Default valueSummary
Required Inputs
--popfile
 -pf
NAthe variant file containing information about the population allele frequencies
Optional Inputs
--genotypes
nonethe genotype information for our sample
Optional Outputs
--out
 -o
stdoutAn output file created by the walker. Will overwrite contents if file exists
Optional Parameters
--base_report
 -br
NAWhere to write a full report about the loci we processed
--beta_threshold
0.95threshold for p(f>=0.5) to trim
--genotype_mode
 -gm
HARD_THRESHOLDwhich approach should we take to getting the genotypes (only in array-free mode)
--lane_level_contamination
 -llc
NAset to META (default), SAMPLE or READGROUP to produce per-bam, per-sample or per-lane estimates
--likelihood_file
 -lf
NAwrite the likelihood values to the specified location
--min_mapq
20threshold for minimum mapping quality score
--min_qscore
20threshold for minimum base quality score
--minimum_base_count
 -mbc
500what minimum number of bases do we need to see to call contamination in a lane / sample?
--population
CEUevaluate contamination for just a single contamination population
--precision
 -pc
0.1the degree of precision to which the contamination tool should estimate (e.g. the bin size)
--sample_name
 -sn
unknownThe sample name; used to extract the correct genotypes from mutli-sample truth vcfs
--trim_fraction
0.01at most, what fraction of sites should be trimmed based on BETA_THRESHOLD
Optional Flags
--verify_sample
 -vs
falseshould we verify that the sample name is in the genotypes file?
Advanced Parameters
--fixed_epsilon_qscore
NAuse a constant epsilon (phred scale) for calculation
--min_genotype_depth
50what minimum depth is required to call a site in seq genotype mode
--min_genotype_llh
5.0the min log likelihood for UG to call a genotype
--min_genotype_ratio
0.8the ratio of alt to other bases to call a site a hom non-ref variant
--min_site_depth
0minimum depth at a site to consider in calculation
--trim_interval
0.0progressively trim from 0 to TRIM_FRACTION by this interval


Share your experience or ask a question