Genome Variant Analysis

java -jar GenomeAnalysisTK.jar
Function: Create plots to visualize base recalibration results
Usage: java -jar GenomeAnalysisTK.jar -T AnalyzeCovariates -R myrefernce.fasta -before recal2.table -after recal3.table -plots recalQC.pdf
java -jar GenomeAnalysisTK.jar
Function: Compute the read error rate per position
Usage: java -jar GenomeAnalysisTK.jar -T ErrorRatePerCycle -R reference.fasta -I my_sequence_reads.bam -o error_rates.gatkreport.txt
VarScan
Function: Call variants and identifies their somatic status (Germline/LOH/Somatic) using pileup files from a matched tumor-normal pair.
Usage: java -jar VarScan.jar copynumber [normal_pileup] [tumor_pileup] [output] OPTIONS
GEMINI autosomal recessive
Function: Find variants meeting an autosomal recessive model
Usage: gemini autosomal_recessive test.auto_rec.db --columns "chrom,start,end,gene"
VarScan
Function: Performs set-comparison operations on two files of variants.
Usage: java -jar VarScan.jar compare [file1] [file2] [type] [output] OPTIONS
java -jar GenomeAnalysisTK.jar
Function: Validate a VCF file with an extra strict set of criteria
Usage: java -jar GenomeAnalysisTK.jar -T ValidateVariants -R reference.fasta -V input.vcf --validationTypeToExclude ALLELES
java -jar GenomeAnalysisTK.jar
Function: Simplify multi-nucleotide variants (MNPs) into more basic/primitive alleles.
Usage: java -jar GenomeAnalysisTK.jar -T VariantsToAllelicPrimitives -R reference.fasta -V input.vcf -o output.vcf
gatk FastaAlternateReferenceMaker
Function: Generate an alternative reference sequence over the specified interval
Usage: gatk FastaAlternateReferenceMaker [options] -R reference.fasta -O output.fasta -L input.intervals -V input.vcf
Supported input format: VCF
java -jar GenomeAnalysisTK.jar
Function: Count the number of bases in a set of reads
Usage: java -jar GenomeAnalysisTK.jar -R reference.fasta -T CountBases -I input.bam [-L input.intervals]
java -jar GenomeAnalysisTK.jar
Function: Define intervals to target for local realignment
Usage: java -jar GenomeAnalysisTK.jar -T RealignerTargetCreator -R reference.fasta -I input.bam --known indels.vcf -o forIndelRealigner.intervals
java -jar GenomeAnalysisTK.jar
Function: Create plots to visualize base recalibration results
Usage: java -jar GenomeAnalysisTK.jar -T AnalyzeCovariates -R myrefernce.fasta -ignoreLMT -BQSR recal1.table \ # you can discard any two -before recal2.table -after recal3.table -plots myrecals.pdf
java -jar GenomeAnalysisTK.jar
Function: Compare GATK's internal pileup to a reference Samtools pileup
Usage: java -jar GenomeAnalysisTK.jar -T CheckPileup -R reference.fasta -I your_data.bam --pileup:SAMPileup pileup_file.txt -L chr1:257-275 -o output_file_name
java -jar GenomeAnalysisTK.jar
Function: Count the number of reads seen from male samples
Usage: java -jar GenomeAnalysisTK.jar -T CountMales -R reference.fasta -I samples.bam -o output.txt
java -jar GenomeAnalysisTK.jar
Function: Validate a VCF file with an extra strict set of criteria
Usage: java -jar GenomeAnalysisTK.jar -T ValidateVariants -R reference.fasta -V input.vcf --validationTypeToExclude ALL
java -jar GenomeAnalysisTK.jar
Function: Randomly split variants into different sets
Usage: java -jar GenomeAnalysisTK.jar -T RandomlySplitVariants -R reference.fasta -V input.vcf -o1 output_1.vcf -o2 output_2.vcf