REDItoolDnaRna.py manual | BioQueue Encyclopedia

Usage

REDItoolDnaRna.py -i rnaseq.bam -j dnaseq.bam -f myreference.fa -o myoutputfolder

Manual

Options:

-i RNA-Seq BAM file

-j DNA-Seq BAM files separated by comma or folder containing BAM files. Note that each chromosome/region must be present in a single BAM file only.

-I Sort input RNA-Seq BAM file

-J Sort input DNA-Seq BAM file

-f Reference file in fasta format. Note that chromosome/region names in the reference must match chromosome/region names in BAMs files.

-C Base interval to explore [100000]. It indicates how many bases have to be loaded during the run.

-k List of chromosomes to skip separated by comma or file (each line must contain a chromosome/region name).

-t Number of threads [1]. It indicates how many processes should be launched. Each process will work on an individual chromosome/region.

-o Output folder [rediFolder_XXXX] in which all results will be stored. XXXX is a random number generated at each run.

-F Internal folder name [null] is the main folder containing output tables.

-M Save a list of columns with quality scores. It produces at most two files in the pileup-like format.

-c Minimum read coverage (dna,rna) [10,10]

-Q Fastq offset value (dna,rna) [33,33]. For Illumina fastq 1.3+ 64 should be used.

-q Minimum quality score (dna,rna) [25,25]

-m Minimum mapping quality score (dna,rna) [25,25]

-O Minimum homoplymeric length (dna,rna) [5,5]

-s Infer strand (for strand oriented reads) [1]. It indicates which read is in line with RNA. Available values are: 1:read1 as RNA,read2 not as RNA; 2:read1 not as RNA,read2 as RNA; 12:read1 as RNA,read2 as RNA; 0:read1 not as RNA,read2 not as RNA.

-g Strand inference type 1:maxValue 2:useConfidence [1]; maxValue: the most prominent strand count will be used; useConfidence: strand is assigned if over a prefixed frequency confidence (-x option)

-x Strand confidence [0.70]

-S Strand correction. Once the strand has been inferred, only bases according to this strand will be selected.

-G Infer strand by GFF annotation (must be GFF and sorted, otherwise use -X). Sorting requires grep and sort unix executables.

-K GFF File with positions to exclude (must be GFF and sorted, otherwise use -X). Sorting requires grep and sort unix executables.

-T Work only on given GFF positions (must be GFF and sorted, otherwise use -X). Sorting requires grep and sort unix executables.

-X Sort annotation files. It requires grep and sort unix executables.

-e Exclude multi hits in RNA-Seq

-E Exclude multi hits in DNA-Seq

-d Exclude duplicates in RNA-Seq

-D Exclude duplicates in DNA-Seq

-p Use paired concardant reads only in RNA-Seq

-P Use paired concardant reads only in DNA-Seq

-u Consider mapping quality in RNA-Seq

-U Consider mapping quality in DNA-Seq

-a Trim x bases up and y bases down per read [0-0] in RNA-Seq

-A Trim x bases up and y bases down per read [0-0] in DNA-Seq

-b Blat folder for correction in RNA-Seq

-B Blat folder for correction in DNA-Seq

-l Remove substitutions in homopolymeric regions in RNA-Seq

-L Remove substitutions in homopolymeric regions in DNA-Seq

-v Minimum number of reads supporting the variation [3] for RNA-Seq

-n Minimum editing frequency [0.1] for RNA-Seq

-N Minimum variation frequency [0.1] for DNA-Seq

-z Exclude positions with multiple changes in RNA-Seq

-Z Exclude positions with multiple changes in DNA-Seq

-W Select RNA-Seq positions with defined changes (separated by comma ex: AG,TC) [default all]

-R Exclude invariant RNA-Seq positions

-V Exclude sites not supported by DNA-Seq

-w File containing splice sites annotations (SpliceSite file format see above for details)

-r Num. of bases near splice sites to explore [4]

--gzip Gzip output files

-h, --help Print the help

REDItoolDnaRna.py

Category

Usage

Manual

Share your experience or ask a question