×

nf-core/deepvariant @ 0.0.0-7f68141

(DEPRECATED - see main.nf) DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data

Latest version: 0.0.0-7f68141
Total downloads: 10
Source: nf-core/modules
Maintainers: @abhi18av @ramprasadn

DeepVariant module / subworkflow options

The DeepVariant tool can be run using the deepvariant/rundeepvariant subcommand, or the subworkflow deepvariant, which calls the subcommands makeexamples, callvariants and postprocessvariants. The subcommand rundeepvariant is simpler, but the subworkflow may be useful if you want to run callvariants on GPU.

Conda is not supported at the moment

The bioconda recipe is not fully working as expected.

See https://github.com/bioconda/bioconda-recipes/issues/30310 and https://github.com/nf-core/modules/issues/1754 for more information.

Hence, we are using the docker container provided by the authors of the tool:

This image is mirrored on the nf-core quay.io for convenience.

DeepVariant subworkflow

You can use the subworkflow nf-core/deepvariant, which integrates the three processes to perform variant calling with common file formats.

These module subcommands incorporate the individual steps of the DeepVariant pipeline:

* makeexamples: Converts the input alignment file to a tfrecord format suitable for the deep learning model
* callvariants: Call variants based on input tfrecords. The output is also in
tfrecord format, and needs postprocessing to convert it to vcf.
* postprocessvariants: Convert variant calls from callvariants to VCF, and
also create GVCF files based on genomic information from makeexamples.

Recommended parameters

makeexamples

This process imports the data used for calling, and thus decides what information is available to the deep neural network. It's important to use the correct settings for the model you want to use for each step. The script run_deepvariant.py does this automatically. To figure out the flags needed for each model, you can run run_deepvariant.py while adding dry_run=true, to print out the command used for each step, as described here.

callvariants

It is mandatory to specify a model type. The models are available on the container filesystem in /opt/models - specify the one you want with the --checkpoint argument.

withName: "DEEPVARIANT_CALLVARIANTS" {
    ext.args = '--checkpoint "/opt/models/wgs'
}

The channels specified in the makeexamples process must match the model used for calling.

Input 5 channels
#1 tuple
meta map

Groovy Map containing sample information e.g. [ id:'test', single_end:false ]

input file

BAM/CRAM file

*.bam/cram
index file

Index of BAM/CRAM file

*.bai/crai
intervals file

file containing intervals

#2 tuple
meta2 map

Groovy Map containing reference information e.g. [ id:'genome' ]

fasta file

The reference fasta file

*.fasta
#3 tuple
meta3 map

Groovy Map containing reference information e.g. [ id:'genome' ]

fai file

Index of reference fasta file

*.fai
#4 tuple
meta4 map

Groovy Map containing reference information e.g. [ id:'genome' ]

gzi file

GZI index of reference fasta file

*.gzi
#5 tuple
meta5 map

Groovy Map containing reference information e.g. [ id:'genome' ]

par_bed file

BED file containing PAR regions

*.bed
Output 5 channels
#1 vcf tuple
meta map

Groovy Map containing sample information e.g. [ id:'test', single_end:false ]

${prefix}.vcf.gz file

Compressed VCF file

*.vcf.gz
#2 gvcf tuple
meta map

Groovy Map containing sample information e.g. [ id:'test', single_end:false ]

${prefix}.g.vcf.gz file

Compressed GVCF file

*.g.vcf.gz
#3 vcf_tbi tuple
meta map

Groovy Map containing sample information e.g. [ id:'test', single_end:false ]

${prefix}.vcf.gz.tbi file

Tabix index of compressed VCF file

*.vcf.gz.tbi
#4 gvcf_tbi tuple
meta map

Groovy Map containing sample information e.g. [ id:'test', single_end:false ]

${prefix}.g.vcf.gz.tbi file

Tabix index of compressed GVCF file

*.g.vcf.gz.tbi
#5 versions
versions.yml file

File containing software versions

versions.yml
Tool Description Homepage