×

nf-core/metacache/build @ 0.0.0-3fc6d40

Taxonomic profiling database building with MetaCache

Latest version: 0.0.0-6c4ed3a
Total downloads: 12
Source: nf-core/modules
Authors: @Gullumluvl
Maintainers: @Gullumluvl

Summary

Taxonomic profiling database building with MetaCache

Get started

Add the following snippet to your workflow script to include this module.

include { METACACHE_BUILD } from 'nf-core/metacache/build'

License

MIT License

Process
Name METACACHE_BUILD
Input 3 channels
#1 tuple
meta map

Groovy Map containing sample information e.g. [ id:'sample1' ]

genome_files file

(possibly gzipped) fasta or fastq files of full genomes, for example from an NCBI assembly

*.{fna,fa,fasta,fnq,fq,fastq}{,.gz}
taxonomy file

NCBI taxonomy formatted files nodes.dmp and names.dmp

{names,nodes,merged}.dmp
seq2taxid file

NCBI-style 'accession2taxid' tab-separated file with 3 or 4 columns: accession, accession_version, taxid, and gid (optional)

*
Output 2 channels
#1 db tuple
meta map

Groovy Map containing sample information e.g. [ id:'sample1' ]

*.meta file

sequence signature database binary file

*.meta
*.cache* file

sequence signature database binary files

*.cache+([0-9])
#2 versions_metacache tuple
${task.process} string

The name of the process

metacache string

The name of the tool

metacache info |& sed -n 's/^MetaCache version \+\([0-9.]\+\).*\$/\1/p' eval

The expression to obtain the version of the tool

Tool Description Homepage
metacache MetaCache is a classification system for mapping genomic sequences (short reads, long reads, contigs, ...) from metagenomic samples to their most likely taxon of origin. It aims to reduce the memory requirement usually associated with k-mer based methods while retaining their speed. MetaCache uses locality sensitive hashing to quickly identify candidate regions within one or multiple reference genomes. A read is then classified based on the similarity to those regions. For an independent comparison to other tools in terms of classification accuracy see the LEMMI benchmarking site. The latest version of MetaCache classifies around 60 Million reads (of length 100) per minute against all complete bacterial, viral and archaea genomes from NCBI RefSeq Release 97 running with 88 threads on a workstation with 2 Intel(R) Xeon(R) Gold 6238 CPUs. https://muellan.github.io/metacache
Version 0.0.0-3fc6d40
Commit ID 6c4ed3a220310b905a1fc9d04f05be2e0837142b
Release Date 22 Apr 2026 15:21:31 (UTC)
Download URL https://registry.nextflow.io/api/v1/modules/nf-core%2Fmetacache%2Fbuild/0.0.0-3fc6d40/download
OCI Store URL https://public.cr.seqera.io/v2/nextflow/plugin/modules/nf-core/metacache/build/blobs/sha256:d09dbc0cc3cafa746d5f97026fc02b7cbef589a43d0a3b2eafc9840ce5fe4d94
Size 4.0 KB
Checksum sha256:d09dbc0cc3cafa746d5f97026fc02b7cbef589a43d0a3b2eafc9840ce5fe4d94
Downloads 3
Version Date Status Downloads Size
0.0.0-6c4ed3a 23 Apr 2026 15:30:10 (UTC) 3 4.0 KB
0.0.0-3fc6d40 22 Apr 2026 15:21:31 (UTC) 3 4.0 KB
0.0.0-7f68141 16 Apr 2026 15:01:10 (UTC) 3 4.0 KB
0.0.0-3c11db3 12 Apr 2026 15:00:07 (UTC) 1 4.0 KB
0.0.0-0c7146d 08 Apr 2026 19:17:17 (UTC) 2 4.0 KB