×

nf-core/metacache/build @ 0.0.0-3c11db3

Taxonomic profiling database building with MetaCache

Latest version: 0.0.0-6c4ed3a
Total downloads: 12
Source: nf-core/modules
Authors: @Gullumluvl
Maintainers: @Gullumluvl

Summary

Taxonomic profiling database building with MetaCache

Get started

Add the following snippet to your workflow script to include this module.

include { METACACHE_BUILD } from 'nf-core/metacache/build'

License

MIT License

Process
Name METACACHE_BUILD
Input 3 channels
#1 tuple
meta map

Groovy Map containing sample information e.g. [ id:'sample1' ]

genome_files file

(possibly gzipped) fasta or fastq files of full genomes, for example from an NCBI assembly

*.{fna,fa,fasta,fnq,fq,fastq}{,.gz}
taxonomy file

NCBI taxonomy formatted files nodes.dmp and names.dmp

{names,nodes,merged}.dmp
seq2taxid file

NCBI-style 'accession2taxid' tab-separated file with 3 or 4 columns: accession, accession_version, taxid, and gid (optional)

*
Output 2 channels
#1 db tuple
meta map

Groovy Map containing sample information e.g. [ id:'sample1' ]

*.meta file

sequence signature database binary file

*.meta
*.cache* file

sequence signature database binary files

*.cache+([0-9])
#2 versions_metacache tuple
${task.process} string

The name of the process

metacache string

The name of the tool

metacache info |& sed -n 's/^MetaCache version \+\([0-9.]\+\).*\$/\1/p' eval

The expression to obtain the version of the tool

Tool Description Homepage
metacache MetaCache is a classification system for mapping genomic sequences (short reads, long reads, contigs, ...) from metagenomic samples to their most likely taxon of origin. It aims to reduce the memory requirement usually associated with k-mer based methods while retaining their speed. MetaCache uses locality sensitive hashing to quickly identify candidate regions within one or multiple reference genomes. A read is then classified based on the similarity to those regions. For an independent comparison to other tools in terms of classification accuracy see the LEMMI benchmarking site. The latest version of MetaCache classifies around 60 Million reads (of length 100) per minute against all complete bacterial, viral and archaea genomes from NCBI RefSeq Release 97 running with 88 threads on a workstation with 2 Intel(R) Xeon(R) Gold 6238 CPUs. https://muellan.github.io/metacache
Version 0.0.0-3c11db3
Commit ID 6c4ed3a220310b905a1fc9d04f05be2e0837142b
Release Date 12 Apr 2026 15:00:07 (UTC)
Download URL https://registry.nextflow.io/api/v1/modules/nf-core%2Fmetacache%2Fbuild/0.0.0-3c11db3/download
OCI Store URL https://public.cr.seqera.io/v2/nextflow/plugin/modules/nf-core/metacache/build/blobs/sha256:f69065c0859f14d8d3a1f373e54fdf5c7f14af0d836bcb5e7b976eac4eeeb9e4
Size 4.0 KB
Checksum sha256:f69065c0859f14d8d3a1f373e54fdf5c7f14af0d836bcb5e7b976eac4eeeb9e4
Downloads 1
Version Date Status Downloads Size
0.0.0-6c4ed3a 23 Apr 2026 15:30:10 (UTC) 3 4.0 KB
0.0.0-3fc6d40 22 Apr 2026 15:21:31 (UTC) 3 4.0 KB
0.0.0-7f68141 16 Apr 2026 15:01:10 (UTC) 3 4.0 KB
0.0.0-3c11db3 12 Apr 2026 15:00:07 (UTC) 1 4.0 KB
0.0.0-0c7146d 08 Apr 2026 19:17:17 (UTC) 2 4.0 KB