×

nf-core/metacache/build @ 0.0.0-0c7146d

Taxonomic profiling database building with MetaCache

Latest version: 0.0.0-6c4ed3a
Total downloads: 12
Source: nf-core/modules
Authors: @Gullumluvl
Maintainers: @Gullumluvl

Summary

Taxonomic profiling database building with MetaCache

Get started

Add the following snippet to your workflow script to include this module.

include { METACACHE_BUILD } from 'nf-core/metacache/build'

License

MIT License

Process
Name METACACHE_BUILD
Input 3 channels
#1 tuple
meta map

Groovy Map containing sample information e.g. [ id:'sample1' ]

genome_files file

(possibly gzipped) fasta or fastq files of full genomes, for example from an NCBI assembly

*.{fna,fa,fasta,fnq,fq,fastq}{,.gz}
taxonomy file

NCBI taxonomy formatted files nodes.dmp and names.dmp

{names,nodes,merged}.dmp
seq2taxid file

NCBI-style 'accession2taxid' tab-separated file with 3 or 4 columns: accession, accession_version, taxid, and gid (optional)

*
Output 2 channels
#1 db tuple
meta map

Groovy Map containing sample information e.g. [ id:'sample1' ]

*.meta file

sequence signature database binary file

*.meta
*.cache* file

sequence signature database binary files

*.cache+([0-9])
#2 versions_metacache tuple
${task.process} string

The name of the process

metacache string

The name of the tool

metacache info |& sed -n 's/^MetaCache version \+\([0-9.]\+\).*\$/\1/p' eval

The expression to obtain the version of the tool

Tool Description Homepage
metacache MetaCache is a classification system for mapping genomic sequences (short reads, long reads, contigs, ...) from metagenomic samples to their most likely taxon of origin. It aims to reduce the memory requirement usually associated with k-mer based methods while retaining their speed. MetaCache uses locality sensitive hashing to quickly identify candidate regions within one or multiple reference genomes. A read is then classified based on the similarity to those regions. For an independent comparison to other tools in terms of classification accuracy see the LEMMI benchmarking site. The latest version of MetaCache classifies around 60 Million reads (of length 100) per minute against all complete bacterial, viral and archaea genomes from NCBI RefSeq Release 97 running with 88 threads on a workstation with 2 Intel(R) Xeon(R) Gold 6238 CPUs. https://muellan.github.io/metacache
Version 0.0.0-0c7146d
Commit ID 6c4ed3a220310b905a1fc9d04f05be2e0837142b
Release Date 08 Apr 2026 19:17:17 (UTC)
Download URL https://registry.nextflow.io/api/v1/modules/nf-core%2Fmetacache%2Fbuild/0.0.0-0c7146d/download
OCI Store URL https://public.cr.seqera.io/v2/nextflow/plugin/modules/nf-core/metacache/build/blobs/sha256:872401f4efef5e7221cda34577b6c15351ec398ccc04d869bffd7c236dbce7d4
Size 4.0 KB
Checksum sha256:872401f4efef5e7221cda34577b6c15351ec398ccc04d869bffd7c236dbce7d4
Downloads 2
Version Date Status Downloads Size
0.0.0-6c4ed3a 23 Apr 2026 15:30:10 (UTC) 3 4.0 KB
0.0.0-3fc6d40 22 Apr 2026 15:21:31 (UTC) 3 4.0 KB
0.0.0-7f68141 16 Apr 2026 15:01:10 (UTC) 3 4.0 KB
0.0.0-3c11db3 12 Apr 2026 15:00:07 (UTC) 1 4.0 KB
0.0.0-0c7146d 08 Apr 2026 19:17:17 (UTC) 2 4.0 KB