nf-core/metacache/build @ 0.0.0-6c4ed3a
Summary
Taxonomic profiling database building with MetaCache
Get started
Add the following snippet to your workflow script to include this module.
include { METACACHE_BUILD } from 'nf-core/metacache/build'
License
MIT License
Process
Name
|
METACACHE_BUILD |
|---|
Input
3 channels
#1
tuple
meta
map
|
Groovy Map containing sample information e.g. |
|---|---|
genome_files
file
|
(possibly gzipped) fasta or fastq files of full genomes, for example from an NCBI assembly *.{fna,fa,fasta,fnq,fq,fastq}{,.gz}
|
taxonomy
file
|
NCBI taxonomy formatted files nodes.dmp and names.dmp {names,nodes,merged}.dmp
|
|---|
seq2taxid
file
|
NCBI-style 'accession2taxid' tab-separated file with 3 or 4 columns: accession, accession_version, taxid, and gid (optional) *
|
|---|
Output
2 channels
#1
db
tuple
meta
map
|
Groovy Map containing sample information e.g. |
|---|---|
*.meta
file
|
sequence signature database binary file *.meta
|
*.cache*
file
|
sequence signature database binary files *.cache+([0-9])
|
#2
versions_metacache
tuple
${task.process}
string
|
The name of the process |
|---|---|
metacache
string
|
The name of the tool |
metacache info |& sed -n 's/^MetaCache version \+\([0-9.]\+\).*\$/\1/p'
eval
|
The expression to obtain the version of the tool |
| Tool | Description | Homepage |
|---|---|---|
| metacache | MetaCache is a classification system for mapping genomic sequences (short reads, long reads, contigs, ...) from metagenomic samples to their most likely taxon of origin. It aims to reduce the memory requirement usually associated with k-mer based methods while retaining their speed. MetaCache uses locality sensitive hashing to quickly identify candidate regions within one or multiple reference genomes. A read is then classified based on the similarity to those regions. For an independent comparison to other tools in terms of classification accuracy see the LEMMI benchmarking site. The latest version of MetaCache classifies around 60 Million reads (of length 100) per minute against all complete bacterial, viral and archaea genomes from NCBI RefSeq Release 97 running with 88 threads on a workstation with 2 Intel(R) Xeon(R) Gold 6238 CPUs. | https://muellan.github.io/metacache |
| Version | 0.0.0-6c4ed3a |
|---|---|
| Commit ID | 6c4ed3a220310b905a1fc9d04f05be2e0837142b |
| Release Date | 23 Apr 2026 15:30:10 (UTC) |
| Download URL | https://registry.nextflow.io/api/v1/modules/nf-core%2Fmetacache%2Fbuild/0.0.0-6c4ed3a/download |
| OCI Store URL | https://public.cr.seqera.io/v2/nextflow/plugin/modules/nf-core/metacache/build/blobs/sha256:b61f0a70bd733eb9f7e994f4d4619036658076ce04cabd75156ece8bfc298957 |
| Size | 4.0 KB |
| Checksum | sha256:b61f0a70bd733eb9f7e994f4d4619036658076ce04cabd75156ece8bfc298957 |
| Downloads | 3 |
| Version | Date | Status | Downloads | Size |
|---|---|---|---|---|
| 0.0.0-6c4ed3a | 23 Apr 2026 15:30:10 (UTC) | 3 | 4.0 KB | |
| 0.0.0-3fc6d40 | 22 Apr 2026 15:21:31 (UTC) | 3 | 4.0 KB | |
| 0.0.0-7f68141 | 16 Apr 2026 15:01:10 (UTC) | 3 | 4.0 KB | |
| 0.0.0-3c11db3 | 12 Apr 2026 15:00:07 (UTC) | 1 | 4.0 KB | |
| 0.0.0-0c7146d | 08 Apr 2026 19:17:17 (UTC) | 2 | 4.0 KB |