cellgeni/seacells @ 1.0.0
cellgeni/seacells
Summary
Runs the SEACells algorithm to aggregate single-cell profiles into a smaller set of representative metacells (cellular states) from single-cell genomics data (e.g. scRNA-seq or scATAC-seq).
Get started
Include this module in your Nextflow pipeline:
include { SEACELLS } from 'cellgeni/seacells'
Inputs
tuple val(meta), path(adata)meta: sample metadata map (expects anidkey; used for tagging and default output prefixing)adata: input AnnData object in.h5adformat
Outputs
h5ad:tuple val(meta), path("*.h5ad")(metacell AnnData object; typically includesseacell_metacells.h5ad)csv:tuple val(meta), path("*.csv")(assignment tables; hard/soft assignments)npy:tuple val(meta), path("*.npy")(assignment weights)pdf:tuple val(meta), path("*.pdf")(diagnostic/QC plots)pkl:tuple val(meta), path("*.pkl")(pickled SEACells model)versions:path("versions.yml")(tool/package versions captured at runtime)
Parameters
This module supports passing arguments to seacells_aggregate.py via task.ext.args.
When running the module directly with nextflow module run, you set these at the command line using Nextflow “process options”:
-process.ext.args='<SEACELLS_ARGS>'(i.e.-process.ext.args=<SEACELLS_ARGS>)-process.ext.prefix='<SAMPLE_PREFIX>'(i.e.-process.ext.prefix=<SAMPLE_PREFIX>, optional; defaults to${meta.id})
Defaults are:
--type gex --n_top_genes 2000 --n_components 50 --convergence_epsilon 0.00001 --min_iter 10 --max_iter 50
The output/sample prefix can be controlled via task.ext.prefix (defaults to ${meta.id}).
SEACells arguments
These are the supported arguments you can include in ext.args:
--type(required): input data modality. Choices:gexoratac.--n_metacells(optional): explicit number of metacells to infer.--gamma(optional): sets metacell count adaptively as $n_{metacells} = \mathrm{round}(n_{cells} / \gamma)$. Mutually exclusive with--n_metacells.--n_top_genes(optional, default2000): number of highly-variable genes to select (GEX preprocessing).--n_components(optional, default50): number of components used for PCA (GEX) or LSI (ATAC).--convergence_epsilon(optional, default1e-5): convergence threshold for optimization.--min_iter(optional, default10): minimum iterations for the SEACells fitting step.--max_iter(optional, default50): maximum iterations for the SEACells fitting step.--n_waypoint_eigs(optional, default10): number of waypoint eigenvectors used during initialization.--use_sparse(flag): use sparse matrix operations (if supported by the input representation).--precomputed(optional): key inadata.obsmcontaining a precomputed embedding to use instead of running PCA/LSI.--celltype_label(optional): obs column name used to compute cell type purity (QC plot).--delimiter(optional): append sample suffix to barcodes using this delimiter.
Notes:
--adata,--sample, and--output_dirare handled by the module wrapper and do not need to be provided inext.args.- You must provide exactly one of
--n_metacellsor--gamma.
Full nextflow module run example
/software/cellgen/cellgeni/nextflow/26.04.0/nextflow module run cellgeni/seacells \
--meta.id pbmc_10k \
--adata /path/to/pbmc10k.h5ad \
-process.ext.prefix=pbmc_10k \
-process.ext.args='--type gex --gamma 75 --n_top_genes 2000 --n_components 50 --convergence_epsilon 0.00001 --min_iter 10 --max_iter 50'
Dependencies
This module runs inside the container quay.io/cellgeni/seacells:latest.
Citation
Persad S, Choo Z-N, Dien C, et al. SEACells: Inference of transcriptional and epigenomic cellular states from single-cell genomics data. bioRxiv (2022). https://doi.org/10.1101/2022.04.02.486748
License
MIT
meta
map
|
Groovy Map containing sample information e.g. [ id:'test' ] |
|---|---|
adata
file
|
AnnData object in h5ad format *.{h5ad}
|
meta
tuple
meta
map
|
Groovy Map containing sample information e.g. [ id:'test' ] |
|---|---|
*.pkl
file
|
Serialized SEACells model *.pkl
|
versions
versions
file
|
YAML file containing software versions used versions.yml
|
|---|
| Tool | Description | Homepage |
|---|---|---|
| seacells | SEACells learns archetypal cellular states and aggregates cells into metacells. | https://github.com/dpeerlab/SEACells |
| Version | 1.0.0 |
|---|---|
| Release Date | 29 Apr 2026 20:11:28 (UTC) |
| Download URL | https://registry.nextflow.io/api/v1/modules/cellgeni%2Fseacells/1.0.0/download |
| OCI Store URL | https://public.cr.seqera.io/v2/nextflow/plugin/modules/cellgeni/seacells/blobs/sha256:58f6116abd3d0260bd914394cb892fce91384ec51812124673306adf35b18169 |
| Size | 6.1 KB |
| Checksum | sha256:58f6116abd3d0260bd914394cb892fce91384ec51812124673306adf35b18169 |
| Downloads | 7 |
| Version | Date | Status | Downloads | Size |
|---|---|---|---|---|
| 1.0.0 | 29 Apr 2026 20:11:28 (UTC) | 7 | 6.1 KB |