scib.preprocessing.reduce_data
- scib.preprocessing.reduce_data(adata, batch_key=None, flavor='cell_ranger', n_top_genes=2000, n_bins=20, pca=True, pca_comps=50, overwrite_hvg=True, neighbors=True, use_rep='X_pca', umap=False)
Apply feature selection and dimensionality reduction steps.
Wrapper function of feature selection (highly variable genes), PCA, neighbours computation and dimensionality reduction. Highly variable gene selection is batch-aware, when a batch key is given.
- Parameters:
adata –
anndata
object with normalised and log-transformed data inadata.X
batch_key – column in
adata.obs
containing batch assignmentflavor – parameter for
scanpy.pp.highly_variable_genes
n_top_genes – parameter for
scanpy.pp.highly_variable_genes
n_bins – parameter for
scanpy.pp.highly_variable_genes
pca – whether to compute PCA
pca_comps – number of principal components
overwrite_hvg – if True, ignores any pre-existing ‘highly_variable’ column in adata.var and recomputes it if n_top_genes is specified else calls PCA on full features. if False, skips HVG computation even if n_top_genes is specified and uses pre-existing HVG column for PCA
neighbors – whether to compute neighbours graph
use_rep – embedding to use for neighbourhood graph
umap – whether to compute UMAP representation