Metrics
This package contains all the metrics used for benchmarking scRNA-seq data integration performance. The metrics can be classified into biological conservation and batch removal metrics. For a detailed description of the metrics implemented in this package, please see our publication.
Biological Conservation Metrics
Biological conservation metrics quantify either the integrity of cluster-based metrics based on clustering results of the integration output, or the difference in the feature spaces of integrated and unintegrated data. Each metric is scaled to a value ranging from 0 to 1 by default, where larger scores represent better conservation of the biological aspect that the metric addresses.
|
Highly variable gene overlap |
|
Average silhouette width (ASW) |
|
Isolated label score |
|
Normalized mutual information |
|
Adjusted Rand Index |
|
Cell cycle conservation score |
|
Trajectory conservation score |
|
Cell-type LISI (cLISI) score |
Batch Correction Metrics
Batch correction metrics values are scaled by default between 0 and 1, in which larger scores represent better batch removal.
|
Graph Connectivity |
|
Batch ASW |
|
Principal component regression score |
|
kBET score |
|
Integration LISI (iLISI) score |
Metrics Wrapper Functions
For convenience, scib
provides wrapper functions that, given integrated and unintegrated adata objects, apply
multiple metrics and return all the results in a pandas.Dataframe
.
The main function is metrics()
, that provides all the parameters for the different metrics.
scib.metrics.metrics(adata, adata_int, ari=True, nmi=True)
The remaining functions call the metrics()
for
Furthermore, metrics()
is wrapped by convenience functions with preconfigured subsets of metrics
based on expected computation time:
metrics_fast()
only computes metrics that require little preprocessingmetrics_slim()
includes all functions ofmetrics_fast()
and adds clustering-based metricsmetrics_all()
includes all metrics
|
Master metrics function |
|
Only metrics with minimal preprocessing and runtime |
|
All metrics apart from kBET and LISI scores |
|
All metrics |
Auxiliary Functions
|
cLISI and iLISI scores |
|
Principal component regression for anndata object |
|
Principal component regression |