The clustering stability (`cs`) accessor#

The clustering stability accessor provides metrics for assessing the stability of clustering results across different resolutions and random subsets of genes.

segtraq.cs.clustering_stability.compute_ari(sdata: SpatialData, resolution: float = 1.0, frac_cells_subset: float = 0.63, key_prefix: str = 'leiden_subset', inplace: bool = True) → float#

Compute the clustering stability using pairwise adjusted Rand index (ARI) on random subsets of genes.

Parameters:

sdata (sd.SpatialData) – The SpatialData object containing clustering information.
resolution (float, optional) – The resolution parameter for Leiden clustering, by default 1.0.
n_genes_subset (int, optional) – The number of genes to subset for clustering, by default 100.
key_prefix (str, optional) – The prefix for the keys under which the clustering results are stored, by default “leiden_subset”.
inplace (bool, optional) – Whether to store the computed ARI in sdata.uns, by default True.

Returns:

The average pairwise ARI across the specified cluster keys.

Return type:

float

segtraq.cs.clustering_stability.compute_mean_cosine_distance(sdata: SpatialData, resolution: float | list[float] = (0.6, 0.8, 1.0), key_prefix: str = 'leiden_subset', random_state: int = 42, cell_type_key: str | None = None, inplace: bool = True) → float#

Compute mean cosine distance for different Leiden clustering resolutions and report the best (lowest) mean cosine distance. If a cell_type_key is provided, compute the mean cosine distance for that clustering only.

Parameters:

sdata (sd.SpatialData) – The SpatialData object containing clustering information.
resolution (float or list of float, optional) – The resolution parameter(s) for Leiden clustering, by default (0.6, 0.8, 1.0).
key_prefix (str, optional) – Prefix for clustering keys in .obs, by default “leiden_subset”.
random_state (int, optional) – Seed for reproducibility, by default 42.
cell_type_key (str, optional) – If provided, compute the mean cosine distance for this clustering only.
inplace (bool, optional) – Whether to store the computed mean cosine distance in sdata.uns, by default True.

Returns:

The best (lowest) mean cosine distance across resolutions.

Return type:

float

segtraq.cs.clustering_stability.compute_purity(sdata: SpatialData, resolution: float = 1.0, frac_cells_subset: float = 0.63, key_prefix: str = 'leiden_subset', inplace: bool = True) → float#

Compute the clustering stability using pairwise purity on random subsets of genes. :param sdata: The SpatialData object containing clustering information. :type sdata: sd.SpatialData :param resolution: The resolution parameter for Leiden clustering, by default 1.0. :type resolution: float, optional :param frac_cells_subset: The fraction of cells to subset for clustering, by default 0.63. :type frac_cells_subset: float, optional :param key_prefix: The prefix for the keys under which the clustering results are stored, by default “leiden_subset”. :type key_prefix: str, optional :param inplace: Whether to store the computed purity in sdata.uns, by default True. :type inplace: bool, optional

Returns:: The average pairwise purity across the specified cluster keys.
Return type:: float

segtraq.cs.clustering_stability.compute_rmsd(sdata: SpatialData, resolution: float | list[float] = (0.6, 0.8, 1.0), key_prefix: str = 'leiden_subset', random_state: int = 42, cell_type_key: str | None = None, inplace: bool = True) → float#

Compute RMSD for different Leiden clustering resolutions and report the best (lowest) RMSD. If a cell_type_key is provided, compute the RMSD for that clustering only.

Parameters:

sdata (sd.SpatialData) – The SpatialData object containing clustering information.
resolution (float or list of float, optional) – The resolution parameter(s) for Leiden clustering, by default (0.6, 0.8, 1.0).
key_prefix (str, optional) – Prefix for clustering keys in .obs, by default “leiden_subset”.
random_state (int, optional) – Seed for reproducibility, by default 42.
cell_type_key (str, optional) – If provided, compute the RMSD for this clustering only.
inplace (bool, optional) – Whether to store the computed RMSD in sdata.uns, by default True.

Returns:

The best (lowest) RMSD across resolutions.

Return type:

float

segtraq.cs.clustering_stability.compute_silhouette_score(sdata: SpatialData, resolution: float | list[float] = (0.6, 0.8, 1.0), metric: str = 'euclidean', key_prefix: str = 'leiden_subset', random_state: int = 42, cell_type_key: str | None = None, inplace: bool = True) → float#

Compute the silhouette score for different resolutions and report the best one. If a cell_type_key is provided, compute the silhouette score for provided labels.

Parameters:

sdata (sd.SpatialData) – The SpatialData object containing clustering information.
resolution (float, optional) – The resolution parameter for Leiden clustering, by default 1.0.
metric (str, optional) – The metric to use for silhouette score calculation, by default “euclidean”.
key_prefix (str, optional) – The prefix for the keys under which the clustering results are stored, by default “leiden_subset”.
random_state (int, optional) – Seed for reproducibility, by default 42.
cell_type_key (str, optional) – If provided, compute the silhouette score for provided labels.
inplace (bool, optional) – Whether to store the computed silhouette score in sdata.uns, by default True.

Returns:

The silhouette score of the clustering.

Return type:

float

The clustering stability (cs) accessor

Contents

The clustering stability (`cs`) accessor#

The clustering stability (cs) accessor

Contents

The clustering stability (cs) accessor#

The clustering stability (`cs`) accessor#