Astrocyte subcluster proportions

Published

March 18, 2026

Code

library(qs2)
library(Seurat)
library(tidyverse)
library(speckle)
library(kableExtra)

astro <- UpdateSeuratObject(qs_read("seurat_objects/20260318-astro_cleaned2_0.3.qs2"))
Idents(astro) <- "seurat_clusters"
# Order samples by Treatment.Group so groups are adjacent
astro$sample_ID <- factor(
  astro$sample_ID,
  levels = c("KK4_465", "KK4_504", "KK4_496", "KK4_492", "KK4_502", "KK4_464"))

Sample ID	Treatment Group
KK4_465	IgG
KK4_504	IgG
KK4_496	IgG
KK4_492	Adu
KK4_502	Adu
KK4_464	Adu

Code

p <- plotCellTypeProps(astro,
                  clusters = Idents(astro),
                  sample = astro$sample_ID)

# Enforce sample_ID factor order on x-axis
sample_order <- c("KK4_465", "KK4_504", "KK4_496", "KK4_492", "KK4_502", "KK4_464")
p + scale_x_discrete(limits = sample_order) +
  theme_minimal() +
  theme(axis.title = element_text(size = 14),
        axis.text.x = element_text(size = 13, angle = 30, hjust = 1),
        axis.text.y = element_text(size = 13),
        legend.text = element_text(size = 14))

The function propeller from the speckl package tests whether cell-type proportions differ between experimental conditions, while properly accounting for biological replication and sample-to-sample variability.

aggregates single cells into sample-level cell-type proportions,
applies a variance-stabilizing transformation (e.g. logit or arcsin),
fits a linear model for each cell type,
uses empirical Bayes moderation to stabilize variance estimates across cell types, and
controls for multiple testing.

As a result, propeller identifies true compositional changes in cell populations and avoids false positives driven by uneven cell capture or outlier samples.

Transform In propeller, transform refers to converting raw cell-type proportions into a scale where statistical testing is valid. Proportions are bounded between 0 and 1 and have unequal variance, so transforming them stabilizes variance and allows linear models to be used appropriately.

Logit The logit transform maps proportions from [0,1] to \((-\infty, +\infty)\) using

\(\log\left(\frac{p}{1-p}\right)\)

This reduces heteroskedasticity, improves power to detect differences in cell-type abundance, and makes effects interpretable as differences in log-odds, while small offsets are used to handle zeros. ***

Robust In propeller, robust refers to using robust empirical Bayes variance estimation in the linear model. This down-weights the influence of outlier samples with extreme cell-type proportions, preventing them from artificially inflating significance.

In practice, robust = TRUE makes the test more resistant to technical or biological outliers while preserving group means, leading to more reliable inference on cell-type composition differences.

Test for differences in astrocyte subcluster proportions between treatment groups in total brain

Code

# The propeller function can take a SingleCellExperiment object or Seurat object as input
# and extract the three necessary  pieces of information from the cell information stored in colData.
#  The three essential pieces of information are

# cluster (Idents function by default)
# sample
# group

prop <- propeller(
  x = astro,
  clusters = Idents(astro),
  sample = astro$sample_ID,
  group = astro$Treatment.Group,
  trend = FALSE,
  robust = TRUE,
  transform = "logit")

prop |> as_tibble () |>
  kbl(digits = 3) |>
  kable_styling("striped")

BaselineProp.clusters	BaselineProp.Freq	PropMean.Adu	PropMean.IgG	PropRatio	Tstatistic	P.Value	FDR
6	0.023	0.027	0.019	1.431	1.882	0.089	0.451
5	0.061	0.054	0.068	0.789	-1.601	0.140	0.451
7	0.020	0.017	0.023	0.764	-1.341	0.209	0.451
2	0.145	0.157	0.129	1.214	1.290	0.226	0.451
4	0.080	0.077	0.083	0.930	-0.302	0.769	0.956
3	0.137	0.137	0.139	0.984	-0.176	0.863	0.956
1	0.252	0.249	0.258	0.964	-0.167	0.871	0.956
0	0.281	0.282	0.280	1.005	0.057	0.956	0.956

Interpretation

Column	Brief explanation
`BaselineProp.clusters`	Cell type or cluster being tested for proportional differences between conditions.
`BaselineProp.Freq`	Overall mean proportion of the cluster across all samples, independent of condition.
`PropMean.Adu`	Mean sample-level proportion of the cluster in the Adu group.
`PropMean.IgG`	Mean sample-level proportion of the cluster in the IgG group.
`PropRatio`	Ratio of mean proportions between groups (Adu / IgG); values >1 indicate enrichment in Adu, <1 depletion.
`Tstatistic`	Moderated t-statistic testing whether transformed proportions differ between conditions; sign indicates direction.
`P.Value`	Raw p-value for the difference in proportions for that cluster.
`FDR`	False discovery rate–adjusted p-value accounting for testing across all clusters.