Simulation Results


Detection Statistics


            

Impact on Diversity Measurements


Detection by Clone Size

Sequencing Depth Effect

Compare different sequencing depths (1K to 1M reads):

Understanding the Simulation

Biological Context

Adaptive immune repertoires contain thousands to millions of unique clonotypes (T-cell or B-cell receptor sequences). A few clones undergo massive expansion (hyperexpanded), while most remain rare (singlets or small clones).

Clonality Bias

When sequencing depth is insufficient:

  • Hyperexpanded clones are almost always detected
  • Rare clones are missed at high rates
  • This makes repertoires appear MORE clonal than they truly are
  • Diversity is systematically UNDERestimated

Parameters

  • Power-law exponent (α): Controls distribution shape. Higher α = steeper decline, more extreme inequality between large and small clones.
  • Sequencing reads: Total number of sequences obtained. Compare to total cells to assess sampling fraction.
  • Minimum threshold: Sequences appearing fewer than this many times are discarded (to filter sequencing errors). Higher thresholds increase bias against rare clones.

Key Metrics

  • Richness: Number of unique clones detected
  • Shannon entropy: Diversity measure accounting for abundance
  • Clonality: Inverse of normalized diversity (0=maximally diverse, 1=monoclonal)
  • Gini coefficient: Inequality measure (0=perfect equality, 1=maximum inequality)