Simulation Results
Detection Statistics
Impact on Diversity Measurements
Detection by Clone Size
Sequencing Depth Effect
Compare different sequencing depths (1K to 1M reads):
Understanding the Simulation
Biological Context
Adaptive immune repertoires contain thousands to millions of unique clonotypes (T-cell or B-cell receptor sequences). A few clones undergo massive expansion (hyperexpanded), while most remain rare (singlets or small clones).
Clonality Bias
When sequencing depth is insufficient:
- Hyperexpanded clones are almost always detected
- Rare clones are missed at high rates
- This makes repertoires appear MORE clonal than they truly are
- Diversity is systematically UNDERestimated
Parameters
- Power-law exponent (α): Controls distribution shape. Higher α = steeper decline, more extreme inequality between large and small clones.
- Sequencing reads: Total number of sequences obtained. Compare to total cells to assess sampling fraction.
- Minimum threshold: Sequences appearing fewer than this many times are discarded (to filter sequencing errors). Higher thresholds increase bias against rare clones.
Key Metrics
- Richness: Number of unique clones detected
- Shannon entropy: Diversity measure accounting for abundance
- Clonality: Inverse of normalized diversity (0=maximally diverse, 1=monoclonal)
- Gini coefficient: Inequality measure (0=perfect equality, 1=maximum inequality)