Skip to content

Validation

Study Cohorts

MyoPath was validated on 478 H&E whole slide images from two independent cohorts:

CohortnSourceComposition
GTEx399Genotype-Tissue Expression ProjectAutopsy skeletal muscle; healthy controls and subclinical wasting spectrum
HuashanMuscle79Huashan Hospital, Fudan UniversityClinical muscle biopsies; DM1, LGMD, inflammatory myopathy, and healthy controls

The GTEx cohort served as the training set; HuashanMuscle served as the independent external validation set. No retraining was performed on HuashanMuscle data.

Segmentation Performance

Segmentation accuracy was assessed by Dice coefficient and Intersection-over-Union (IoU) against manual expert annotations:

Tissue LayerMethodDiceIoU
MyofiberCellpose-SAM0.92 ± 0.030.85 ± 0.06
FatPixel classifier0.95 ± 0.020.91 ± 0.03
NucleusWatershed0.87 ± 0.040.78 ± 0.06
Connective tissueBoolean subtraction0.88 ± 0.040.78 ± 0.06

Reproducibility

Intraclass correlation coefficients (ICC) for all seven pathology indicators exceeded 0.88, indicating high inter-rater and intra-rater reproducibility of the automated pipeline.

MyoPath Score Performance

Discrimination

DatasetAUCMetric
GTEx (LOO-CV)0.735Leave-one-out cross-validation
GTEx (full model)0.788Training set AUC
HuashanMuscle0.873External validation (no retraining)

The higher AUC on the external validation set suggests that clinically diagnosed myopathy (HuashanMuscle) produces more separable morphometric changes than subclinical wasting (GTEx).

Primary Biomarkers

Two indicators emerged as the strongest individual discriminators:

Biomarkerp-valueEffect size (rank-biserial r)
NCI1.3×1050.69
Fiber CV2.9×1040.58

Both carry the largest standardized regression coefficients in the MyoPath Score model (βNCI=0.57, βCV=0.53).

Clinical Correlations

Disease-Specific Findings

Myotonic dystrophy type 1 (DM1):

  • NCI was highest in DM1 (median 0.121), consistent with centronuclear pathology
  • NCI correlated with CTG repeat count: Spearman ρ=0.46, p=0.042
  • Fiber CV inversely correlated with grip strength: ρ=0.61, p=0.031

Limb-girdle muscular dystrophy (LGMD):

  • Fiber CV showed a dose-response with mutation severity:
    • 2× Missense: CV = 0.44
    • LoF + Missense: CV = 0.49
    • 2× LoF: CV = 0.65

GTEx wasting spectrum:

  • NCI showed a significant dose-response trend across wasting severity grades (Jonckheere-Terpstra p<104)

Cross-Modal Validation with MyoScore

MyoPath pathology metrics correlate with MyoScore's transcriptomic dimensions, demonstrating cross-modal consistency:

MyoScore DimensionMyoPath IndicatorDirection
LeanMuscleFat infiltrationNegative correlation
YouthFibrosis extentNegative correlation
GMHS compositeFiber CVNegative correlation

This convergence between transcriptomic and morphometric assessments supports the biological validity of both tools.

Limitations

  • ROI dependency: Tissue composition metrics (fat, fibrosis) vary with ROI placement. NCI and Fiber CV are more robust.
  • H&E only: The pipeline is optimized for H&E stains. Special stains (ATPase, NADH) and immunohistochemistry are not supported.
  • Single ROI: Current validation used one ROI per slide. Multi-ROI strategies may improve representativeness for heterogeneous biopsies.
  • Connective tissue estimation: Boolean subtraction inherits errors from muscle and fat segmentation layers.
  • Population scope: Training was performed on GTEx (predominantly European ancestry). Performance on other populations requires further validation.
最近更新