Performance Testing

Evaluate the distilled model's performance against baseline models.

Model Performance Comparison
Compare key metrics between original and distilled models
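
A comparison of this kind can be sketched as a per-metric report of the absolute difference and the distilled-to-original ratio. The following is a minimal Python sketch, not the tool's actual implementation: the function name `compare_metrics`, the metric names, and the numeric values are all hypothetical placeholders chosen for illustration.

```python
from typing import Dict


def compare_metrics(
    original: Dict[str, float], distilled: Dict[str, float]
) -> Dict[str, Dict[str, float]]:
    """Return delta and ratio (distilled / original) for metrics both models report."""
    report: Dict[str, Dict[str, float]] = {}
    # Only metrics present for both models can be compared.
    for name in sorted(original.keys() & distilled.keys()):
        base = original[name]
        new = distilled[name]
        report[name] = {
            "original": base,
            "distilled": new,
            "delta": new - base,
            # Guard against division by zero when the baseline value is 0.
            "ratio": new / base if base != 0 else float("nan"),
        }
    return report


# Placeholder values for illustration only; these are not measured results.
original = {"accuracy": 0.80, "latency_ms": 200.0}
distilled = {"accuracy": 0.76, "latency_ms": 50.0}
report = compare_metrics(original, distilled)
```

Whether a higher ratio is better depends on the metric: for accuracy a ratio near 1.0 means little quality was lost, while for latency a ratio below 1.0 means the distilled model is faster.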
Capability Assessment
Radar chart comparing the distilled model's capabilities with those of the original model
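
The geometry behind a radar chart can be sketched as follows: each capability gets an evenly spaced axis around a circle, and its score sets the distance from the center along that axis. This is a minimal Python sketch using only the standard library; the function name `radar_vertices`, the capability names, and the scores are hypothetical and do not come from the page.

```python
import math
from typing import Dict, List, Tuple


def radar_vertices(
    scores: Dict[str, float], max_score: float = 1.0
) -> List[Tuple[str, float, float]]:
    """Map each capability score to an (x, y) vertex of a radar polygon."""
    names = list(scores)
    n = len(names)
    points: List[Tuple[str, float, float]] = []
    for i, name in enumerate(names):
        # First axis points straight up; later axes proceed clockwise.
        angle = math.pi / 2 - 2 * math.pi * i / n
        # Normalize the score so max_score lies on the unit circle.
        radius = scores[name] / max_score
        points.append((name, radius * math.cos(angle), radius * math.sin(angle)))
    return points


# Placeholder capability scores for illustration only; not measured results.
distilled_scores = {"reasoning": 1.0, "coding": 0.5, "math": 0.5, "summarization": 1.0}
polygon = radar_vertices(distilled_scores)
```

Computing one polygon per model with the same capability order and the same `max_score` lets the two shapes be overlaid on shared axes, which is what makes the comparison readable.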