Explore Results gives access to individual sample-level results.
Interpreting results
You can use your evaluation results to:- Compare baseline models to fine-tuned models
- Identify regressions or improvements due after model changes
- Decide whether to retrain, adjust data, or refine evaluators