Score trajectory
- Steady climb — consistent improvement across iterations
- Plateau — scores flatten, suggesting diminishing returns from the current approach
- Over-optimization — scores peak then regress (model may be overfitting to a pattern)
- Gap closed — how much of the remaining gap (to 1.0) was closed
Strategy breakdown
When using auto mode, reflex shows what each phase contributed:- Structural helped — “Structure matters: reorganizing how instructions are presented can dramatically improve model comprehension.”
- Iterative helped — “Specificity matters: models follow precise, explicit instructions better than vague ones.”
- Fewshot helped — “Examples matter: showing the model what good output looks like is one of the most reliable ways to improve quality.”
- Phase hurt performance — the analysis explains what went wrong and what to avoid
Prompt diff
- Length changes and what they mean
- New structural features (headers, bullets, XML tags, examples)
- Added constraints or format specifications