The native use of scout: run the same scenarios against baseline and target, get an HTML diff report. One-shot, deterministic, attached to a PR or release. No scheduler, no dashboard — just a CLI in your existing pipeline.
Before tagging a release, run scout against the candidate vs the last shipped version. Diff report attached to the release PR shows reviewers exactly what API behavior changed under real UI use.
Upgrading a dependency, framework, or infrastructure component. Run scout against pre-upgrade and post-upgrade environments. Surface any behavior drift the upgrade introduced.
Suspicious bug report from a downstream integration. Run scout against the current production version and a known-good past version. Either confirm or rule out API-shape changes as the cause.
Pixel-anchored Python scenarios in your repo. Hand-written or generated by recording tooling.
scout run scenarios/ --web-version $BASELINE drives the UI, records every API call.
Same scenarios, new version. Deterministic — same Locators, same trace.
scout diff <baseline-id> <target-id> produces the HTML report. Attach to your PR or release notes.
scout is a CLI. Whatever you already use — GitHub Actions, GitLab, Forgejo, CircleCI, Jenkins — drop in two run steps.
- run: pip install boxprobe-scout
- run: playwright install chromium
- run: scout run scenarios/ --web-version $BASELINE_VERSION
- run: scout run scenarios/ --web-version $TARGET_VERSION
- run: scout diff $BASELINE_RUN_ID $TARGET_RUN_ID JUnit XML output alongside the HTML for status-only integrations. HTML diff report uploaded as a build artifact — reviewers click through to inspect findings. No data leaves the runner.
diff_ignore.json15 scenarios across two pinned Medusa releases, full reproduce-locally instructions.