Experiment hygiene and threats-to-validity checks #39

Closed
opened 2026-01-31 17:18:25 +01:00 by erikinkinen · 0 comments
Owner

Goal
Prevent trivial or misleading conclusions.

Steps

  • Enforce comparable workloads across strategies
  • Add multi-seed confidence intervals
  • Add sanity checks before long runs
**Goal** Prevent trivial or misleading conclusions. **Steps** * [ ] Enforce comparable workloads across strategies * [ ] Add multi-seed confidence intervals * [ ] Add sanity checks before long runs
erikinkinen added this to the Phase 1 milestone 2026-01-31 17:18:25 +01:00
Sign in to join this conversation.
No milestone
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
erikinkinen/AES#39
No description provided.