Add judge docs, CI, ablations, offline replay, and before/after eval #1
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| name: CI | |
| on: | |
| push: | |
| branches: [main, master] | |
| pull_request: | |
| branches: [main, master] | |
| jobs: | |
| test: | |
| runs-on: ubuntu-latest | |
| steps: | |
| - uses: actions/checkout@v4 | |
| - uses: actions/setup-python@v5 | |
| with: | |
| python-version: "3.11" | |
| - name: Install dependencies | |
| run: | | |
| python -m pip install --upgrade pip | |
| pip install -r requirements.txt | |
| - name: pytest | |
| run: pytest tests/ -v --tb=short | |
| - name: openenv validate | |
| run: openenv validate . | |
| - name: Ablation script (smoke) | |
| run: python scripts/ablation.py --quick | |
| - name: Before/after eval | |
| run: python training/eval_before_after.py --save-dir results |