You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix: add runtime version banner, log after unpack for accuracy (#71)
* feat(worker): add version logging after unpack for accurate reporting
Log worker, runpod-flash, and runpod versions at boot. Banner is
emitted after maybe_unpack() so the bundled runpod_flash (from flash
build) is on sys.path, reporting the actual deployed version rather
than the base image's pip-installed version.
- Add version.py with __version__ constant and package version helpers
- Read flash version from bundled runpod_flash.__version__ first
- Fall back to importlib.metadata for non-bundled environments
- Move version banner to after unpack in both QB and LB handlers
* refactor(version): read worker version from pyproject.toml instead of hardcoded constant
Worker version is now read from the co-located pyproject.toml via regex,
eliminating manual version maintenance. Falls back to importlib.metadata
if pyproject.toml is unavailable.
* fix(version): use __version__ constant instead of pyproject.toml reading
The pyproject.toml-reading approach fails because maybe_unpack() extracts
the user's flash project to /app/, overwriting the worker's pyproject.toml
with the user's (version 0.1.0). Use a release-please-managed __version__
constant instead.
* perf(ci): remove 56-min GPU LB build from PR checks
- Remove docker-test-lb job (pytorch base, build-only, no tests)
- Update docker-validation gate to 4 check jobs
- Replace jlumbroso/free-disk-space with targeted rm -rf in prod jobs
docker-test-lb-cpu already validates LB code (~2 min) and docker-test
validates the pytorch base image. GPU LB image still builds on release
via docker-prod-lb.
* perf(ci): optimize PR docker jobs and add LB handler test
Remove unnecessary steps from PR docker jobs that only apply to prod
cross-platform builds: lint dependency (validation gate already checks),
QEMU (amd64-only), uv setup (Dockerfiles handle deps), disk cleanup
(CPU images are ~150MB vs 14GB free), and full git history fetch.
Add LB handler test to docker-test-lb-cpu for regression coverage.
* fix(ci): use system python in LB handler test inside Docker
test-lb-handler.sh unconditionally used `uv run` which creates a fresh
venv and re-downloads 117 packages inside Docker, consuming the entire
30s timeout before the server can start. Apply the same Docker detection
pattern used in test-handler.sh: use system python directly when running
inside a container where packages are pre-installed.
* fix(ci): use /ping endpoint in LB handler test
The test script polled /health for readiness but lb_handler.py only
exposes /ping. Server started fine but the check always timed out.
* chore(deps): update dependencies
boto3 1.42.57->1.42.58, ruff 0.15.3->0.15.4, runpod-flash 1.4.1->1.4.2
0 commit comments