5/6 Parallelize OTLP corpus generation; download compressed .pb when available by gareth-ellis · Pull Request #2138 · elastic/rally

gareth-ellis · 2026-05-29T14:11:05Z

Summary

Two perf improvements for OTLP corpus preparation. Neither changes hot-path benchmark behaviour.

1. Parallel .pb generation. OtlpProtobufFile.create() now uses a ProcessPoolExecutor pipeline — JSON lines stream in, worker processes parse and serialize in parallel, the main process writes batched results in source order. Worker count tunable via RALLY_OTLP_CONVERSION_WORKERS. Converting an 82 GB JSON corpus drops from hours to minutes.

2. Compressed .pb download. When the JSON corpus is published as a compressed archive (e.g. .zst), prepare-track now tries the matching compressed .pb from the corpus base URL (e.g. .pb.zst) and decompresses locally. Typically 2–4× less network bytes than the raw .pb.

Depends on #2135, #2136, #2137 — merge after all three. Part 5 of 6.

Series

1/6 Don't crash on non-UTF-8 ApiError bodies #2134 — Don't crash on non-UTF-8 ApiError bodies
2/6 Add OTLP binary protobuf core IO and track preparation #2135 — Add OTLP binary protobuf core IO and track preparation
3/6 Add OtlpParamSource for OTLP corpora #2136 — Add OtlpParamSource for OTLP corpora
4/6 Add OtlpIngest runner with backpressure-aware retries #2137 — Add OtlpIngest runner
5/6 Parallelize OTLP corpus generation; download compressed .pb when available #2138 (this PR) — Parallelize OTLP corpus generation
6/6 Support gzipped OTLP ingest requests #2139 — Support gzipped OTLP ingest requests (depends on this PR)

Test plan

New tests: parallel/sequential output is byte-identical, RALLY_OTLP_CONVERSION_WORKERS is honored, compressed-then-fallback download path
All 314 OTLP tests green
pre-commit clean

🤖 Generated with Claude Code

The ApiError handler in execute_single() decodes `e.body`, `e.error`, and `e.info` as UTF-8 to build a human-readable error message. When the body is binary (e.g., binary protobuf returned by ES OTLP endpoints on 4xx/5xx), the strict decode raises UnicodeDecodeError, which crashes the worker mid-task. Switch the six decode() calls to use errors="replace" so undecodable bytes become U+FFFD instead of aborting the worker. No semantic change for valid UTF-8 (the common case). This is a latent bug independent of OTLP — any operation that surfaces a binary error body would have hit it.

Introduces OtlpProtobufFile in esrally/utils/io.py for reading/writing length-prefixed OTLP ExportMetricsServiceRequest protobufs, plus an offset sidecar to allow worker partitions to seek without scanning. Wires preparation into esrally/track/loader.py and esrally/track/track.py: - New OTLP document set fields (otlp_pb_size_in_bytes, etc.) - prepare_otlp_document_set tries to download a .pb from the corpus base URL, otherwise converts a local JSON corpus to .pb on disk. - set_absolute_data_path picks up the .pb when present. Adds OTLP protobuf bindings to pyproject.toml (opentelemetry-proto). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

OtlpParamSource streams length-prefixed protobuf records out of an OtlpProtobufFile, partitions them across workers using the offset sidecar, and surfaces percent_completed so the progress bar tracks real progress. Supports a "looped" mode that cycles the partition indefinitely for time-bound benchmarks. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

OtlpIngest POSTs serialized protobuf bytes to the OTLP metrics endpoint, disabling transport-level fast retries in favour of an explicit exponential-with-full-jitter backoff loop. 429/502/503/504 and connection errors are retried; non-retryable ApiErrors return a failure dict so the driver records the error without crashing the worker. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

…lable Adds two perf improvements for OTLP corpora that don't change the on-the-wire benchmark behaviour: 1. Parallel .pb generation. OtlpProtobufFile.create() now uses a ProcessPoolExecutor pipeline (worker count tunable via RALLY_OTLP_CONVERSION_WORKERS) so converting a multi-GB JSON corpus completes in minutes instead of hours. 2. Compressed .pb download. When the JSON corpus is published in a compressed archive, prepare-track first tries the matching compressed .pb (e.g. .pb.zst) from the corpus URL and decompresses locally — typically 2-4x less network bytes than the raw .pb. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

ebadyano

lgtm

gareth-ellis and others added 5 commits May 29, 2026 14:41

gareth-ellis requested a review from a team as a code owner May 29, 2026 14:11

gareth-ellis mentioned this pull request May 29, 2026

6/6 Support gzipped OTLP ingest requests #2139

Open

3 tasks

gareth-ellis changed the base branch from otlp-pr4-runner to master May 29, 2026 14:32

This was referenced May 29, 2026

1/6 Don't crash on non-UTF-8 ApiError bodies #2134

Merged

2/6 Add OTLP binary protobuf core IO and track preparation #2135

Open

3/6 Add OtlpParamSource for OTLP corpora #2136

Open

4/6 Add OtlpIngest runner with backpressure-aware retries #2137

Open

gareth-ellis changed the title ~~Parallelize OTLP corpus generation; download compressed .pb when available~~ 5/6 Parallelize OTLP corpus generation; download compressed .pb when available May 29, 2026

gareth-ellis mentioned this pull request May 29, 2026

Add OTLP support to rally #2127

Draft

ebadyano approved these changes Jun 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

5/6 Parallelize OTLP corpus generation; download compressed .pb when available#2138

5/6 Parallelize OTLP corpus generation; download compressed .pb when available#2138
gareth-ellis wants to merge 5 commits into
masterfrom
otlp-pr5-perf

gareth-ellis commented May 29, 2026 •

edited

Loading

Uh oh!

ebadyano left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

gareth-ellis commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Series

Test plan

Uh oh!

ebadyano left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

gareth-ellis commented May 29, 2026 •

edited

Loading