Add configurable perf_record_delay for profiling during benchmarks #523
Open
charles-typ wants to merge 2 commits into facebookresearch:v2-beta
Summary:
The cachelib_num_shards parameter was parsed from gflags and stored in UcacheBenchConfig but never actually applied to the CacheAllocator::Config. This meant the config value was silently ignored and CacheLib used its default of 8192 shards. Now call setNumShards() when cachelib_num_shards > 0, allowing the benchmark to match production shard counts for more accurate CPU utilization profiling.

Differential Revision: D96087814
Summary:
Add support for a configurable delay before starting perf record during server benchmarks. This allows profiling to capture steady-state behavior after client warmup completes.

Changes:
- Add perf_record_delay parameter to benchmark configs (ALLOWED_PARAMS)
- Auto-compute server's perf_record_delay from client's warmup_seconds when DCPERF_PERF_RECORD is enabled (warmup + 60s buffer)
- Add --perf-record-delay CLI argument to server with 120s default
- Implement profile_server() function with threading.Timer for delayed perf record start (5 second system-wide profile)
- Update jobs_internal.yml with perf_record_delay in ucache_bench jobs

Differential Revision: D96231930
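For reviewers who want a picture of the mechanism described in the summary, here is a minimal sketch, not the code from this diff. Only `perf_record_delay`, `warmup_seconds`, `--perf-record-delay`, `profile_server()`, `threading.Timer`, the 60s buffer, the 120s default, and the 5 second system-wide profile come from the summary. The helper names `compute_perf_record_delay`, `build_perf_command`, and `parse_args`, the `perf.data` output path, and the injectable `run` argument are assumptions made for illustration.

```python
import argparse
import subprocess
import threading

WARMUP_BUFFER_SECONDS = 60       # buffer added on top of client warmup (from the summary)
DEFAULT_PERF_RECORD_DELAY = 120  # server CLI default (from the summary)
PROFILE_DURATION_SECONDS = 5     # length of the system-wide profile (from the summary)


def compute_perf_record_delay(warmup_seconds):
    """Hypothetical helper: server delay = client warmup + buffer."""
    return warmup_seconds + WARMUP_BUFFER_SECONDS


def parse_args(argv=None):
    """Hypothetical parser exposing only the --perf-record-delay option."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--perf-record-delay",
        type=int,
        default=DEFAULT_PERF_RECORD_DELAY,
        help="seconds to wait before starting perf record",
    )
    return parser.parse_args(argv)


def build_perf_command(output_path="perf.data", duration=PROFILE_DURATION_SECONDS):
    """Hypothetical helper: build a time-bounded, system-wide perf record command."""
    # -a records on all CPUs; wrapping "sleep N" stops the recording after N seconds.
    return ["perf", "record", "-a", "-o", output_path, "--", "sleep", str(duration)]


def profile_server(delay_seconds, run=subprocess.run):
    """Schedule perf record to start after delay_seconds without blocking the caller.

    `run` is injectable (an assumption of this sketch) so the scheduling logic
    can be exercised on machines where perf is not installed.
    """
    timer = threading.Timer(delay_seconds, run, args=(build_perf_command(),))
    timer.daemon = True  # do not keep the process alive just for the profiler
    timer.start()
    return timer  # the caller can cancel() it if the benchmark ends early
```

Under these assumptions, a client configured with `warmup_seconds=30` would give the server a delay of `compute_perf_record_delay(30)`, that is 90 seconds, and the server would call `profile_server(parse_args().perf_record_delay)` once it starts serving. Returning the timer leaves cancellation to the caller, which matters if the benchmark finishes before the delay elapses.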
@charles-typ has exported this pull request. If you are a Meta employee, you can view the originating Diff in D96231930.