Valkey support for Top Keys analysis by ranshid · Pull Request #34 · valkey-io/valkey-rfc

ranshid · 2026-02-03T08:05:52Z

No description provided.

Signed-off-by: Ran Shidlansik <ranshid@amazon.com>

madolson · 2026-02-16T21:18:59Z

ValkeyTopKeys.md

+allowing them to perform targeted mitigations such as key deletion, redistribute slots or scaling.  
+
+Some examples where a specific key can contribute to resource consumption include:  
+1. Extremely large hash tables can generate large network spikes when commands like `HGETALL` are executed.  


This isn't a problem; this can easily be attributed to the command log.

I agree, and I state later that COMMAND LOG is a valid solution for some cases (like this one).
Still, I think some users would love a more generic way to analyze the keys that are the root cause of the different issues they hit, instead of cross-analyzing different statistics.

madolson · 2026-02-16T21:19:12Z

ValkeyTopKeys.md

+
+Some examples where a specific key can contribute to resource consumption include:  
+1. Extremely large hash tables can generate large network spikes when commands like `HGETALL` are executed.  
+2. Very large sets can cause extended server unresponsiveness when executing commands such as `SDIFFSTORE`.  


Same, this can easily be identified by the command log.

Agreed, same response as before: the command log is a fine alternative. For root-causing issues, I agree the command log might be enough for most cases. I do think that in some cases users also want to understand the potential issues they might experience, by doing some "database analysis" to identify the largest keys they use without having to work this out from their application side. This is not RCA, so maybe I should add this to the motivation section?

madolson · 2026-02-16T21:20:51Z

ValkeyTopKeys.md

+
+### 3. Integrability
+
+- Output MUST be suitable for aggregation into cluster-wide or database-wide views.


We already don't have database wide views, why does this need to be database wide?

Yeah. I kinda went back and forth on how we should correctly expose these statistics. TBH I think that in most cases applications would like a complete dataset analysis (not only per specific database).
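To make the aggregation requirement concrete, here is a minimal sketch of how a client might merge per-node Top-N reports into one cluster-wide view. The report shape (`(key, db, size)` tuples per node), the node names, and the function `merge_top_keys` are all assumptions for illustration; the RFC does not define an output format yet.

```python
import heapq
from typing import Dict, List, Tuple

# Hypothetical report shape: each node returns (key, db, size) for its own Top-N.
NodeReport = List[Tuple[str, int, int]]


def merge_top_keys(reports: Dict[str, NodeReport], n: int) -> List[Tuple[str, int, int, str]]:
    """Merge per-node Top-N lists into one Top-N, largest size first."""
    merged = (
        (key, db, size, node)
        for node, entries in reports.items()
        for key, db, size in entries
    )
    # nlargest keeps only n candidates in memory while scanning all reports.
    return heapq.nlargest(n, merged, key=lambda entry: entry[2])


# Made-up sample data, not real server output.
reports = {
    "node-a": [("user:1", 0, 900), ("cart:7", 1, 400)],
    "node-b": [("feed:3", 0, 1200), ("user:9", 0, 100)],
}
top = merge_top_keys(reports, 2)
```

If every key is reported by exactly one node and each node reports at least N entries, the merged result is the exact cluster-wide Top-N, since any globally top key must also be in its own node's Top-N. Keeping the `db` field in each entry lets the same merge serve both a per-database and a whole-dataset view.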

madolson · 2026-02-16T21:22:05Z

ValkeyTopKeys.md

+
+Returns Top-N keys by size characteristics.
+```
+TOPKEYS <CARD | MEMORY> TOP <N>


You mention DB awareness, but these requests are not per-DB.

Right. They require one to select the DB first (or we can add the DB as an optional argument).
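The two options discussed here (select the DB first versus pass the DB as an optional argument) can be sketched against a toy in-memory model. This is only an illustration of the semantics under discussion: the `Dataset` layout and the `top_keys` function are hypothetical and do not reflect how the server would track sizes.

```python
import heapq
from typing import Dict, List, Optional, Tuple

# Toy model (assumption): database index -> {key name -> cardinality}.
Dataset = Dict[int, Dict[str, int]]


def top_keys(data: Dataset, n: int, db: Optional[int] = None) -> List[Tuple[int, str, int]]:
    """Return the n largest keys as (db, key, cardinality).

    db=None ranks across every database; an integer restricts the
    ranking to that one database.
    """
    dbs = list(data.keys()) if db is None else [db]
    candidates = (
        (d, key, card)
        for d in dbs
        for key, card in data.get(d, {}).items()
    )
    return heapq.nlargest(n, candidates, key=lambda entry: entry[2])


# Made-up sample data.
data = {0: {"a": 5, "b": 50}, 1: {"c": 500}}
whole_dataset = top_keys(data, 2)      # ranks keys from all databases
only_db0 = top_keys(data, 2, db=0)     # ranks keys from database 0 only
```

With an optional argument, one call shape covers both the per-DB request and the complete-dataset analysis mentioned earlier in the thread.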

madolson · 2026-02-16T21:24:22Z

ValkeyTopKeys.md

+
+### Hot Keys
+
+`hotkeys-max-n <integer>` 


Unclear why this needs to be a config, could just be part of the API.

TBH we can. But I think this will kinda force the implementation to be less frugal with resources like memory and CPU.

madolson · 2026-02-16T21:25:03Z

ValkeyTopKeys.md

+4. **Key Memory Usage**
+   - Refer to the amount of memory consumed by a key. This should be identical to the output of the command `MEMORY USAGE <key>`


If we have memory usage, why do we need cardinality? It feels like memory is strictly more useful.

Agree. I think we can decide on only one, but then we need to decide whether memory is worth the tracking investment. I mean, with valkey-cli I think users are always using the bigkeys analysis and not memkeys, probably because memkeys is so much more expensive in CPU and time.

madolson · 2026-02-16T21:25:58Z

ValkeyTopKeys.md

+
+`hotkeys-read-access-threshold <integer>` - default 3000
+`hotkeys-write-access-threshold <integer>` - default 2000
+Threshold configuration. Only keys exceeding these QPS thresholds appear in HOTKEYS output. Prevents low-activity keys from cluttering results.


I don't understand these configs, and more broadly how hot keys will work. Is it the currently hot keys (all current keys accessed more than 3000 times), or is it keys that were hot at some point (sort of like slowlog: a given key accessed more than X times in the past)?

Sure. I can explain more, but I did not want to go into the hotkey algorithm here, as it is being discussed in an existing PR. I think I will remove these configs from the RFC proposal, and we can discuss specific configs as part of the detailed PR.
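For readers following the question above, the two readings of the threshold ("hot right now" versus "was hot at some point") can be contrasted with a small sketch. This is not the algorithm from the referenced PR; the class `HotKeyTracker`, the one-second window, and the tiny threshold are assumptions chosen only to show the difference.

```python
from collections import Counter
from typing import Dict, Set


class HotKeyTracker:
    """Toy tracker contrasting two possible meanings of a QPS threshold."""

    def __init__(self, threshold_qps: int) -> None:
        self.threshold_qps = threshold_qps
        self.window: Counter = Counter()      # accesses in the current window
        self.currently_hot: Set[str] = set()  # reading 1: hot in the last closed window
        self.peak_qps: Dict[str, int] = {}    # reading 2: keys that were ever hot, with their peak

    def record(self, key: str) -> None:
        self.window[key] += 1

    def end_window(self) -> None:
        """Close the (assumed one-second) window and update both views."""
        self.currently_hot = {
            k for k, count in self.window.items() if count > self.threshold_qps
        }
        for k in self.currently_hot:
            self.peak_qps[k] = max(self.peak_qps.get(k, 0), self.window[k])
        self.window.clear()


tracker = HotKeyTracker(threshold_qps=3)
for _ in range(5):
    tracker.record("a")
for _ in range(2):
    tracker.record("b")
tracker.end_window()
hot_after_first = set(tracker.currently_hot)

tracker.record("a")  # "a" cools down in the next window
tracker.end_window()
hot_after_second = set(tracker.currently_hot)
ever_hot = dict(tracker.peak_qps)
```

Under the first reading, `a` drops out of the output once its rate falls below the threshold; under the slowlog-like reading, it stays listed with its peak until the history is reset.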

Signed-off-by: Ran Shidlansik <ranshid@amazon.com>

ranshid · 2026-02-17T07:09:35Z

@madolson Thank you for taking the time to review!
I know this is very raw and needs several iterations to focus the discussion on the major decisions, but I wanted to start somewhere.
I made some changes following your comments. We can iterate further.

Valkey support for Top Keys analysis

1839810

Signed-off-by: Ran Shidlansik <ranshid@amazon.com>

ranshid marked this pull request as ready for review February 6, 2026 05:56

madolson reviewed Feb 16, 2026


apply changes following PR review

80b2b52

Signed-off-by: Ran Shidlansik <ranshid@amazon.com>

ranshid mentioned this pull request Feb 17, 2026

Hotkey detection function valkey-io/valkey#2965

Open


