Conversation
Shard the radix tree into small trees. Previously, we had one big tree each for prefix and suffix at the schema level, and any write had to acquire a global lock on the entire tree. Break it into a configurable number of small trees and lock at the shard level. Use the first byte of the word to hash to a tree. This allows parallel writes to different shards.

Signed-off-by: Ram Prasad Voleti <ramvolet@amazon.com>
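A minimal sketch of the sharding scheme described above, under stated assumptions: the names (`ShardedTextIndex`, `Shard`, `ShardFor`) are hypothetical, and a `std::map` stands in for the real radix tree. The point is the locking structure: the first byte of the word picks a shard, and only that shard's mutex is taken for a write.

```cpp
#include <cstdint>
#include <map>
#include <memory>
#include <mutex>
#include <set>
#include <string>
#include <vector>

// Hypothetical shard: std::map stands in for the real radix tree.
struct Shard {
    std::mutex mu;                                   // shard-level lock
    std::map<std::string, std::set<uint64_t>> tree;  // word -> posting keys
};

class ShardedTextIndex {
public:
    explicit ShardedTextIndex(size_t num_shards) {
        for (size_t i = 0; i < num_shards; ++i)
            shards_.push_back(std::make_unique<Shard>());
    }

    void Insert(const std::string& word, uint64_t key) {
        Shard& s = ShardFor(word);
        std::lock_guard<std::mutex> lock(s.mu);  // only this shard is locked
        s.tree[word].insert(key);
    }

    bool Contains(const std::string& word, uint64_t key) {
        Shard& s = ShardFor(word);
        std::lock_guard<std::mutex> lock(s.mu);
        auto it = s.tree.find(word);
        return it != s.tree.end() && it->second.count(key) > 0;
    }

private:
    // First byte of the word selects the shard, so writes to words with
    // different leading bytes can proceed in parallel.
    Shard& ShardFor(const std::string& word) {
        size_t idx =
            word.empty() ? 0 : static_cast<uint8_t>(word[0]) % shards_.size();
        return *shards_[idx];
    }

    std::vector<std::unique_ptr<Shard>> shards_;
};
```

Words sharing a leading byte still contend on the same shard, so the achievable write parallelism is bounded by the distribution of leading bytes across the workload.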
src/indexes/text/text_index.h
Outdated
std::vector<std::unique_ptr<Shard>> shards_;
size_t num_shards_;
Sharding in the per-key TextIndex isn't helpful and will dramatically hurt our per-key space performance numbers (which are already awful). How hard would it be to make TextIndex a template and pass the # of shards into it at compile time, then use a hard-coded array here? This means that for the per-key index we have close to zero space overhead.
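A sketch of what this suggestion might look like, assuming hypothetical names and a `std::map` standing in for the radix tree: the shard count becomes a template parameter, so the shards live in a fixed `std::array` with no vector header or heap indirection, and `TextIndex<1>` degenerates to an unsharded per-key index.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <map>
#include <mutex>
#include <string>

// Hypothetical sketch: shard count fixed at compile time via a template
// parameter, shards stored in a hard-coded std::array.
template <size_t NumShards>
class TextIndex {
public:
    void Insert(const std::string& word, uint64_t key) {
        Shard& s = shards_[ShardIdx(word)];
        std::lock_guard<std::mutex> lock(s.mu);
        s.tree[word] = key;  // std::map stands in for the real radix tree
    }

    bool Contains(const std::string& word, uint64_t key) {
        Shard& s = shards_[ShardIdx(word)];
        std::lock_guard<std::mutex> lock(s.mu);
        auto it = s.tree.find(word);
        return it != s.tree.end() && it->second == key;
    }

private:
    struct Shard {
        std::mutex mu;
        std::map<std::string, uint64_t> tree;
    };

    // First byte of the word selects the shard.
    static size_t ShardIdx(const std::string& word) {
        return word.empty() ? 0 : static_cast<uint8_t>(word[0]) % NumShards;
    }

    std::array<Shard, NumShards> shards_;  // fixed size, no vector overhead
};
```

Note that each shard still carries a mutex (typically 40 bytes on common Linux/glibc builds), which is why the thread that followed keeps the per-key index as a separate, unsharded type rather than instantiating this template with `NumShards = 1`.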
My question here is similar to yesterday's: what's the benefit of this over having one tree and still locking on the first byte? Essentially, locking the paths branching from the root node. We'd need to share a lock on the root node, but the number of times the root node is updated is probably very small relative to the life of the main text index. My initial thought is that this adds complexity for minimal benefit.
Use template-based arrays instead of vectors for shards. Keep the per-key index separate to avoid the 40-byte mutex added per shard.

Signed-off-by: Ram Prasad Voleti <ramvolet@amazon.com>