Skip to content

Inference Extension support#204

Merged
shaneutt merged 1 commit intokube-rs:mainfrom
nitishkumar71:add_inference_extension
Jan 15, 2026
Merged

Inference Extension support#204
shaneutt merged 1 commit intokube-rs:mainfrom
nitishkumar71:add_inference_extension

Conversation

@nitishkumar71
Copy link
Contributor

@nitishkumar71 nitishkumar71 commented Dec 4, 2025

PR is still in development phase and purpose of this PR is just to show the approach we are taking in to support inference extension. Few things which are still need to be implented here are

  • experimental API support for inference extension which is not working right now due to error Error: CRD '' has no versions in version v1.0.2
  • Inference extension will published as separate create which is not done yet
  • Tests

PR adds support of the inference extension as independent crate which will add CRDs support based on upstream version. It's possible there will be some duplicate code between gateway-api and inference extension, which is accepted approach for now. There is also some code duplication in test code, which should be handled when more extension support is being added.

Supporting issue #175

@nitishkumar71 nitishkumar71 force-pushed the add_inference_extension branch 3 times, most recently from c8451dd to 0da1861 Compare December 14, 2025 10:48
@nitishkumar71 nitishkumar71 force-pushed the add_inference_extension branch 2 times, most recently from 0d3dfe8 to 5b43ea1 Compare December 26, 2025 17:54
@nitishkumar71 nitishkumar71 marked this pull request as ready for review December 26, 2025 18:00
@nitishkumar71 nitishkumar71 changed the title [Draft] Inference Extension support Inference Extension support Dec 26, 2025
@dawid-nowak
Copy link
Contributor

I am not sure what the overall decision about different versioning for Inference Extension and Gateway API, but it looks like this change will use same version for both packages?

@dawid-nowak
Copy link
Contributor

It is more of a nit than anything else...
I would prefer to rename extension folder to extensions assuming that there will be more of them in the future.
Otherwise, I think it might be more clear to have gateway-api and inference-extension-api folders at the top level.

@nitishkumar71
Copy link
Contributor Author

nitishkumar71 commented Jan 2, 2026

I am not sure what the overall decision about different versioning for Inference Extension and Gateway API, but it looks like this change will use same version for both packages?

Ohh, we meant to have different release version for rust API. Sorry, I did not get that. I was under impression for different version of upstream CRDs. Should we start a new versioning like v0.0.1 or follow upstream?

@nitishkumar71 nitishkumar71 force-pushed the add_inference_extension branch from 5b43ea1 to 23d73f1 Compare January 2, 2026 11:51
@nitishkumar71
Copy link
Contributor Author

It is more of a nit than anything else... I would prefer to rename extension folder to extensions assuming that there will be more of them in the future. Otherwise, I think it might be more clear to have gateway-api and inference-extension-api folders at the top level.

Agree, i missed it.

@dawid-nowak
Copy link
Contributor

I am not sure what the overall decision about different versioning for Inference Extension and Gateway API, but it looks like this change will use same version for both packages?

Ohh, we meant to have different release version for rust API. Sorry, I did not get that. I was under impression for different version of upstream CRDs. Should we start a new versioning like v0.0.1 or follow upstream?

I think we need @shaneutt to make a call here.
Personally, I would prefer if gateway-api and inference-extension-api were somehow tracking the releases of Kubernetes standards in their versions. At the moment, in crates.io we have release 0.19 but it seems that release 0.16 is covering Kubvernetes version 1.2.1. So from the developer perspective you need to check the readme to find what you are looking for.

I have seen that some projects would use custom versioning. In this case for example 0.19+1.2.1 for inference-extension-api and 0.19-1.4 for gateway-api, would have all the information necessary.

@shaneutt
Copy link
Member

I'm open to the suggestion of doing a version tag that tracks the upstream version. We can always have a couple of tags if needed.

@shaneutt shaneutt force-pushed the add_inference_extension branch from 23d73f1 to 68142c3 Compare January 14, 2026 19:59
Copy link
Member

@shaneutt shaneutt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @nitishkumar71!

Took a quick high-level review pass, and added a couple of comments. Also needs some updates to make CI happy.

Did you run into much trouble while working on this, or did it seem pretty straightforward?

@nitishkumar71 nitishkumar71 force-pushed the add_inference_extension branch 3 times, most recently from 388e4b8 to eccb23c Compare January 15, 2026 09:59
@nitishkumar71
Copy link
Contributor Author

Took a quick high-level review pass, and added a couple of comments. Also needs some updates to make CI happy.

It took me sometime but delay was mostly due to some other personal things at my end.

Copy link
Member

@shaneutt shaneutt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There's definitely a few things we could continue to tweak here, but most of that is reasonable to tweak in upcoming iterations, so I think we're mostly ready to go.

There are a couple things in the new script for safety we need to do now, and can't wait for future iterations, which I've added comments for.

After those are resolved, I think we're good to go and we can continue moving this forward with some smaller incremental PRs from here.

Signed-off-by: Nitishkumar Singh <nitishkumarsingh71@gmail.com>

seperate generators for gateway api and extensions

Signed-off-by: Nitishkumar Singh <nitishkumarsingh71@gmail.com>
@nitishkumar71 nitishkumar71 force-pushed the add_inference_extension branch from 0ce626b to 97a5917 Compare January 15, 2026 14:56
@nitishkumar71
Copy link
Contributor Author

There's definitely a few things we could continue to tweak here, but most of that is reasonable to tweak in upcoming iterations, so I think we're mostly ready to go.

There are a couple things in the new script for safety we need to do now, and can't wait for future iterations, which I've added comments for.

After those are resolved, I think we're good to go and we can continue moving this forward with some smaller incremental PRs from here.

Thanks for pointing them out, changes are done

@shaneutt shaneutt merged commit 3fd26cf into kube-rs:main Jan 15, 2026
5 checks passed
@shaneutt shaneutt linked an issue Jan 15, 2026 that may be closed by this pull request
@nitishkumar71
Copy link
Contributor Author

There's definitely a few things we could continue to tweak here, but most of that is reasonable to tweak in upcoming iterations, so I think we're mostly ready to go.

There are a couple things in the new script for safety we need to do now, and can't wait for future iterations, which I've added comments for.

After those are resolved, I think we're good to go and we can continue moving this forward with some smaller incremental PRs from here.

@shaneutt what were the additional things you were planning to tweak here? I can pick them up

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Gateway API Inference Extension

3 participants