The error is completely normal, as Flannel hasn't started yet, and kubelet will keep retrying the pod until it succeeds (once Flannel has initialized itself). So what is the issue with Flannel? You need to grab more logs to understand what is happening in the "broken" state.
I'm on attempt number two to upgrade to 1.12 and am running into the same issue on 1.12.4 as I did on 1.12.0: on boot, pods seem like they may be starting midway through the Flannel startup process, or something like that? It manifests as pods such as my ingress controller never passing any liveness or readiness probes. To resolve it I've got to delete the Flannel pod, wait for it to come up, then delete the impacted pod.
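For anyone wanting to script the manual workaround described above, here is a minimal sketch. It assumes Flannel runs as a DaemonSet named `kube-flannel` in `kube-system` with pods labelled `k8s-app=flannel`; those names are assumptions and not taken from this thread, so check them against your cluster. The helper names (`workaround_commands`, `run_workaround`) are hypothetical, and by default the script only prints the commands.

```python
import subprocess

# Assumptions (not from the post): Flannel runs as the DaemonSet
# "kube-flannel" in "kube-system", with pods labelled "k8s-app=flannel".
FLANNEL_NAMESPACE = "kube-system"
FLANNEL_DAEMONSET = "daemonset/kube-flannel"
FLANNEL_SELECTOR = "k8s-app=flannel"


def workaround_commands(node, stuck_pod, stuck_namespace):
    """Return the kubectl commands for the manual workaround, in order."""
    return [
        # 1. Delete the Flannel pod on the affected node only.
        ["kubectl", "-n", FLANNEL_NAMESPACE, "delete", "pod",
         "-l", FLANNEL_SELECTOR,
         "--field-selector", f"spec.nodeName={node}"],
        # 2. Wait for the DaemonSet to report its pods available again.
        ["kubectl", "-n", FLANNEL_NAMESPACE, "rollout", "status",
         FLANNEL_DAEMONSET, "--timeout=120s"],
        # 3. Delete the pod that started too early so it gets recreated.
        ["kubectl", "-n", stuck_namespace, "delete", "pod", stuck_pod],
    ]


def run_workaround(node, stuck_pod, stuck_namespace, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in workaround_commands(node, stuck_pod, stuck_namespace):
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)
```

Calling `run_workaround("node-1", "my-ingress-pod", "ingress")` prints the three commands without touching the cluster. This only automates the symptom fix and does not address the underlying ordering problem.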
The pods started during this time have the error:
Failed to create pod sandbox: rpc error: code = Unknown desc = failed to setup network for sandbox "cf21ae921e096b3bea3f0c405fa08bd0db1048c96674e33527a5f01c89c3a3f1": plugin type="flannel" failed (add): loadFlannelSubnetEnv failed: open /run/flannel/subnet.env: no such file or directory
I threw Claude Code at the problem and let it poke around in my cluster for a bit before posting here. Its leading theory is that 1.12 has sped up CRI startup times, making an existing race condition more prevalent. That seems reasonable, but I'm not sure how true it is (it initially thought it was something related to 1.12 trusting KVM's network device, but I could find no evidence of that in the docs or changelog, and it came back with this afterwards). Regardless, its claim is that:
The suggestion was to use machine.files to write this file on boot automatically but I figure there's zero chance that's the right answer 😆
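For context on what that file holds: `/run/flannel/subnet.env` is a short list of `KEY=VALUE` lines that flannel writes once it has its subnet for the node. The sketch below parses such a file and reports which of the usual IPv4 keys are absent. The key list reflects what flannel typically writes and should be read as an assumption, and the function names are hypothetical. Since `FLANNEL_SUBNET` differs per node, a static copy written through `machine.files` would likely be wrong on most nodes.

```python
# Keys flannel typically writes for an IPv4 cluster (assumed here; verify
# against a healthy node before relying on this list).
REQUIRED_KEYS = (
    "FLANNEL_NETWORK",
    "FLANNEL_SUBNET",
    "FLANNEL_MTU",
    "FLANNEL_IPMASQ",
)


def parse_subnet_env(text):
    """Parse KEY=VALUE lines, skipping blanks, comments, and malformed lines."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def missing_keys(values):
    """Return the expected keys that are not present, in a stable order."""
    return [key for key in REQUIRED_KEYS if key not in values]
```

On Talos the file contents can be fetched with `talosctl read /run/flannel/subnet.env` against the affected node and passed to `parse_subnet_env` as a string. An empty result from `missing_keys` on a healthy node, compared with a missing file on a freshly booted one, would help confirm the timing theory.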
If it's relevant:
ghcr.io/siderolabs/installer:v1.12.4 installer image, no addons
cluster.tf and modules/node. An example complete machineconfig is: