Fix TektonInstallerSet deadlock when resources have deletionTimestamp#3217
jkhelil wants to merge 1 commit into tektoncd:main
b1a9ca5 to e803a2e
/kind bug
@jkhelil - could you help with steps to reproduce the issue?
    }

    // Resource is being deleted by another controller/InstallerSet, skip it
    ressourceLogger.Debug("resource is being deleted by another owner, skipping")
Would it be OK to also log the finalizer names and the deletion timestamp here?
To reproduce: install once, run some workload, check that the finalizers are there, delete the TektonConfig, then reinstall or upgrade.
e803a2e to 2188fed
Changes
Fixes #2474
The operator enters a deadlock when any resource (e.g., a CRD) has a deletionTimestamp during InstallerSet reconciliation. The current code immediately aborts the entire reconciliation phase, preventing critical namespace-scoped resources (ServiceAccounts, RBAC) from being created.
Symptoms: [...] openshift-pipelines namespace [...] (serviceaccount not found)
Impact: Complete operator failure during installations, upgrades, downgrades, or recovery operations.
Root Cause
Location: pkg/reconciler/kubernetes/tektoninstallerset/install.go:166-168
Submitter Checklist
These are the criteria that every PR should meet; please check them off as you review them:
make test lint before submitting a PR
See the contribution guide for more details.
Release Notes