Skip to content

helm: restore kataManager defaults#2403

Open
Ayush-Rathor wants to merge 1 commit intoNVIDIA:mainfrom
Ayush-Rathor:fix-katamanager-defaults-26.3.1
Open

helm: restore kataManager defaults#2403
Ayush-Rathor wants to merge 1 commit intoNVIDIA:mainfrom
Ayush-Rathor:fix-katamanager-defaults-26.3.1

Conversation

@Ayush-Rathor
Copy link
Copy Markdown

Description

This fixes a regression in the Helm chart defaults for kataManager, issue 2398.

In v25.10.0, the chart shipped a complete default kataManager block including:

  • config
  • repository
  • image
  • version

In v26.3.1, those defaults were removed, leaving kataManager incomplete when enabled through values. This caused the rendered ClusterPolicy.spec.kataManager to miss required image metadata and broke reconciliation.

What changed

Restored the missing default kataManager values in deployments/gpu-operator/values.yaml.

Validation

I verified the following:

  1. Stock v26.3.1 reproduced the issue and the ClusterPolicy reported:

    • Failed to reconcile state-kata-manager: empty image path provided through both ClusterPolicy CR and ENV KATA_MANAGER_IMAGE
  2. With this patch applied and the chart rendered locally, the kataManager block was complete again.

  3. Upgrading from the patched local chart resulted in:

    • ClusterPolicy.status.state: ready
    • ClusterPolicy is ready as all resources have been successfully reconciled

Notes

For local source-based validation only, I temporarily adjusted Chart.yaml so the raw source chart would not use main-latest, but that change is not part of this PR.

Signed-off-by: Ayush-Rathor <ayushrathor104@gmail.com>
@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot Bot commented Apr 27, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Copy link
Copy Markdown
Contributor

@rajatchopra rajatchopra left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Ayush-Rathor thanks for this PR.
However, we intend to deprecate KataManager. It is not needed anymore as functionality has moved to cc-manager and sandbox-device-plugin.

the operator hits a reconcile panic in the Kata runtime class path is what we really need to fix here.

@Ayush-Rathor-1
Copy link
Copy Markdown

Ayush-Rathor-1 commented Apr 28, 2026

Got that, just saw your PR for the same

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants