telco5g: Add retries to all downloads from Web#78727

Merged
openshift-merge-bot[bot] merged 1 commit into openshift:main from sshnaidm:retries1
May 4, 2026
@sshnaidm
Contributor

@sshnaidm sshnaidm commented May 3, 2026

Summary by CodeRabbit

Release Notes

  • Chores
    • Improved CI/CD pipeline resilience by implementing automatic retry mechanisms with delays for external file downloads across build and test workflows, reducing transient network failures.

@coderabbitai
Contributor

coderabbitai Bot commented May 3, 2026

Warning

Rate limit exceeded

@sshnaidm has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 31 minutes and 36 seconds before requesting another review.

To keep reviews running without waiting, you can enable the usage-based add-on for your organization. This allows additional reviews beyond the hourly cap. Account admins can enable it under billing.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository YAML (base), Central YAML (inherited)

Review profile: CHILL

Plan: Enterprise

Run ID: 638def55-3216-4769-8948-63e722183145

📥 Commits

Reviewing files that changed from the base of the PR and between 84cd2c2 and e95ed08.

📒 Files selected for processing (4)
  • ci-operator/step-registry/telco5g/add-interface/telco5g-add-interface-commands.sh
  • ci-operator/step-registry/telco5g/cnf/tests/telco5g-cnf-tests-commands.sh
  • ci-operator/step-registry/telco5g/delete-interface/telco5g-delete-interface-commands.sh
  • ci-operator/step-registry/telco5g/jobs-router/telco5g-jobs-router-commands.sh

Walkthrough

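Five telco5g step-registry scripts in ci-operator add transient-failure handling to external downloads and API fetches: the AWS CLI and Terraform zips, the Go and OpenShift client tarballs, the nightly release-stream pull spec, and PR commit messages from the GitHub API. Each script now passes retry and delay flags to curl (--retry 5 --retry-delay 10) or wget (--tries=5 --waitretry=10 --retry-connrefused) instead of making a single attempt.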

Changes

Download Resilience via Retry Flags

| Layer | File(s) | Summary |
| --- | --- | --- |
| AWS CLI & Terraform Downloads | ci-operator/step-registry/telco5g/add-interface/telco5g-add-interface-commands.sh | Both AWS CLI and Terraform zip downloads gain --retry 5 --retry-delay 10 flags to curl for transient-failure resilience. |
| Terraform Download Robustness | ci-operator/step-registry/telco5g/delete-interface/telco5g-delete-interface-commands.sh | Terraform zip download now includes --retry 5 --retry-delay 10 flags in curl command. |
| Release Image Fetch Robustness | ci-operator/step-registry/telco5g/jobs-router/telco5g-jobs-router-commands.sh | Both periodic and presubmit/rehears branches gain --retry 5 --retry-delay 10 flags when fetching nightly releasestream IMG_URL via curl. |
| GitHub PR Metadata Fetch | ci-operator/step-registry/telco5g/cnf/tests/telco5g-cnf-tests-commands.sh | PR commit message retrieval in check_commit_message_for_prs adds --retry 5 --retry-delay 10 flags to curl when fetching commit body from GitHub API. |
| Go & OpenShift Client Downloads | ci-operator/step-registry/telco5g/ptp/tests/telco5g-ptp-tests-commands.sh | Go tarball and OpenShift oc client tarball downloads switch from plain wget to wget --tries=5 --waitretry=10 --retry-connrefused for connection resilience. |
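The two retry styles summarized above can be sketched as small wrapper functions. This is a minimal illustration, not code from the PR: the helper names fetch_curl and fetch_wget are hypothetical, and only the flags themselves come from the walkthrough. Note that curl's --retry applies only to errors it classifies as transient (timeouts and HTTP 408, 429, 500, 502, 503, 504), so a 404 is not retried.

```shell
#!/usr/bin/env bash
# Sketch only: fetch_curl and fetch_wget are hypothetical helper names,
# not functions from the telco5g scripts.

# Download URL ($1) to DEST ($2) with curl, retrying transient failures
# up to 5 times with a fixed 10-second delay between attempts.
fetch_curl() {
    local url="$1" dest="$2"
    curl -sSL --retry 5 --retry-delay 10 "$url" -o "$dest"
}

# Same idea with wget: at most 5 tries, waiting up to 10 seconds between
# retries, and treating "connection refused" as a retryable error.
fetch_wget() {
    local url="$1" dest="$2"
    wget --tries=5 --waitretry=10 --retry-connrefused -O "$dest" "$url"
}
```

A call would look like `fetch_curl "$SOME_URL" /tmp/archive.zip`, where `SOME_URL` stands for whichever download the script needs.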

Estimated Code Review Effort

🎯 2 (Simple) | ⏱️ ~10 minutes

🚥 Pre-merge checks | ✅ 11 | ❌ 1

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
| --- | --- | --- | --- |
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 50.00% which is insufficient. The required threshold is 80.00%. | Write docstrings for the functions missing them to satisfy the coverage threshold. |

✅ Passed checks (11 passed)
| Check name | Status | Explanation |
| --- | --- | --- |
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title accurately reflects the main change across all modified files: adding retry logic to curl and wget commands used for downloading files from the web. |
| Linked Issues check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |
| Out of Scope Changes check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |
| Stable And Deterministic Test Names | ✅ Passed | PR modifies only bash shell scripts in ci-operator/step-registry/telco5g/ directory to add retry logic to download commands, with no Ginkgo test definitions present. |
| Test Structure And Quality | ✅ Passed | The custom check is designed to review Ginkgo test code for quality requirements, but this PR contains only bash shell script modifications without any Ginkgo test code, making the check not applicable. |
| Microshift Test Compatibility | ✅ Passed | PR modifies only shell scripts in CI step registry with retry logic additions, not introducing any new Ginkgo e2e tests. |
| Single Node Openshift (Sno) Test Compatibility | ✅ Passed | The custom check for SNO test compatibility is not applicable to this PR. All modified files are bash shell scripts in ci-operator/step-registry/telco5g/ that add retry logic to download commands. No new Ginkgo e2e tests or Go test files are added. |
| Topology-Aware Scheduling Compatibility | ✅ Passed | PR modifies only CI/CD shell scripts adding retry logic to network downloads; does not affect Kubernetes manifests, operator code, controllers, or scheduling constraints. |
| Ote Binary Stdout Contract | ✅ Passed | The OTE Binary Stdout Contract check validates stdout communication in Go test binaries, but this PR modifies only shell scripts (.sh files) in CI infrastructure that add retry logic to curl and wget commands. |
| Ipv6 And Disconnected Network Test Compatibility | ✅ Passed | This PR modifies shell scripts in ci-operator/step-registry/telco5g/ to add retry logic to curl and wget commands. These are infrastructure scripts, not Ginkgo e2e tests, so the custom check targeting new Ginkgo e2e tests is not applicable. |


@openshift-ci openshift-ci Bot requested review from dgoodwin and smg247 May 3, 2026 16:17
@openshift-ci openshift-ci Bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 3, 2026
Contributor

@coderabbitai coderabbitai Bot left a comment

🧹 Nitpick comments (2)
ci-operator/step-registry/telco5g/delete-interface/telco5g-delete-interface-commands.sh (1)

24-24: ⚡ Quick win

Consider adding --fail so HTTP-error responses don't silently end up as the zip payload.

Without --fail, if the server returns a non-retryable HTTP error (e.g., 404 for a missing version), curl exits 0 and writes the HTML error body to /tmp/terraform.zip. The subsequent unzip then fails with a cryptic "invalid archive" message instead of a clear download error. The same applies to the AWS CLI curl call in telco5g-add-interface-commands.sh.

✨ Proposed fix
-curl -sL --retry 5 --retry-delay 10 "https://releases.hashicorp.com/terraform/${TERRAFORM_VERSION}/terraform_${TERRAFORM_VERSION}_linux_amd64.zip" -o /tmp/terraform.zip
+curl -sLf --retry 5 --retry-delay 10 "https://releases.hashicorp.com/terraform/${TERRAFORM_VERSION}/terraform_${TERRAFORM_VERSION}_linux_amd64.zip" -o /tmp/terraform.zip
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/telco5g/delete-interface/telco5g-delete-interface-commands.sh`
at line 24, The curl invocation that downloads Terraform (the line using
TERRAFORM_VERSION and outputting to /tmp/terraform.zip in
telco5g-delete-interface-commands.sh) should add the --fail flag so HTTP errors
cause curl to exit non‑zero instead of writing an error HTML page; update that
curl command to include --fail and do the same for the AWS CLI curl in
telco5g-add-interface-commands.sh to ensure invalid HTTP responses fail fast and
produce clear errors rather than an invalid zip/archive on unzip.
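The suggestion above can be sketched as a small wrapper that fails loudly at download time. This is a minimal illustration under stated assumptions: download_terraform is a hypothetical helper name, and the explicit error message and cleanup go slightly beyond the one-flag change in the proposed fix.

```shell
#!/usr/bin/env bash
# Sketch only: download_terraform is a hypothetical helper name.

# Download the Terraform zip for VERSION ($1) to DEST ($2).
# -f (--fail) makes curl exit non-zero on HTTP errors such as 404, so an
# HTML error page is never left behind as the "archive".
download_terraform() {
    local version="$1" dest="$2"
    local url="https://releases.hashicorp.com/terraform/${version}/terraform_${version}_linux_amd64.zip"
    if ! curl -sSLf --retry 5 --retry-delay 10 "$url" -o "$dest"; then
        echo "ERROR: could not download ${url}" >&2
        rm -f "$dest"   # drop any partial file so unzip is never run on it
        return 1
    fi
}
```

In the script this could be called as `download_terraform "${TERRAFORM_VERSION}" /tmp/terraform.zip || exit 1` before the unzip step.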
ci-operator/step-registry/telco5g/jobs-router/telco5g-jobs-router-commands.sh (1)

48-51: ⚡ Quick win

Retry flags look correct; consider adding --fail to avoid a silent null IMG_URL.

Without --fail, if the releases API returns a non-retryable HTTP error, curl exits 0 and pipes the error body to jq. When that body is JSON without a pullSpec key, jq -r ".pullSpec" yields "null", which is silently written into $MAINENV as T5_JOB_RELEASE_IMAGE='null'. Downstream steps would receive a bad release image reference with no error surfaced here.

✨ Proposed fix (applies to both branches)
-    IMG_URL=$(curl -q -L -s --retry 5 --retry-delay 10 \
+    IMG_URL=$(curl -q -L -sf --retry 5 --retry-delay 10 \
         "https://amd64.ocp.releases.ci.openshift.org/api/v1/releasestream/${T5CI_VERSION}.0-0.nightly/latest" \
         | jq -r ".pullSpec")

Also applies to: 59-62

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/step-registry/telco5g/jobs-router/telco5g-jobs-router-commands.sh`
around lines 48 - 51, The curl invocation that populates IMG_URL should include
--fail and then validate the result to avoid silently writing "null"; update the
curl command used to set IMG_URL (the command assigning IMG_URL via curl | jq -r
".pullSpec") to add the --fail flag and after the assignment check that IMG_URL
is non-empty and not "null", and if it is, emit a clear error and exit non-zero
so downstream steps don't receive a bad T5_JOB_RELEASE_IMAGE; apply the same
change to the second identical curl usage later in the script.
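The validation described above can be sketched as follows. This is a minimal illustration, assuming a hypothetical helper named get_release_image; the URL and the flags are taken from the proposed fix, while the function structure and error messages are illustrative. Running curl outside the pipeline matters here, because in `curl ... | jq ...` the pipeline's exit status is jq's unless pipefail is set.

```shell
#!/usr/bin/env bash
# Sketch only: get_release_image is a hypothetical helper name.

# Print the pullSpec of the latest nightly for VERSION ($1), or fail loudly.
get_release_image() {
    local version="$1" body img
    local url="https://amd64.ocp.releases.ci.openshift.org/api/v1/releasestream/${version}.0-0.nightly/latest"

    # Run curl on its own (not in a pipeline) so its exit status is checked.
    if ! body=$(curl -q -sSLf --retry 5 --retry-delay 10 "$url"); then
        echo "ERROR: request to ${url} failed" >&2
        return 1
    fi

    # jq exits non-zero if the body is not valid JSON.
    if ! img=$(printf '%s' "$body" | jq -r '.pullSpec'); then
        echo "ERROR: response from ${url} is not valid JSON" >&2
        return 1
    fi

    # jq -r prints the literal string "null" when the key is missing.
    if [ -z "$img" ] || [ "$img" = "null" ]; then
        echo "ERROR: no pullSpec in response from ${url}" >&2
        return 1
    fi
    printf '%s\n' "$img"
}
```

Both branches of the script could then share one call such as `IMG_URL=$(get_release_image "${T5CI_VERSION}") || exit 1`.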
ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository YAML (base), Central YAML (inherited)

Review profile: CHILL

Plan: Enterprise

Run ID: 9fe34395-9ff0-420d-8843-21175ca58a57

📥 Commits

Reviewing files that changed from the base of the PR and between 74d126e and 84cd2c2.

📒 Files selected for processing (5)
  • ci-operator/step-registry/telco5g/add-interface/telco5g-add-interface-commands.sh
  • ci-operator/step-registry/telco5g/cnf/tests/telco5g-cnf-tests-commands.sh
  • ci-operator/step-registry/telco5g/delete-interface/telco5g-delete-interface-commands.sh
  • ci-operator/step-registry/telco5g/jobs-router/telco5g-jobs-router-commands.sh
  • ci-operator/step-registry/telco5g/ptp/tests/telco5g-ptp-tests-commands.sh

@sshnaidm
Contributor Author

sshnaidm commented May 3, 2026

/pj-rehearse pull-ci-openshift-sriov-network-operator-release-4.22-operator-e2e-sriov-sno periodic-ci-openshift-release-main-nightly-4.22-e2e-telco5g-cnftests

@openshift-merge-bot
Contributor

@sshnaidm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@openshift-merge-bot
Contributor

[REHEARSALNOTIFIER]
@sshnaidm: the pj-rehearse plugin accommodates running rehearsal tests for the changes in this PR. Expand 'Interacting with pj-rehearse' for usage details. The following rehearsable tests have been affected by this change:

| Test name | Repo | Type | Reason |
| --- | --- | --- | --- |
| pull-ci-openshift-sriov-network-operator-main-operator-e2e-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-5.1-operator-e2e-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-5.0-operator-e2e-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.23-operator-e2e-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.22-operator-e2e-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-main-e2e-telco5g-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-5.1-e2e-telco5g-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-5.0-e2e-telco5g-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.23-e2e-telco5g-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.22-e2e-telco5g-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.21-e2e-telco5g-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.20-e2e-telco5g-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.19-e2e-telco5g-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.18-e2e-telco5g-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.17-e2e-telco5g-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.16-e2e-telco5g-sriov-sno | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-main-e2e-telco5g-sriov | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-5.1-e2e-telco5g-sriov | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-5.0-e2e-telco5g-sriov | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.23-e2e-telco5g-sriov | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.22-e2e-telco5g-sriov | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.21-e2e-telco5g-sriov | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.20-e2e-telco5g-sriov | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.19-e2e-telco5g-sriov | openshift/sriov-network-operator | presubmit | Registry content changed |
| pull-ci-openshift-sriov-network-operator-release-4.18-e2e-telco5g-sriov | openshift/sriov-network-operator | presubmit | Registry content changed |

A total of 110 jobs have been affected by this change. The above listing is non-exhaustive and limited to 25 jobs.

A full list of affected jobs can be found here

Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 5 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse {test-name}, with each test separated by a space, to run one or more specific rehearsals
Comment: /pj-rehearse more to run up to 10 rehearsals
Comment: /pj-rehearse max to run up to 25 rehearsals
Comment: /pj-rehearse auto-ack to run up to 5 rehearsals, and add the rehearsals-ack label on success
Comment: /pj-rehearse list to get an up-to-date list of affected jobs
Comment: /pj-rehearse abort to abort all active rehearsals
Comment: /pj-rehearse network-access-allowed to allow rehearsals of tests that have the restrict_network_access field set to false. This must be executed by an openshift org member who is not the PR author

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals.
If you would like the rehearsals-ack label removed, comment: /pj-rehearse reject to re-block merging.

Contributor

@SchSeba SchSeba left a comment

/lgtm
/approve

@openshift-ci openshift-ci Bot added the lgtm Indicates that a PR is ready to be merged. label May 4, 2026
@openshift-ci
Contributor

openshift-ci Bot commented May 4, 2026

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: SchSeba, sshnaidm

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@sshnaidm
Contributor Author

sshnaidm commented May 4, 2026

/pj-rehearse ack

@openshift-merge-bot
Contributor

@sshnaidm: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@openshift-merge-bot openshift-merge-bot Bot added the rehearsals-ack Signifies that rehearsal jobs have been acknowledged label May 4, 2026
@openshift-ci
Contributor

openshift-ci Bot commented May 4, 2026

@sshnaidm: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| ci/rehearse/openshift/sriov-network-operator/release-4.22/operator-e2e-sriov-sno | e95ed08 | link | unknown | /pj-rehearse pull-ci-openshift-sriov-network-operator-release-4.22-operator-e2e-sriov-sno |
| ci/rehearse/periodic-ci-openshift-release-main-nightly-4.22-e2e-telco5g-cnftests | e95ed08 | link | unknown | /pj-rehearse periodic-ci-openshift-release-main-nightly-4.22-e2e-telco5g-cnftests |

@openshift-merge-bot openshift-merge-bot Bot merged commit 0f86d58 into openshift:main May 4, 2026
10 of 12 checks passed