Commit message | Author | Age | Files | Lines
* qa/suites/upgrade/reef-x: ignore more POOL_FULL variantsIlya Dryomov2024-12-102-24/+22
|
| Cover warnings like
|
|   [WRN] POOL_FULL: 2 pool(s) full" in cluster log
|   [WRN] pool 'test-librbd-smithi184-145008-24' is full (running out of quota)" in cluster log
|   [WRN] Health detail: HEALTH_WARN 2 pool(s) full" in cluster log
|
| POOL_FULL is already ignored, but only in a parenthesized form. The "... (XYZ)" vs "XYZ: ..." variety isn't specific to POOL_FULL, so get rid of parentheses throughout.
|
| While at it, drop POOL_APP_NOT_ENABLED, PG_AVAILABILITY and MON_DOWN which are duplicated in *-start.yaml.
|
| Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
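The difference between the parenthesized and the bare form can be illustrated with a minimal sketch. It assumes that ignorelist entries behave like regular expressions searched against each cluster-log line; the pattern lists `OLD_PATTERNS` and `NEW_PATTERNS` and the fourth sample line are hypothetical and are not the contents of the actual YAML files.

```python
import re

# The first three lines come from the commit message above; the fourth is a
# hypothetical example of the parenthesized "... (XYZ)" form.
LOG_LINES = [
    "[WRN] POOL_FULL: 2 pool(s) full",
    "[WRN] pool 'test-librbd-smithi184-145008-24' is full (running out of quota)",
    "[WRN] Health detail: HEALTH_WARN 2 pool(s) full",
    "[WRN] Health check failed: 1 pool(s) full (POOL_FULL)",
]

# Hypothetical ignorelists: the old one only knows the parenthesized form.
OLD_PATTERNS = [r"\(POOL_FULL\)"]
NEW_PATTERNS = [
    r"POOL_FULL",
    r"pool\(s\) full",
    r"is full \(running out of quota\)",
]


def unmatched(lines, patterns):
    """Return the lines that no pattern matches, i.e. lines that would fail a job."""
    compiled = [re.compile(p) for p in patterns]
    return [line for line in lines if not any(c.search(line) for c in compiled)]


print(len(unmatched(LOG_LINES, OLD_PATTERNS)))
print(len(unmatched(LOG_LINES, NEW_PATTERNS)))
```

Under these assumptions, dropping the parentheses lets a single entry cover both the "... (XYZ)" and the "XYZ: ..." forms, while the quota and health-detail lines still need entries of their own.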
* qa/suites/upgrade/*-x: ignore telemetry re-opt-in nudgesIlya Dryomov2024-12-092-11/+4
|
| Cover warnings like
|
|   [WRN] Health check failed: Telemetry requires re-opt-in (TELEMETRY_CHANGED)" in cluster log
|   [WRN] telemetry module includes new collections; please re-opt-in to new collections with `ceph telemetry on`" in cluster log
|
| Re-opt-in can happen in a respective workunit (test_telemetry_quincy_x.sh or test_telemetry_reef_x.sh), but it gets run only at the very end after both "workload" and "upgrade-sequence" complete. Over an hour passes in the interim:
|
|   2024-12-08T00:06:31.197 INFO:teuthology.task.print:**** done end upgrade, wait...
|   ...
|   2024-12-08T01:28:38.588 INFO:tasks.workunit:Running workunit test_telemetry_reef_x.sh...
|
| The existing list is now duplicated in 0-start.yaml, so replace it entirely.
|
| Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
* qa/suites/upgrade/*-x: add OSD_UPGRADE_FINISHED to log-ignorelistIlya Dryomov2024-12-094-0/+4
| | | | | | | | | | | Cover warnings like [WRN] Health check failed: all OSDs are running squid or later but require_osd_release < squid (OSD_UPGRADE_FINISHED)" in cluster log They are inherently transient and should ideally be delayed for a grace period instead of being raised immediately just to be ignored. Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
* qa/suites/upgrade/*-x: add FS_DEGRADED to log-ignorelistIlya Dryomov2024-12-094-0/+4
| | | | | | | | | | | | Cover warnings like [WRN] Health check failed: 1 filesystem is degraded (FS_DEGRADED)" in cluster log MDS_ALL_DOWN is already ignored in ignorelist_health.yaml for reef-x. Not sure why it's not ignored for quincy-x -- ignorelist_health.yaml isn't present there at all. Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
* qa/suites/upgrade/*-x: add more PG states to log-ignorelistIlya Dryomov2024-12-074-2/+12
| | | | | | | | | | | Cover warnings like [WRN] pg 2.7 is active+undersized+degraded, acting [6,7]" in cluster log This is based on commit 4a4fc7bad533 ("qa: ignore pg availability/degraded warnings"). Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
* qa/suites/upgrade/*-x: add "is down" to log-ignorelistIlya Dryomov2024-12-074-0/+4
| | | | | | | | | | Cover warnings like [WRN] osd.4 (root=default,host=smithi184) is down" in cluster log which OSD_DOWN doesn't. Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
* qa/suites/upgrade/reef-x: sync log-ignorelist with quincy-xIlya Dryomov2024-12-052-1/+16
| | | | | | | | | | Daemons are terminated by cephadm during the upgrade, so health checks like OSD_DOWN must be ignored. Since there shouldn't be any fundamental difference between upgrading from quincy and upgrading from reef, make quincy-x and reef-x ignorelists the same. Fixes: https://tracker.ceph.com/issues/69135 Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
* Merge pull request #60947 from idryomov/wip-67759Ilya Dryomov2024-12-056-0/+12
|\ | | | | | | | | | | qa/suites/upgrade/*-x: skip TestClsRbd.group_snap_{set,remove} Reviewed-by: N Balachandran <nibalach@redhat.com> Reviewed-by: Laura Flores <lflores@ibm.com>
| * qa/suites/upgrade/*-x: skip TestClsRbd.group_snap_{set,remove}Ilya Dryomov2024-12-046-0/+12
| | | | | | | | | | | | | | | | | | | | These tests assert on the number of keys on the group header object, which changed with commits ec6224db3e55 ("cls/rbd: save group snapshot creation order in a new key") and 3662bc7b3dc3 ("cls/rbd: save max group snap order"). Fixes: https://tracker.ceph.com/issues/67759 Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
* | Merge pull request #60966 from gbregman/mainGil Bregman2024-12-057-0/+26
|\ \ | | | | | | mgr/cephadm/nvmeof: Add encryption key file for nvmeof encryption
| * | mgr/cephadm/nvmeof: Add key encryption support to nvmeof configurationGil Bregman2024-12-057-0/+26
| | | | | | | | | | | | Signed-off-by: Gil Bregman <gbregman@il.ibm.com>
* | | Merge pull request #60948 from idryomov/wip-68588Ilya Dryomov2024-12-051-3/+6
|\ \ \ | |/ / |/| | | | | | | | qa/suites/rados/thrash-old-clients: skip TestClsRbd.group_snap_{set,remove} Reviewed-by: N Balachandran <nibalach@redhat.com>
| * | qa/suites/rados/thrash-old-clients: skip TestClsRbd.group_snap_{set,remove}Ilya Dryomov2024-12-041-0/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | These tests assert on the number of keys on the group header object, which changed with commits ec6224db3e55 ("cls/rbd: save group snapshot creation order in a new key") and 3662bc7b3dc3 ("cls/rbd: save max group snap order"). Fixes: https://tracker.ceph.com/issues/68588 Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
| * | qa/suites/rados/thrash-old-clients: switch to cls/test_cls_rbd.sh workunitIlya Dryomov2024-12-041-3/+4
| |/ | | | | | | | | | | | | For consistency with other upgrade jobs, don't invoke ceph_test_cls_rbd directly. While at it, drop a long outdated filter. Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
* | Merge pull request #60926 from ktdreyer/container-labelerDan Mick2024-12-051-0/+1
|\ \ | | | | | | labeler: add container labelers
| * | labeler: add container labelersKen Dreyer2024-12-031-0/+1
| | | | | | | | | | | | Signed-off-by: Ken Dreyer <kdreyer@ibm.com>
* | | Merge pull request #60946 from nbalacha/wip-nbalacha-69112Ilya Dryomov2024-12-052-4/+4
|\ \ \ | | | | | | | | | | | | | | | | cls/rbd: fix mirror_image_map_list object Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
| * | | cls/rbd: fix mirror_image_map_list objectN Balachandran2024-12-042-4/+4
| | |/ | |/| | | | | | | | | | | | | | | | | | | | | | The rbd-mirror image map is stored in the rbd_mirror_leader object. Fixed mirror_image_map_list() to operate on the correct object. Fixes: https://tracker.ceph.com/issues/69112 Signed-off-by: N Balachandran <nibalach@redhat.com>
* | | Merge pull request #60913 from myoungwon/wip-fix-null-pointer-crimsonYingxin2024-12-051-3/+10
|\ \ \ | | | | | | | | | | | | | | | | | | | | | | | | crimson/osd/ops_executer: prevent access to null pointers caused by gcc bug Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com> Reviewed-by: Samuel Just <sjust@redhat.com> Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
| * | | crimson/osd/ops_executer: prevent access to null pointers caused by gcc bugmyoungwon oh2024-12-041-3/+10
| | | | |
| | | | | This commit assigns the parameter earlier to work around a coroutine-related gcc bug that causes a null pointer access.
| | | | |
| | | | | Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
| | | | | Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
* | | | Merge pull request #60944 from dalgaaf/dalgaaf-fix-doc-ecAnthony D'Atri2024-12-041-1/+1
|\ \ \ \ | | | | | | | | | | doc: fix EC overhead table m-numbering
| * | | | doc: fix EC overhead table m-numberingDanny Al-Gaaf2024-12-031-1/+1
| | |_|/ | |/| | | | | | | | | | Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
* | | | Merge pull request #60954 from ↵Anthony D'Atri2024-12-041-8/+8
|\ \ \ \ | | | | | | | | | | | | | | | | | | | | zdover23/wip-doc-2024-12-04-dev-development-workflow-backporter-manual doc/dev: update link to backporter manual
| * | | | doc/dev: update link to backporter manualZac Dover2024-12-041-8/+8
|/ / / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | Update the link to the backporter manual, in doc/dev/development-workflow.rst. Laura Flores requested this change on 04 Dec 2024. Signed-off-by: Zac Dover <zac.dover@proton.me>
* | | | Merge pull request #60924 from ktdreyer/containerfile-date-utcKen Dreyer2024-12-041-1/+1
|\ \ \ \ | | | | | | | | | | container/build.sh: use UTC dates for tags
| * | | | container/build.sh: use UTC dates for tagsKen Dreyer2024-12-031-1/+1
| |/ / / | | | | | | | | | | | | | | | | | | | | Do not rely on the builder host's local timezone setting. Signed-off-by: Ken Dreyer <kdreyer@ibm.com>
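Why the builder's local timezone matters can be shown with a small sketch. The `YYYYMMDD` tag format and the helper name `date_tag` are assumptions made for illustration; the actual tag layout used by container/build.sh is not given here.

```python
from datetime import datetime, timedelta, timezone


def date_tag(now: datetime) -> str:
    """Return a YYYYMMDD string computed in UTC, whatever offset `now` carries."""
    return now.astimezone(timezone.utc).strftime("%Y%m%d")


# A build started at 20:30 on 2024-12-03 on a host running at UTC-08:00.
local = datetime(2024, 12, 3, 20, 30, tzinfo=timezone(timedelta(hours=-8)))

print(local.strftime("%Y%m%d"))  # date as seen by the builder host
print(date_tag(local))           # date in UTC
```

In this example the two values differ by one day, so two builders in different timezones could tag the same build differently unless both compute the date in UTC.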
* | | | Merge pull request #60930 from ↵Zac Dover2024-12-041-10/+13
|\ \ \ \ | |_|_|/ |/| | | | | | | | | | | | | | | | | | | zdover23/wip-doc-2024-12-03-rados-ops-health-checks-3 doc/rados: fix sentences in health-checks (3 of x) Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>
| * | | doc/rados: fix sentences in health-checks (3 of x)Zac Dover2024-12-041-10/+13
| | | | |
| | | | | Make sentences agree at the head of each section in doc/rados/operations/health-checks.rst. The sentences were sometimes in the imperative mood and sometimes in the declarative mood.
| | | | |
| | | | | This commit edits the second third of doc/rados/operations/health-checks.rst.
| | | | |
| | | | | Note to (I hope soon) future Zac: There are a couple of places near the end of this file where the sentences are ungrammatical. Update these in a separate PR (in isolation, so that the grammar and technical accuracy of these sentences can be the primary focus of the reviewers).
| | | | |
| | | | | Zac: cf. 000228
| | | | |
| | | | | Signed-off-by: Zac Dover <zac.dover@proton.me>
* | | | Merge pull request #60514 from rhcs-dashboard/fix-68733-mainafreen232024-12-046-157/+226
|\ \ \ \ | | | | | | | | | | | | | | | | | | | | mgr/dashboard: fix total objects/Avg object size in RGW Overview Page Reviewed-by: Afreen Misbah <afreen@ibm.com>
| * | | | mgr/dashboard: fix total objects/Avg object size in RGW Overview PageAashish Sharma2024-11-266-157/+226
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Till now we are calculating the total number of objects and the average object size in the RGW Overview Page using `ceph df` command's output. As per the discussion with RGW team, this data is not correct as S3 objects in rgw can occupy more than one rados object. This PR tends to make the overview page's info in sync with the RGW bucket page's info. Fixes: https://tracker.ceph.com/issues/68733 Signed-off-by: Aashish Sharma <aasharma@redhat.com>
* | | | | Merge pull request #60904 from cyx1231st/wip-seastore-hex-logYingxin2024-12-0423-189/+162
|\ \ \ \ \ | | | | | | | | | | | | | | | | | | | | | | | | crimson/os/seastore: convert offset/length in logs to hex format Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
| * | | | | crimson/os/seastore: convert offset/length in seastore logs to hex formatYingxin Cheng2024-12-0313-83/+83
| | | | | | | | | | | | | | | | | | | | | | | | Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
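As a minimal sketch of what hex-formatted offsets and lengths look like in a log line: the `offset~length` layout and the helper name `fmt_extent` are assumptions for illustration and are not taken from the seastore sources.

```python
def fmt_extent(offset: int, length: int) -> str:
    """Render an extent as 0x<offset>~0x<length>, both in lowercase hex."""
    return f"0x{offset:x}~0x{length:x}"


# Block-aligned values are easier to recognize in hex than in decimal.
print(fmt_extent(4096, 65536))  # 0x1000~0x10000
```

Alignment to powers of two is visible at a glance in this form, which is the usual motivation for logging offsets and lengths in hex.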
| * | | | | crimson/os/seastore: convert offset/length in segment logs to hex formatYingxin Cheng2024-12-038-71/+77
| | | | | | | | | | | | | | | | | | | | | | | | Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
| * | | | | crimson/os/seastore: minor cleanupYingxin Cheng2024-12-021-2/+2
| | | | | | | | | | | | | | | | | | | | | | | | Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
| * | | | | crimson/os/seastore: drop unused fileYingxin Cheng2024-12-021-33/+0
| | |_|_|/ | |/| | | | | | | | | | | | | Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
* | | | | Merge pull request #60755 from dmick/wip-release-containerDan Mick2024-12-043-50/+164
|\ \ \ \ \ | | | | | | | | | | | | Modify container/ software to support release containers and the promotion of prerelease containers
| * | | | | container/{Containerfile,build.sh}: use a secret to pass secretsDan Mick2024-12-042-8/+15
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Use podman's build --secret mechanism to pass credentials for download.ceph.com Signed-off-by: Dan Mick <dan.mick@redhat.com>
| * | | | | container/Containerfile: allow failure from curl to stop buildDan Mick2024-12-041-6/+6
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | curl doesn't actually fail unless -f is used; also, add some sh -e flags. Signed-off-by: Dan Mick <dan.mick@redhat.com>
| * | | | | container/Containerfile: purge .repo files with secrets before commitDan Mick2024-12-041-1/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | ceph.repo had creds in it for download.ceph.com/prerelease. Remove the .repo files we construct, since they're not necessary once the container is built (no one should be dnf'ing anything in the container). Signed-off-by: Dan Mick <dan.mick@redhat.com>
| * | | | | container/Containerfile fix boolean IS_RELEASE usageDan Mick2024-12-041-1/+1
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Existing syntax was just wrong. Signed-off-by: Dan Mick <dan.mick@redhat.com>
| * | | | | container/make-manifest-list.py: use LOCALMANIFEST throughoutDan Mick2024-12-031-3/+3
| | | | | | | | | | | | | | | | | | | | | | | | Signed-off-by: Dan Mick <dan.mick@redhat.com>
| * | | | | containers/make-manifest.py: Add --promoteDan Mick2024-12-031-21/+96
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The promote function finds the latest image on the prerelease repo, finds all its tags, and copies them all to the release repo. Signed-off-by: Dan Mick <dan.mick@redhat.com>
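The promote step described above can be sketched roughly as follows. This is not the code in the script: the `Tag` record, the `plan_promotion` helper, and the repository names are hypothetical, and the sketch only computes a copy plan from in-memory data without contacting a registry.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tag:
    name: str
    digest: str   # image the tag points at
    created: str  # ISO 8601 UTC timestamp, so string order equals time order


def plan_promotion(tags, src_repo, dst_repo):
    """Return (source, destination) pairs for every tag of the newest image."""
    if not tags:
        return []  # nothing to promote
    latest = max(tags, key=lambda t: t.created)
    return [
        (f"{src_repo}:{t.name}", f"{dst_repo}:{t.name}")
        for t in tags
        if t.digest == latest.digest
    ]


# Hypothetical tag listing of a prerelease repository.
tags = [
    Tag("v19.2.0", "sha256:aaa", "2024-11-01T00:00:00Z"),
    Tag("v19.2.1", "sha256:bbb", "2024-12-01T00:00:00Z"),
    Tag("v19.2", "sha256:bbb", "2024-12-01T00:00:00Z"),
    Tag("v19", "sha256:bbb", "2024-12-01T00:00:00Z"),
]

for src, dst in plan_promotion(tags, "example.invalid/prerelease/ceph", "example.invalid/release/ceph"):
    print(src, "->", dst)
```

Separating the plan from the copy makes it straightforward to print the plan and stop, which is similar in spirit to the --dry-run option mentioned for make-manifest-list.py.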
| * | | | | container/Containerfile, build.sh: Add PRERELEASE_USERNAME/PASSWORDDan Mick2024-12-032-2/+15
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Used for accessing download.ceph.com Signed-off-by: Dan Mick <dan.mick@redhat.com>
| * | | | | container/build.sh:Dan Mick2024-12-031-7/+12
| | | | | |
| | | | | | - change repo path to prerelease
| | | | | | - remove image id arg for release push
| | | | | | - include 'v' in version name for release push
| | | | | |
| | | | | | Signed-off-by: Dan Mick <dan.mick@redhat.com>
| * | | | | container/make-manifest-list.pyDan Mick2024-12-031-19/+32
| | | | | |
| | | | | | - don't print command failure in worker; let the caller print them if desired (allow silent failure)
| | | | | | - allow for empty tags list
| | | | | | - look for CEPH_SHA1. GIT_COMMIT was the sha1 of the ceph-container.git commit
| | | | | | - change default paths to prerelease
| | | | | | - add --dry-run to avoid final push
| | | | | | - rename 'HOST' to 'CONTAINER_HOST'
| | | | | | - Use ARCH_SPECIFIC_HOST instead of CONTAINER_HOST (which is used by podman)
| | | | | |
| | | | | | Signed-off-by: Dan Mick <dan.mick@redhat.com>
* | | | | | Merge pull request #60018 from smanjara/rgw-script-multiple-zgShilpa Jagannath2024-12-042-9/+53
|\ \ \ \ \ \ | | | | | | | | | | | | | | rgw/multisite: test-rgw-multisite.sh can create multiple zonegroups
| * | | | | | | rgw/multisite: add zonegroup parameter to test-rgw-multisite.shShilpa Jagannath2024-09-272-9/+53
| | | | | | | | | | | | | | | | | | | | | | | | | | | | Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
* | | | | | | Merge pull request #60919 from ↵Zac Dover2024-12-041-34/+31
|\ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | zdover23/wip-doc-2024-12-03-rados-ops-health-checks-2 doc/rados: fix sentences in health-checks (2 of x) Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
| * | | | | | | doc/rados: fix sentences in health-checks (2 of x)Zac Dover2024-12-031-34/+31
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Make sentences agree at the head of each section in doc/rados/operations/health-checks.rst. The sentences were sometimes in the imperative mood and sometimes in the declarative mood. This commit edits the second third of doc/rados/operations/health-checks.rst. Zac: cf. 000228 Signed-off-by: Zac Dover <zac.dover@proton.me>
* | | | | | | | Merge pull request #59862 from jamiepryde/isa-xor-handle-misaligned-buffersYuri Weinstein2024-12-032-11/+50
|\ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | erasure-code/isa: Handle case where data and coding buffers are not 3… Reviewed-by: Jose J Palacios-Perez <perezjos@uk.ibm.com>