path: root/qa (follow)
Commit message (Author, Age, Files, Lines)
* test/rgw/lua: install luarocks (Yuval Lifshitz, 2025-01-16, 1 file, -1/+9)
|     since it is only a suggestion for debian.
|     this is a regression from: https://github.com/ceph/ceph/pull/60666
|
|     Fixes: https://tracker.ceph.com/issues/69473
|     Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
* Merge pull request #61284 from shraddhaag/wip-shraddhaag-fix-rados-monthrash (Laura Flores, 2025-01-13, 1 file, -0/+1)
|\
| |     rados/monthrash: add PG_DEGRADED to ignorelist
| * rados/monthrash: add PG_DEGRADED to ignorelist (Shraddha Agrawal, 2025-01-09, 1 file, -0/+1)
| |     Issue: mon/test_mon_osdmap_prune.sh is failing due to PG_DEGRADED state.
| |
| |     Cause: We are writing objects to a newly created PG before it has
| |     enough OSDs in its acting set to create enough replicas. This is
| |     expected behaviour, so the solution is to add the warning to the
| |     ignorelist.
| |
| |     Fixes: https://tracker.ceph.com/issues/69288
| |     Signed-off-by: Shraddha Agrawal <shraddha.agrawal000@gmail.com>
* | Merge pull request #61311 from zdover23/wip-doc-2025-01-10-releases-remove-quincy-from-active (Yuri Weinstein, 2025-01-13, 1 file, -2/+0)
|\ \
| | |     doc/releases: remove Quincy from "Active"
| | |
| | |     Reviewed-by: Laura Flores <lflores@redhat.com>
| | |     Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
| | |     Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
| * | qa/tests: removed quincy from crontab (Yuri Weinstein, 2025-01-10, 1 file, -2/+0)
| | |     Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
* | | Merge pull request #60762 from ljflores/wip-tracker-67181 (SrinivasaBharathKanta, 2025-01-13, 1 file, -0/+1)
|\ \ \
| | | |     qa/suites/rados/verify/validater: ignore OSD_DOWN
| * | | qa/suites/rados/verify/validater: ignore OSD_DOWN (Laura Flores, 2024-11-18, 1 file, -0/+1)
| | | |     In these tests, we are intentionally stopping and restoring OSDs, so
| | | |     it makes sense to ignore these warnings.
| | | |
| | | |     Fixes: https://tracker.ceph.com/issues/67181
| | | |     Signed-off-by: Laura Flores <lflores@ibm.com>
* | | | Merge pull request #60969 from idryomov/wip-69135 (Ilya Dryomov, 2025-01-10, 8 files, -39/+79)
|\ \ \ \
| |_|/ /
|/| | |
| | | |     qa/suites/upgrade/reef-x: sync log-ignorelist with quincy-x
| | | |
| | | |     Reviewed-by: Laura Flores <lflores@ibm.com>
| | | |     Reviewed-by: Samuel Just <sjust@redhat.com>
| | | |     Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
| * | | qa/suites/upgrade/*-x: ignore more POOL_APP_NOT_ENABLED variants (Ilya Dryomov, 2024-12-10, 4 files, -4/+16)
| | | |     Cover warnings like
| | | |
| | | |         [WRN] Health detail: HEALTH_WARN 1 pool(s) do not have an application enabled" in cluster log
| | | |         [WRN] application not enabled on pool 'cephfs_metadata'" in cluster log
| | | |         [WRN] use 'ceph osd pool application enable <pool-name> <app-name>', where <app-name> is 'cephfs', 'rbd', 'rgw', or freeform for custom applications." in cluster log
| | | |
| | | |     and also the non-parenthesized form.
| | | |
| | | |     Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
| * | | qa/suites/upgrade/reef-x: ignore more POOL_FULL variants (Ilya Dryomov, 2024-12-10, 2 files, -24/+22)
| | | |     Cover warnings like
| | | |
| | | |         [WRN] POOL_FULL: 2 pool(s) full" in cluster log
| | | |         [WRN] pool 'test-librbd-smithi184-145008-24' is full (running out of quota)" in cluster log
| | | |         [WRN] Health detail: HEALTH_WARN 2 pool(s) full" in cluster log
| | | |
| | | |     POOL_FULL is already ignored, but only in a parenthesized form. The
| | | |     "... (XYZ)" vs "XYZ: ..." variety isn't specific to POOL_FULL, so get
| | | |     rid of parentheses throughout.
| | | |
| | | |     While at it, drop POOL_APP_NOT_ENABLED, PG_AVAILABILITY and MON_DOWN
| | | |     which are duplicated in *-start.yaml.
| | | |
| | | |     Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
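The parenthesized versus non-parenthesized distinction can be illustrated with a minimal sketch. It assumes that log-ignorelist entries are applied as regular expressions searched against each cluster log line, which is an assumption about the test harness rather than something this log states. The helper `is_ignored`, the two entry lists, and the first sample line are hypothetical and exist only for illustration.

```python
import re


def is_ignored(line, ignorelist):
    """Return True if any ignorelist pattern is found anywhere in the line."""
    return any(re.search(pattern, line) for pattern in ignorelist)


# Two shapes the same health check can take in the cluster log
# (the first sample line is made up for illustration).
parenthesized = "[WRN] Health check failed: 2 pool(s) full (POOL_FULL)"
prefixed = "[WRN] POOL_FULL: 2 pool(s) full"

# Hypothetical entries: the old one only matches the "... (XYZ)" shape,
# the new one matches both shapes.
old_entries = [r"\(POOL_FULL\)"]
new_entries = [r"POOL_FULL"]

for entries in (old_entries, new_entries):
    print(entries, is_ignored(parenthesized, entries), is_ignored(prefixed, entries))
```

Under that assumption, an entry without parentheses is a substring of both log shapes, so a single entry covers both.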
| * | | qa/suites/upgrade/*-x: ignore telemetry re-opt-in nudges (Ilya Dryomov, 2024-12-09, 2 files, -11/+4)
| | | |     Cover warnings like
| | | |
| | | |         [WRN] Health check failed: Telemetry requires re-opt-in (TELEMETRY_CHANGED)" in cluster log
| | | |         [WRN] telemetry module includes new collections; please re-opt-in to new collections with `ceph telemetry on`" in cluster log
| | | |
| | | |     Re-opt-in can happen in a respective workunit
| | | |     (test_telemetry_quincy_x.sh or test_telemetry_reef_x.sh), but it gets
| | | |     run only at the very end after both "workload" and "upgrade-sequence"
| | | |     complete. Over an hour passes in the interim:
| | | |
| | | |         2024-12-08T00:06:31.197 INFO:teuthology.task.print:**** done end upgrade, wait...
| | | |         ...
| | | |         2024-12-08T01:28:38.588 INFO:tasks.workunit:Running workunit test_telemetry_reef_x.sh...
| | | |
| | | |     The existing list is now duplicated in 0-start.yaml, so replace it
| | | |     entirely.
| | | |
| | | |     Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
| * | | qa/suites/upgrade/*-x: add OSD_UPGRADE_FINISHED to log-ignorelist (Ilya Dryomov, 2024-12-09, 4 files, -0/+4)
| | | |     Cover warnings like
| | | |
| | | |         [WRN] Health check failed: all OSDs are running squid or later but require_osd_release < squid (OSD_UPGRADE_FINISHED)" in cluster log
| | | |
| | | |     They are inherently transient and should ideally be delayed for a
| | | |     grace period instead of being raised immediately just to be ignored.
| | | |
| | | |     Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
| * | | qa/suites/upgrade/*-x: add FS_DEGRADED to log-ignorelist (Ilya Dryomov, 2024-12-09, 4 files, -0/+4)
| | | |     Cover warnings like
| | | |
| | | |         [WRN] Health check failed: 1 filesystem is degraded (FS_DEGRADED)" in cluster log
| | | |
| | | |     MDS_ALL_DOWN is already ignored in ignorelist_health.yaml for reef-x.
| | | |     Not sure why it's not ignored for quincy-x -- ignorelist_health.yaml
| | | |     isn't present there at all.
| | | |
| | | |     Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
| * | | qa/suites/upgrade/*-x: add more PG states to log-ignorelist (Ilya Dryomov, 2024-12-07, 4 files, -2/+12)
| | | |     Cover warnings like
| | | |
| | | |         [WRN] pg 2.7 is active+undersized+degraded, acting [6,7]" in cluster log
| | | |
| | | |     This is based on commit 4a4fc7bad533 ("qa: ignore pg
| | | |     availability/degraded warnings").
| | | |
| | | |     Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
| * | | qa/suites/upgrade/*-x: add "is down" to log-ignorelist (Ilya Dryomov, 2024-12-07, 4 files, -0/+4)
| | | |     Cover warnings like
| | | |
| | | |         [WRN] osd.4 (root=default,host=smithi184) is down" in cluster log
| | | |
| | | |     which OSD_DOWN doesn't.
| | | |
| | | |     Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
| * | | qa/suites/upgrade/reef-x: sync log-ignorelist with quincy-x (Ilya Dryomov, 2024-12-05, 2 files, -1/+16)
| | | |     Daemons are terminated by cephadm during the upgrade, so health checks
| | | |     like OSD_DOWN must be ignored. Since there shouldn't be any
| | | |     fundamental difference between upgrading from quincy and upgrading
| | | |     from reef, make quincy-x and reef-x ignorelists the same.
| | | |
| | | |     Fixes: https://tracker.ceph.com/issues/69135
| | | |     Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
* | | | Merge pull request #52791 from clwluvw/location-constraint (Casey Bodley, 2025-01-09, 1 file, -0/+2)
|\ \ \ \
| | | | |     rgw: check for location constraint on master zonegroup
| | | | |
| | | | |     Reviewed-by: Casey Bodley <cbodley@redhat.com>
| * | | | qa/rgw: set api_name if not provided in create_zonegroup (Seena Fallah, 2024-12-19, 1 file, -0/+2)
| | |/ /
| |/| |
| | | |     If api_name is not set in the config, use name as the api_name,
| | | |     otherwise on RGW it will be set to an empty string.
| | | |
| | | |     Signed-off-by: Seena Fallah <seenafallah@gmail.com>
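The fallback described in this commit message can be sketched as follows. This is an illustration only: the function `zonegroup_api_name` and the shape of `config` are hypothetical and are not taken from the actual create_zonegroup code.

```python
def zonegroup_api_name(name, config):
    """Pick the api_name for a zonegroup, falling back to its name.

    `config` is a hypothetical dict of zonegroup settings; a missing or
    empty api_name falls back to `name` so it never ends up as "".
    """
    api_name = config.get("api_name")
    return api_name if api_name else name


print(zonegroup_api_name("zg-a", {}))
print(zonegroup_api_name("zg-a", {"api_name": "custom"}))
```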
* | | | Merge pull request #61262 from VallariAg/fix-nvmeof-teuthology-basic-test (Vallari Agrawal, 2025-01-09, 3 files, -4/+4)
|\ \ \ \
| | | | |     qa/workunits/nvmeof/basic_tests.sh: fix nvme list assert
| * | | | qa/workunits/nvmeof/basic_tests.sh: fix connect-all assert (Vallari Agrawal, 2025-01-08, 3 files, -4/+4)
| | |_|/
| |/| |
| | | |     There seems to be a change in the 'nvme list' json output which
| | | |     caused failures in asserts after the 'nvme connect-all' command.
| | | |
| | | |     Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
* | | | Merge pull request #61255 from Matan-B/wip-matanb-crimson-seastore-deafult-2 (Matan Breizman, 2025-01-08, 48 files, -80/+329)
|\ \ \ \
| |/ / /
|/| | |
| | | |     common/options/crimson.yaml.in: fallback to Bluestore by default
| | | |
| | | |     Reviewed-by: Samuel Just <sjust@redhat.com>
| | | |     Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
| | | |     Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
| * | | qa/suites/crimson-rados-exp: add Seastore/thrash (Matan Breizman, 2025-01-07, 42 files, -1/+352)
| | | |     As Seastore/thrash was removed from the non-exp suite, it is moved
| | | |     here until fully supported.
| | | |
| | | |     Follow-up to: 5150dae471c
| | | |
| | | |     Signed-off-by: Matan Breizman <mbreizma@redhat.com>
| * | | qa/suites/crimson-rados-exp: remove basic (Matan Breizman, 2025-01-07, 15 files, -102/+0)
| | | |     "basic" directory is included in the non-exp suite and is fully
| | | |     supported
| | | |
| | | |     Signed-off-by: Matan Breizman <mbreizma@redhat.com>
| * | | qa/suites/crimson-rados: disable thrash/seastore (Matan Breizman, 2025-01-07, 1 file, -0/+0)
| | | |     Seastore supports thrash_simple only until
| | | |     https://tracker.ceph.com/issues/69405 is resolved.
| | | |
| | | |     Signed-off-by: Matan Breizman <mbreizma@redhat.com>
* | | | Merge pull request #60720 from batrick/i68913 (Venky Shankar, 2025-01-06, 1 file, -0/+2)
|\ \ \ \
| |/ / /
|/| | |
| | | |     qa: write out ESubtreeMap more frequently to find large events
| | | |
| | | |     Reviewed-by: Venky Shankar <vshankar@redhat.com>
| * | | qa: write out ESubtreeMap more frequently to find large events (Patrick Donnelly, 2024-12-30, 1 file, -0/+2)
| | | |     With the trimming changes by 9d2b3aa, ESubtreeMap wasn't written
| | | |     reliably often enough to pass the test.
| | | |
| | | |     Fixes: https://tracker.ceph.com/issues/68913
| | | |     Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
| | | |     Signed-off-by: Venky Shankar <vshankar@redhat.com>
* | | | Merge pull request #61216 from athanatos/sjust/wip-crimson-backfill-teuth (Matan Breizman, 2025-01-05, 2 files, -0/+0)
|\ \ \ \
| | | | |     qa/suites/crimson-rados: enable short_pg_log
| | | | |
| | | | |     Reviewed-by: Matan Breizman <mbreizma@redhat.com>
| * | | | qa/suites/crimson-rados: enable short_pg_log (Samuel Just, 2024-12-18, 2 files, -0/+0)
| | | | |     This should add backfill testing.
| | | | |
| | | | |     Signed-off-by: Samuel Just <sjust@redhat.com>
* | | | | Merge pull request #61148 from ronen-fr/wip-rf-abortReserv (Ronen Friedman, 2025-01-05, 2 files, -2/+337)
|\ \ \ \ \
| | | | | |     osd/scrub: abort running scrub in replica-reservation if an
| | | | | |     operator-initiated scrub is requested
| | | | | |
| | | | | |     Reviewed-by: Samuel Just <sjust@redhat.com>
| * | | | | qa/standalone/scrub: osd-scrub-test.sh - test operator overrides (Ronen Friedman, 2024-12-31, 1 file, -0/+231)
| | | | | |     verify that an operator scrub aborts a reserving scrub of the same PG.
| | | | | |
| | | | | |     Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
| * | | | | qa/standalone/scrub: add build_pg_dicts() (Ronen Friedman, 2024-12-31, 1 file, -2/+106)
| | | | | |     a helper function that builds bash dictionaries: pg to acting set,
| | | | | |     pg to primary & pg to pool.
| | | | | |
| | | | | |     Also added are two helper functions that make use of the
| | | | | |     dictionaries: count_common_active() to count the number of common
| | | | | |     OSDs in the acting set of two PGs, and find_disjoint_but_primary()
| | | | | |     to find a PG that is disjoint from the first PG, apart from possibly
| | | | | |     having the same primary OSD.
| | | | | |
| | | | | |     Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
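The two dictionary-based helpers can be sketched in Python to show the idea. The commit itself adds bash functions; the signatures, the sample PG data, and the reading of "disjoint apart from the primary" below are assumptions made for illustration, not the actual implementation.

```python
def count_common_active(pg_acting, pg_a, pg_b):
    """Count the OSDs that appear in the acting sets of both PGs."""
    return len(set(pg_acting[pg_a]) & set(pg_acting[pg_b]))


def find_disjoint_but_primary(pg_acting, pg_primary, first_pg):
    """Find a PG sharing no OSDs with first_pg, except possibly its primary.

    Assumed interpretation: the only OSD the two PGs may have in common
    is a primary OSD that is the primary of both.
    """
    first = set(pg_acting[first_pg])
    for pg, acting in pg_acting.items():
        if pg == first_pg:
            continue
        shared = first & set(acting)
        same_primary = pg_primary[pg] == pg_primary[first_pg]
        if not shared or (same_primary and shared == {pg_primary[pg]}):
            return pg
    return None


# Hypothetical sample data: PG id -> acting set, PG id -> primary OSD.
pg_acting = {"1.0": [0, 1, 2], "1.1": [0, 3, 4], "1.2": [1, 2, 5], "1.3": [3, 4, 5]}
pg_primary = {pg: acting[0] for pg, acting in pg_acting.items()}

print(count_common_active(pg_acting, "1.0", "1.2"))
print(find_disjoint_but_primary(pg_acting, pg_primary, "1.0"))
```

Using sets makes the "common OSDs" count a plain intersection, which is the part a bash version has to build by hand from the dictionaries.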
* | | | | | Merge PR #55616 into main (Venky Shankar, 2024-12-27, 2 files, -0/+56)
|\ \ \ \ \ \
| | | | | | |     * refs/pull/55616/head:
| | | | | | |         PendingReleaseNotes: add note for replay completion warning
| | | | | | |         qa: test to verify `MDS_ESTIMATED_REPLAY_TIME` warning
| | | | | | |         doc: add a note for `MDS_ESTIMATED_REPLAY_TIME` MDS warning
| | | | | | |         mds: emit warning for estinated replay time
| | | | | | |
| | | | | | |     Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
| | | | | | |     Reviewed-by: Milind Changire <mchangir@redhat.com>
| * | | | | | qa: test to verify `MDS_ESTIMATED_REPLAY_TIME` warning (Venky Shankar, 2024-11-29, 2 files, -0/+56)
| | | | | | |     Signed-off-by: Venky Shankar <vshankar@redhat.com>
* | | | | | | Merge PR #60640 into main (Venky Shankar, 2024-12-27, 1 file, -0/+1)
|\ \ \ \ \ \ \
| | | | | | | |     * refs/pull/60640/head:
| | | | | | | |         qa: use a larger timeout for kernel_untar_build workunit
| | | | | | | |
| | | | | | | |     Reviewed-by: Milind Changire <mchangir@redhat.com>
| * | | | | | | qa: use a larger timeout for kernel_untar_build workunit (Venky Shankar, 2024-11-21, 1 file, -0/+1)
| | |_|_|_|/ /
| |/| | | | |
| | | | | | |     Fixes: http://tracker.ceph.com/issues/68855
| | | | | | |     Signed-off-by: Venky Shankar <vshankar@redhat.com>
* | | | | | | Merge pull request #60988 from thuvh/feat/kafka_multiple_brokers (Yuval Lifshitz, 2024-12-26, 6 files, -1/+274)
|\ \ \ \ \ \ \
| | | | | | | |     rgw: allow bucket notification send message to kafka with multiple brokers
| * | | | | | | rgw: allow bucket notification send message to kafka with multiple brokers (Hoai-Thu Vuong, 2024-12-26, 6 files, -1/+274)
| | |_|/ / / /
| |/| | | | |
| | | | | | |     - add new parameter to define list of brokers
| | | | | | |     - update document
| | | | | | |     - change nose to nose-py3
| | | | | | |     - add test case for multiple brokers (happy case)
| | | | | | |
| | | | | | |     Signed-off-by: Hoai-Thu Vuong <thuvh87@gmail.com>
* | | | | | | Merge pull request #61143 from kotreshhr/qa-nfsganesha-fio-test (Venky Shankar, 2024-12-26, 2 files, -0/+58)
|\ \ \ \ \ \ \
| |/ / / / / /
|/| | | | | |
| | | | | | |     qa: Add async io test to nfs ganesha using fio
| | | | | | |
| | | | | | |     Reviewed-by: Venky Shankar <vshankar@redhat.com>
| * | | | | | qa: Add async io test to nfs ganesha using fio (Kotresh HR, 2024-12-19, 2 files, -0/+58)
| | | | | | |     Fixes: https://tracker.ceph.com/issues/69311
| | | | | | |     Signed-off-by: Kotresh HR <khiremat@redhat.com>
* | | | | | | Merge pull request #60821 from Matan-B/wip-matanb-crimson-seastore-deafult (Matan Breizman, 2024-12-23, 19 files, -15/+42)
|\ \ \ \ \ \ \
| | | | | | | |     crimson: switch to SeaStore as default object store
| | | | | | | |
| | | | | | | |     Reviewed-by: Samuel Just <sjust@redhat.com>
| | | | | | | |     Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
| * | | | | | | qa/suites/crimson-rados: make use of crimson_osd_objectstore (Matan Breizman, 2024-12-19, 19 files, -15/+42)
| | | | | | | |     Signed-off-by: Matan Breizman <mbreizma@redhat.com>
* | | | | | | | rbd: handle --{group,image}-namespace in "rbd group image {add,rm}" (Ilya Dryomov, 2024-12-20, 1 file, -0/+5)
| |/ / / / / /
|/| | | | | |
| | | | | | |     Currently only passing the namespace as part of the group or image
| | | | | | |     spec works. If --group-namespace or --image-namespace options are
| | | | | | |     used, the namespace isn't picked up.
| | | | | | |
| | | | | | |     Fixes: https://tracker.ceph.com/issues/69324
| | | | | | |     Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
* | | | | | | Merge pull request #60431 from VallariAg/fix-nvmeof-ns-test (Vallari Agrawal, 2024-12-19, 13 files, -53/+128)
|\ \ \ \ \ \ \
| | | | | | | |     qa: fix nvmeof scalability and namespace test
| * | | | | | | qa/suites/nvmeof/thrash: add 10-subsys-90-namespace-no_huge_pages.yaml (Vallari Agrawal, 2024-11-26, 2 files, -1/+38)
| | | | | | | |     Add test for no-huge-pages by using config "spdk_mem_size: 4096"
| | | | | | | |     in a setup with 10 subsystems and 90 namespaces each.
| | | | | | | |
| | | | | | | |     Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
| * | | | | | | qa/workunits/nvmeof/basic_tests.sh: Assert number of devices (Vallari Agrawal, 2024-11-26, 1 file, -3/+7)
| | | | | | | |     Check number of devices connected after connect-all. It should be
| | | | | | | |     equal to number of namespaces created.
| | | | | | | |
| | | | | | | |     Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
| * | | | | | | qa/workunits/nvmeof/setup_subsystem.sh: add list_namespaces() func (Vallari Agrawal, 2024-11-26, 1 file, -8/+9)
| | | | | | | |     Add list_namespaces function which could be useful for debugging
| | | | | | | |     later. Remove extra call of list_subsystems so it's only logged
| | | | | | | |     once after subsystems are completely set up.
| | | | | | | |
| | | | | | | |     Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
| * | | | | | | qa/tasks/nvmeof.py: Improve thrasher and rbd image creation (Vallari Agrawal, 2024-11-26, 2 files, -17/+9)
| | | | | | | |     Create rbd images in one command using ";" to queue them, instead
| | | | | | | |     of running "cephadm shell -- rbd create" again and again for each
| | | | | | | |     image.
| | | | | | | |
| | | | | | | |     Improve the method to select to-be-thrashed daemons. Use randint()
| | | | | | | |     and sample(), instead of weights/skip.
| | | | | | | |
| | | | | | | |     Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
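The randint()/sample() selection mentioned in this commit message can be sketched as below. This is a hedged illustration of the general approach using Python's standard `random` module; the function `pick_daemons_to_thrash`, its parameters, and the daemon names are hypothetical and do not reproduce the code in qa/tasks/nvmeof.py.

```python
import random


def pick_daemons_to_thrash(daemons, min_thrash=1, max_thrash=None, rng=random):
    """Pick a random-sized, duplicate-free subset of daemons to thrash.

    randint() chooses how many daemons to take and sample() chooses
    which ones, so no per-daemon weights or skip probabilities are needed.
    """
    if not daemons:
        return []
    if max_thrash is None or max_thrash > len(daemons):
        max_thrash = len(daemons)
    min_thrash = max(0, min(min_thrash, max_thrash))
    count = rng.randint(min_thrash, max_thrash)  # both bounds inclusive
    return rng.sample(daemons, count)


# Hypothetical daemon names, seeded generator for a repeatable run.
gateways = ["nvmeof.a", "nvmeof.b", "nvmeof.c", "nvmeof.d"]
chosen = pick_daemons_to_thrash(gateways, min_thrash=1, max_thrash=2,
                                rng=random.Random(0))
print(chosen)
```

Passing a seeded `random.Random` instance keeps a run reproducible, which can help when a thrash sequence needs to be replayed.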
| * | | | | | | qa/suites/nvmeof/thrasher: use 120 subsystems and 8 ns each (Vallari Agrawal, 2024-11-26, 3 files, -5/+12)
| | | | | | | |     For thrasher test:
| | | | | | | |     1. Run it on 120 subsystems with 8 namespaces each
| | | | | | | |     2. Run FIO for 20 mins (instead of 15 mins)
| | | | | | | |     3. Run FIO for a few randomly picked devices
| | | | | | | |        (using `--random_devices 200`)
| | | | | | | |
| | | | | | | |     Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
| * | | | | | | qa/suites/nvmeof/thrash: Add "is unavailable" to log-ignorelist (Vallari Agrawal, 2024-11-26, 4 files, -3/+5)
| | | | | | | |     This commit also:
| | | | | | | |     - Remove --rbd_iostat from thrasher fio
| | | | | | | |     - Log iteration details before printing stats in nvmeof_tharsher
| | | | | | | |
| | | | | | | |     Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
| * | | | | | | qa/suites/nvmeof/basic: use default image in nvmeof_initiator.yaml (Vallari Agrawal, 2024-11-26, 1 file, -1/+2)
| | | | | | | |     Instead of using quay.io/ceph/nvmeof:latest, use default image in
| | | | | | | |     ceph build.
| | | | | | | |
| | | | | | | |     Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>