| Commit message (Collapse) | Author | Age | Files | Lines |
|
|
|
|
|
|
|
|
| |
since it is only a suggestion for debian.
this is a regression from: https://github.com/ceph/ceph/pull/60666
Fixes: https://tracker.ceph.com/issues/69473
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
|
|\
| |
| | |
rados/monthrash: add PG_DEGRADED to ignorelist
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
Issue: mon/test_mon_osdmap_prune.sh is failing due to
PG_DEGRADED state.
Cause: We are writing objects to a newly created PG before
it has enough OSDs in acting set to create enough replicas.
This is expected behaviour, thus the solution is to add the
warning to ignorelist.
Fixes: https://tracker.ceph.com/issues/69288
Signed-off-by: Shraddha Agrawal <shraddha.agrawal000@gmail.com>
|
|\ \
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | |
| | | |
zdover23/wip-doc-2025-01-10-releases-remove-quincy-from-active
doc/releases: remove Quincy from "Active"
Reviewed-by: Laura Flores <lflores@redhat.com>
Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
|
| | |
| | |
| | |
| | | |
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
|
|\ \ \
| | | |
| | | | |
qa/suites/rados/verify/validater: ignore OSD_DOWN
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
In these tests, we are intentionally stopping and restoring OSDs,
so it makes sense to ignore these warnings.
Fixes: https://tracker.ceph.com/issues/67181
Signed-off-by: Laura Flores <lflores@ibm.com>
|
|\ \ \ \
| |_|/ /
|/| | |
| | | |
| | | |
| | | |
| | | | |
qa/suites/upgrade/reef-x: sync log-ignorelist with quincy-x
Reviewed-by: Laura Flores <lflores@ibm.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Cover warnings like
[WRN] Health detail: HEALTH_WARN 1 pool(s) do not have an application enabled" in cluster log
[WRN] application not enabled on pool 'cephfs_metadata'" in cluster log
[WRN] use 'ceph osd pool application enable <pool-name> <app-name>', where <app-name> is 'cephfs', 'rbd', 'rgw', or freeform for custom applications." in cluster log
and also the non-parenthesized form.
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Cover warnings like
[WRN] POOL_FULL: 2 pool(s) full" in cluster log
[WRN] pool 'test-librbd-smithi184-145008-24' is full (running out of quota)" in cluster log
[WRN] Health detail: HEALTH_WARN 2 pool(s) full" in cluster log
POOL_FULL is already ignored, but only in a parenthesized form. The
"... (XYZ)" vs "XYZ: ..." variety isn't specific to POOL_FULL, so get
rid of parentheses throughout. While at it, drop POOL_APP_NOT_ENABLED,
PG_AVAILABILITY and MON_DOWN which are duplicated in *-start.yaml.
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
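
The "... (XYZ)" vs "XYZ: ..." point above can be illustrated with a minimal sketch. It assumes ignorelist entries are applied as regular-expression searches against cluster log lines; the helper name `is_ignored` is hypothetical, and `paren_form` is a constructed sample line, not one taken from a real run.

```python
import re

def is_ignored(line, ignorelist):
    """Return True if any ignorelist pattern is found anywhere in the log line."""
    return any(re.search(pattern, line) for pattern in ignorelist)

# Constructed sample of the parenthesized form (assumption, for illustration).
paren_form = "[WRN] Health check failed: 2 pool(s) full (POOL_FULL)"
# Non-parenthesized form, as quoted in the commit message.
colon_form = "[WRN] POOL_FULL: 2 pool(s) full"

old_list = [r"\(POOL_FULL\)"]  # matches only the "(POOL_FULL)" form
new_list = ["POOL_FULL"]       # matches both forms

for name, line in (("paren", paren_form), ("colon", colon_form)):
    print(name, is_ignored(line, old_list), is_ignored(line, new_list))
```

Under that assumption, dropping the escaped parentheses makes one entry cover both forms.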
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Cover warnings like
[WRN] Health check failed: Telemetry requires re-opt-in (TELEMETRY_CHANGED)" in cluster log
[WRN] telemetry module includes new collections; please re-opt-in to new collections with `ceph telemetry on`" in cluster log
Re-opt-in can happen in the respective workunit
(test_telemetry_quincy_x.sh or test_telemetry_reef_x.sh), but it gets
run only at the very end after both "workload" and "upgrade-sequence"
complete. Over an hour passes in the interim:
2024-12-08T00:06:31.197 INFO:teuthology.task.print:**** done end upgrade, wait...
...
2024-12-08T01:28:38.588 INFO:tasks.workunit:Running workunit test_telemetry_reef_x.sh...
The existing list is now duplicated in 0-start.yaml, so replace it
entirely.
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Cover warnings like
[WRN] Health check failed: all OSDs are running squid or later but require_osd_release < squid (OSD_UPGRADE_FINISHED)" in cluster log
They are inherently transient and should ideally be delayed for a grace
period instead of being raised immediately just to be ignored.
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Cover warnings like
[WRN] Health check failed: 1 filesystem is degraded (FS_DEGRADED)" in cluster log
MDS_ALL_DOWN is already ignored in ignorelist_health.yaml for reef-x.
Not sure why it's not ignored for quincy-x -- ignorelist_health.yaml
isn't present there at all.
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Cover warnings like
[WRN] pg 2.7 is active+undersized+degraded, acting [6,7]" in cluster log
This is based on commit 4a4fc7bad533 ("qa: ignore pg
availability/degraded warnings").
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Cover warnings like
[WRN] osd.4 (root=default,host=smithi184) is down" in cluster log
which OSD_DOWN doesn't.
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Daemons are terminated by cephadm during the upgrade, so health checks
like OSD_DOWN must be ignored. Since there shouldn't be any fundamental
difference between upgrading from quincy and upgrading from reef, make
quincy-x and reef-x ignorelists the same.
Fixes: https://tracker.ceph.com/issues/69135
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
|
|\ \ \ \
| | | | |
| | | | |
| | | | |
| | | | | |
rgw: check for location constraint on master zonegroup
Reviewed-by: Casey Bodley <cbodley@redhat.com>
|
| | |/ /
| |/| |
| | | |
| | | |
| | | |
| | | |
| | | | |
If api_name is not set in the config, use name as the api_name,
otherwise on RGW it will be set to an empty string.
Signed-off-by: Seena Fallah <seenafallah@gmail.com>
|
|\ \ \ \
| | | | |
| | | | | |
qa/workunits/nvmeof/basic_tests.sh: fix nvme list assert
|
| | |_|/
| |/| |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
There seems to be a change in the 'nvme list' JSON output
that caused assert failures after the 'nvme connect-all'
command.
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
|
|\ \ \ \
| |/ / /
|/| | |
| | | |
| | | |
| | | |
| | | | |
common/options/crimson.yaml.in: fallback to Bluestore by default
Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
As Seastore/thrash was removed from the non-exp suite, it is
moved here until fully supported.
Follow-up to: 5150dae471c
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
"basic" directory is included in the non-exp suite and is fully supported
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
Seastore supports thrash_simple only until https://tracker.ceph.com/issues/69405
is resolved.
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
|
|\ \ \ \
| |/ / /
|/| | |
| | | |
| | | | |
qa: write out ESubtreeMap more frequently to find large events
Reviewed-by: Venky Shankar <vshankar@redhat.com>
|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | | |
With the trimming changes by 9d2b3aa, ESubtreeMap wasn't written reliably often
enough to pass the test.
Fixes: https://tracker.ceph.com/issues/68913
Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
Signed-off-by: Venky Shankar <vshankar@redhat.com>
|
|\ \ \ \
| | | | |
| | | | |
| | | | |
| | | | | |
qa/suites/crimson-rados: enable short_pg_log
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
|
| | | | |
| | | | |
| | | | |
| | | | |
| | | | |
| | | | | |
This should add backfill testing.
Signed-off-by: Samuel Just <sjust@redhat.com>
|
|\ \ \ \ \
| | | | | |
| | | | | |
| | | | | |
| | | | | | |
osd/scrub: abort running scrub in replica-reservation if an operator-initiated scrub is requested
Reviewed-by: Samuel Just <sjust@redhat.com>
|
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | | |
verify that an operator scrub aborts a reserving scrub of the
same PG.
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
|
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | |
| | | | | | |
a helper function that builds bash dictionaries:
pg to acting set, pg to primary & pg to pool.
Also added are two helper functions that make use of the dictionaries:
count_common_active() to count the number of common OSDs
in the acting set of two PGs, and find_disjoint_but_primary()
to find a PG that is disjoint from the first PG, apart from
possibly having the same primary OSD.
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
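
The helpers described above are written in bash. As a rough Python analogue, using plain dicts in place of the bash dictionaries, the two lookups might look like the sketch below. The data layout, the argument order, and the reading of "disjoint apart from possibly having the same primary" are assumptions, not taken from the actual script.

```python
def count_common_active(pg_acting, pg_a, pg_b):
    """Count the OSDs that appear in the acting sets of both PGs."""
    return len(set(pg_acting[pg_a]) & set(pg_acting[pg_b]))

def find_disjoint_but_primary(pg_acting, pg_primary, first_pg):
    """Return a PG whose acting set shares no OSD with first_pg, except that
    both PGs may have the same primary OSD. Returns None if there is none."""
    first_primary = pg_primary[first_pg]
    for pg, acting in pg_acting.items():
        if pg == first_pg:
            continue
        common = set(acting) & set(pg_acting[first_pg])
        # The only overlap tolerated is a primary shared by both PGs.
        allowed = {first_primary} if pg_primary[pg] == first_primary else set()
        if common <= allowed:
            return pg
    return None

# Hypothetical PG layout: pg -> acting set, with the first OSD as primary.
pg_acting = {"1.0": [0, 1, 2], "1.1": [1, 3, 4], "1.2": [0, 5, 6], "1.3": [7, 8, 9]}
pg_primary = {pg: acting[0] for pg, acting in pg_acting.items()}

print(count_common_active(pg_acting, "1.0", "1.1"))
print(find_disjoint_but_primary(pg_acting, pg_primary, "1.0"))
```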
|
|\ \ \ \ \ \
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | | |
* refs/pull/55616/head:
PendingReleaseNotes: add note for replay completion warning
qa: test to verify `MDS_ESTIMATED_REPLAY_TIME` warning
doc: add a note for `MDS_ESTIMATED_REPLAY_TIME` MDS warning
mds: emit warning for estimated replay time
Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>
|
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | | |
Signed-off-by: Venky Shankar <vshankar@redhat.com>
|
|\ \ \ \ \ \ \
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | | |
* refs/pull/60640/head:
qa: use a larger timeout for kernel_untar_build workunit
Reviewed-by: Milind Changire <mchangir@redhat.com>
|
| | |_|_|_|/ /
| |/| | | | |
| | | | | | |
| | | | | | |
| | | | | | | |
Fixes: http://tracker.ceph.com/issues/68855
Signed-off-by: Venky Shankar <vshankar@redhat.com>
|
|\ \ \ \ \ \ \
| | | | | | | |
| | | | | | | | |
rgw: allow bucket notification send message to kafka with multiple brokers
|
| | |_|/ / / /
| |/| | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | | |
- add a new parameter to define a list of brokers
- update document
- change nose to nose-py3
- add test case for multiple brokers (happy case)
Signed-off-by: Hoai-Thu Vuong <thuvh87@gmail.com>
|
|\ \ \ \ \ \ \
| |/ / / / / /
|/| | | | | |
| | | | | | |
| | | | | | | |
qa: Add async io test to nfs ganesha using fio
Reviewed-by: Venky Shankar <vshankar@redhat.com>
|
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | | |
Fixes: https://tracker.ceph.com/issues/69311
Signed-off-by: Kotresh HR <khiremat@redhat.com>
|
|\ \ \ \ \ \ \
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | | |
crimson: switch to SeaStore as default object store
Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
|
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | | |
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
|
| |/ / / / / /
|/| | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | |
| | | | | | | |
Currently only passing the namespace as part of the group or image spec
works. If --group-namespace or --image-namespace options are used, the
namespace isn't picked up.
Fixes: https://tracker.ceph.com/issues/69324
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
|
|\ \ \ \ \ \ \
| | | | | | | |
| | | | | | | | |
qa: fix nvmeof scalability and namespace test
|
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | | |
Add a test for no-huge-pages by using config
"spdk_mem_size: 4096" in a setup with 10 subsystems
and 90 namespaces each.
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
|
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | | |
Check the number of devices connected after connect-all.
It should equal the number of namespaces created.
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
|
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | | |
Add list_namespaces function which could be useful for debugging later.
Remove extra call of list_subsystems so it's only logged once after
subsystems are completely set up.
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
|
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | | |
Create rbd images in one command using ";" to queue them,
instead of running "cephadm shell -- rbd create" again
and again for each image.
Improve the method to select to-be-thrashed daemons.
Use randint() and sample(), instead of weights/skip.
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
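
A minimal sketch of the two ideas in this message, under stated assumptions: the function names, the pool and image names, and the `max_thrash` parameter are hypothetical and are not taken from the actual qa task code. Only the ";" queuing and the use of randint() and sample() come from the message itself.

```python
import random

def build_create_command(pool, image_names, size_mb):
    """Join one `rbd create` per image with ';' so that all of them can be
    passed to a single shell invocation instead of one invocation each."""
    return "; ".join(
        f"rbd create {pool}/{name} --size {size_mb}" for name in image_names
    )

def pick_daemons_to_thrash(daemons, max_thrash, rng=random):
    """Choose how many daemons to thrash with randint(), then pick that many
    distinct daemons with sample()."""
    count = rng.randint(1, min(max_thrash, len(daemons)))
    return rng.sample(daemons, count)

# Hypothetical names, for illustration only.
print(build_create_command("mypool", ["img1", "img2"], 1024))
print(pick_daemons_to_thrash(["gw.a", "gw.b", "gw.c"], max_thrash=2))
```

Picking the count first and then sampling without replacement avoids per-daemon weights or skip probabilities, and guarantees that at least one daemon is selected.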
|
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | | |
For the thrasher test:
1. Run it on 120 subsystems with 8 namespaces each
2. Run FIO for 20 mins (instead of 15 mins)
3. Run FIO for a few randomly picked devices
(using `--random_devices 200`)
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
|
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | | |
This commit also:
- Remove --rbd_iostat from thrasher fio
- Log iteration details before printing stats in nvmeof_thrasher
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
|
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | |
| | | | | | | | |
Instead of using quay.io/ceph/nvmeof:latest, use default
image in ceph build.
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
|