ceph - ceph

	Commit message (Collapse)	Author	Files	Lines
2025-01-15	rgw: S3 Delete Bucket Policy should return 204 on success	Simon Jürgensmeyer	1	-1/+4
	Currently, RGW returns a 200 on a successful DELETE on a bucket policy but the S3 API expects a 204: https://docs.aws.amazon.com/AmazonS3/latest/API/API_DeleteBucketPolicy.html The sample response in the example above is a 204. This patch checks the op_ret in `RGWDeleteBucketPolicy::send_response()` and on a success we turn it to a 204 (or STATUS_NO_CONTENT). Fixes: https://tracker.ceph.com/issues/69539 Signed-off-by: Simon Jürgensmeyer <simon.juergensmeyer@hetzner-cloud.de>
2025-01-15	AsyncMessenger.cc : improve error messages	Anthony D'Atri	1	-4/+4
	Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
2025-01-14	qa/tasks: Include stderr on tasks badness check.	Christopher Hoffman	3	-6/+24
	Make sure that first_in_ceph_log() doesn't return None (which is treated as success/"no badness" by the caller) if the cluster log file is missing. Fixes: https://tracker.ceph.com/issues/57864 Co-authored-by: Ilya Dryomov <idryomov@gmail.com> Signed-off-by: Christopher Hoffman <choffman@redhat.com> Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2025-01-14	rgw: RGWRados::get_olh() needs to use the correct attr	Yehuda Sadeh	1	-1/+1
	Fixes: https://tracker.ceph.com/issues/69536 Signed-off-by: Yehuda Sadeh <ysadehwe@ibm.com>
2025-01-14	doc/radosgw/config-ref: fix lifecycle workload tuning description	Laimis Juzeliūnas	1	-4/+5
	This commit updates RGW Config Reference - Lifecycle Settings section. In particular it changes the section where two different tunings for two different workloads are suggested. It aims for a more understandable description as asked by the contributors. Signed-off-by: Laimis Juzeliunas <laimis.juzeliunas@oxylabs.io>
2025-01-14	mgr/cephadm: Retry command when SSH connection closes	Joshua Blanch	1	-1/+6
	Handle scenarios where the SSH connection is closed or broken. Ensure commands like exiting maintenance attempt to reconnect rather than throwing an error to the user. Fixes: https://tracker.ceph.com/issues/67905 Signed-off-by: Joshua Blanch <joshua.blanch@clyso.com>
2025-01-14	qa/scrub: more delay when waiting for noscrub to take effect	Ronen Friedman	1	-4/+3
	Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2025-01-14	mgr/cephadm: re-factoring the dependencies calculation code	Redouane Kachach	11	-210/+225
	currently, the dependency logic is duplicated between the different Service classes and the module.py::_calc_daemon_deps function, which can lead to issues such as BUGSs, difficulty in maintenance, and other problems associated with duplicated code. In this change, we are consolidating all the dependency logic into the Service subclasses to eliminate this duplication. This way, we also force anybody creating a new Service to think about its potential dependencies. Fixes: https://tracker.ceph.com/issues/69021 Signed-off-by: Redouane Kachach <rkachach@ibm.com>
2025-01-14	common: config_cacher: use set::contains() instead of count()	Ronen Friedman	1	-1/+1
	moved out of the main commit to facilitate backporting it to pre-C++20. Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2025-01-14	qa/scrub: change 'bin/ceph' to 'ceph'	Ronen Friedman	3	-39/+39
	Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2025-01-14	common: fix md_config_cacher_t	Ronen Friedman	1	-7/+16
	In its get_tracked_conf_keys() member function, the cacher (in the existing code) initializes a static function-block variable ('keys'), and uses it for registering the observer. But the cacher is instantiated on the type of the configuration value. Thus, multiple cacher objects for which the configuration values are of the same type - share the static 'keys'. Only one of the observers is registered. Note that the code could have been simplified somewhat, if the signature of the get_tracked_conf_keys() function was changed to return 'const char* const *'. Fixes: https://tracker.ceph.com/issues/69236 Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2025-01-14	mgr/cephadm: Changes for creating root cert with manager cluster fsid	Anuradha Gadge	3	-3/+5
	Signed-off-by: Anuradha Gadge <anuradha.gadge@ibm.com>
2025-01-14	script/ceph-backport: Remove stray `\` from grep command	Anoop C S	1	-1/+1
	GNU grep v3.8 release notes[1] has the following mention about the usage of backslashes: "Regular expressions with stray backslashes now cause warnings, as their unspecified behavior can lead to unexpected results." . . . "The warnings are intended as a transition aid; they are likely to be errors in future releases." As a result we see the warning "grep: warning: stray \ before /" during script execution. Therefore remove the extra `\` from grep command. [1] https://git.savannah.gnu.org/cgit/grep.git/tree/NEWS#n85 Signed-off-by: Anoop C S <anoopcs@cryptolab.net>
2025-01-14	Documentation changes for updating non-ceph image service	Shweta Bhosale	1	-0/+16
	Fixes: https://tracker.ceph.com/issues/68979 Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
2025-01-14	mgr/cephadm: Command to upgrade non-ceph image services	Shweta Bhosale	6	-8/+65
	Fixes: https://tracker.ceph.com/issues/68979 Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
2025-01-13	crimson/.../replicated_request: work around gcc 11 bug in with_pg_interruptible	Samuel Just	1	-6/+15
	Signed-off-by: Samuel Just <sjust@redhat.com>
2025-01-13	doc/releases: add actual_eol for quincy	Zac Dover	1	-0/+1
	Add the actual EOL date for the Quincy release (it's 2025-01-13). Signed-off-by: Zac Dover <zac.dover@proton.me>
2025-01-13	service_spec: force ceph-exporter sock_dir to be unest or "/var/run/ceph/"	Adam King	1	-1/+6
	As discussed in https://tracker.ceph.com/issues/69475 this setting is effectively useless as it only controls the directory inside the container where the ceph-exporter will write out its asok file, and has no influence over where it is on the host where the ceph-exporter daemon is deployed. Given any custom values for the sock_dir setting would have always been broken, we decided to skip out on writing a proper migration step in cephadm to deal with this, and instead just force this field to be unset Signed-off-by: Adam King <adking@redhat.com>
2025-01-13	cephadm: fix handling of ceph-exporter sock-dir	Adam King	1	-13/+5
	Fixes: https://tracker.ceph.com/issues/69475 It turns out the sock-dir for ceph-exporter only needs to exist within the container, not on the host. Previous code, including the validation function this commit removes and previous patches trying to fix the ceph-exporter asok file not appearing on the host, were all done assuming it mattered what was on the host. This patch changes things so all we do with the sock dir is mount it to /var/run/ceph/<fsid> and don't worry about whether that dir exists on the host. Additionally, the patch makes it so /var/run/ceph/<fsid> is created during ceph-exporter deployment. Signed-off-by: Adam King <adking@redhat.com>
2025-01-13	common/io_exerciser: Reorder lines in RadosIo send_command functions	Jon Bailey	1	-2/+4
	Signed-off-by: Jon Bailey <jonathan.bailey1@ibm.com>
2025-01-13	librbd/migration/HttpClient: Use asio::ssl::stream	Adam Emerson	3	-25/+25
	`beast::ssl_stream` is deprecated as of 1.86, and its loss of the move constructor keeps it from compiling on that version. Hopefully this passes tests on 1.85, too, or it will have to wait until the boost bump. Signed-off-by: Adam Emerson <aemerson@redhat.com>
2025-01-13	osd/scrub: increase the default scrub load limit	Ronen Friedman	1	-5/+3
	Modifying the default value of osd_scrub_load_threshold from 0.5 to 10.0, rendering it mostly irrelevant. That is done following a combination of reasons: - the new mclock scheduler, which provides a better mechanism to control scrub load under all conditions; - the inapplicability of the existing mechanism to Crimson; and - users` requests. Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2025-01-13	ceph-volume: add type annotation to api.lvm	Guillaume Abrioux	1	-104/+118
	This adds Python type annotations to `api.lvm`, along with all necessary adjustments to ensure compatibility and maintain code clarity. Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
2025-01-13	common/io_exerciser: Clear the stringstream in RadosIO class	Jon Bailey	1	-36/+58
	Ensures the stringstream is empty before sending a command in the RadosIO class Signed-off-by: Jon Bailey <jonathan.bailey1@ibm.com>
2025-01-13	os/bluestore: Fix BlueFS::truncate()	Adam Kupczyk	3	-2/+7
	In `struct bluefs_fnode_t` there is a vector `extents` and the vector `extents_index` that is a log2 seek cache. Until modifications to truncate() we never removed extents from files. Modified truncate() did not update extents_index. For example 10 extents long files when truncated to 0 will have: 0 extents, 10 extents_index. After writing some data to file: 1 extents, 11 extents_index. Now, `bluefs_fnode_t::seek` will binary search extents_index, lets say it located seek at item #3. It will then jump up from #0 extent (that exists) to #3 extent which does not exist at. The worst part is that code is now broken, as #3 != extent.end(). There are 3 parts of the fix: 1) assert in `bluefs_fnode_t::seek` to protect against jumping outside extents 2) code in BlueFS::truncate to sync up `extents_index` with `extents` 3) dampening down assert in _replay to give a way out of cases where incorrect "offset 12345" (12345 is file size) instead of "offset 20000" (allocations occupied) was written to log. Fixes: https://tracker.ceph.com/issues/69481 Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
2025-01-13	rgw/logging: clean pending objects	Yuval Lifshitz	13	-129/+562
	* when source bucket is deleted * when logging is disabled * when logging conf changes * when target bucket is deleted also add "bucket logging info" admin command returning logging conf for source bucket, and list of source buckets for log bucket Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
2025-01-13	rgw/logging: retry attribuite set in case of race	Yuval Lifshitz	1	-45/+56
	Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
2025-01-13	rgw/logging: log bucket must not have encryption	Yuval Lifshitz	1	-1/+6
	Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
2025-01-13	rgw/logging: source and log bucket must be different	Yuval Lifshitz	2	-0/+11
	Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
2025-01-13	rgw/logging: support source and destination buckets on different tenants	Yuval Lifshitz	3	-44/+77
	Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
2025-01-12	mgr: Work around bug in Boost MPI/Python	Adam Emerson	1	-0/+12
	Thanks to the maintainers of the Arch User Repository Ceph PKGBUILD for this fix. Signed-off-by: Adam Emerson <aemerson@redhat.com>
2025-01-10	client: Fix signed comparison warning	Adam Emerson	1	-1/+1
	https://tracker.ceph.com/issues/68552 Signed-off-by: Adam Emerson <aemerson@redhat.com>
2025-01-10	rgw_attrs: fixes for 3 more iam/perms mappings	Matt Benjamin	3	-0/+13
	Help from Pritha and Casey appreciated Plus, fix 3 explicit mapping tests in test_rgw_iam_policy.cc. Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
2025-01-10	ceph-volume: add type annotations to util.device	Guillaume Abrioux	2	-71/+80
	This adds Python type annotations to `ceph_volume.util.device`, along with all necessary adjustments to ensure compatibility and maintain code clarity. Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
2025-01-10	doc: improve tests-integration-testing-teuthology-workflow.rst	Vallari Agrawal	2	-7/+52
	This commit adds: 1. workflow summary in the first section along with an image. 2. sub-section "Pushing to ceph-ci repository" to second section. 3. file doc/dev/developer_guide/testing_integration_tests/workflow.png Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
2025-01-10	os/bluestore: bluefs unittest for truncate bug	Adam Kupczyk	1	-0/+85
	Unittest showing 2 different flavours of problems: 1) bluefs log corruption 2) bluefs sigsegv Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
2025-01-10	crimson/os/seastore: refactor LBAMapping	Xuehan Xu	14	-190/+221
	So that PhyscalNodeMapping doesn't contain LBAMapping specific interfaces Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
2025-01-10	crimson/.../replicated_request: hold pg ref until complete() resolves	Samuel Just	1	-2/+3
	02b70a62a4 moved the call to complete() into the finally block, so we need to hold a reference to the pg until the complete() resolves. Signed-off-by: Samuel Just <sjust@redhat.com>
2025-01-10	crimson/.../replicated_request: fix op lifetime in with_pg finally	Samuel Just	1	-2/+3
	02b70a62a4 moved the call to handle.complete() into the finally, but didn't extend the op ref lifetime until after the complete resolved. Signed-off-by: Samuel Just <sjust@redhat.com>
2025-01-10	qa/tests: removed quincy from crontab	Yuri Weinstein	1	-2/+0
	Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2025-01-10	doc/releases: remove Quincy from "Active"	Zac Dover	1	-1/+1
	Remove "Quincy" from the list of "Active Releases" in doc/releases/index.rst. Signed-off-by: Zac Dover <zac.dover@proton.me>
2025-01-09	rgw:fix radosgw-admin use after free	Alex Wojno	1	-8/+6
	Converted string_view members to string to avoid use after free. Bug introduced in commit: e84c7fc Signed-off-by: Alex Wojno <awojno@bloomberg.net>
2025-01-09	doc/rgw: document UserName requirements for account migration	Casey Bodley	1	-0/+5
	clarify the following error when migrating a user into an account: > could not modify user: unable to modify user, UserName contains invalid characters Fixes: https://tracker.ceph.com/issues/69470 Signed-off-by: Casey Bodley <cbodley@redhat.com>
2025-01-09	rgw: Don't crash on exceptions from pool listing	Adam Emerson	1	-9/+23
	Fixes: https://tracker.ceph.com/issues/69303 Signed-off-by: Adam Emerson <aemerson@redhat.com>
2025-01-09	rgw: /admin/user api dumps account-related user info	Casey Bodley	1	-0/+5
	the admin api uses a separate dump_user_info() function that wasn't updated with the account changes to RGWUserInfo::dump() Fixes: https://tracker.ceph.com/issues/69462 Signed-off-by: Casey Bodley <cbodley@redhat.com>
2025-01-09	rados/monthrash: add PG_DEGRADED to ignorelist	Shraddha Agrawal	1	-0/+1
	Issue: mon/test_mon_osdmap_prune.sh is failing due to PG_DEGRADED state. Cause: We are writing an objects to a newly created PG before it has enough OSDs in acting set to create enough replicas. This is expected behaviour, thus the solution is to add the warning to ignorelist. Fixes: https://tracker.ceph.com/issues/69288 Signed-off-by: Shraddha Agrawal <shraddha.agrawal000@gmail.com>
2025-01-09	monitoring: Update nvmeof alert limits in config	Vallari Agrawal	3	-24/+91
	Update these in config.libsonnet: - NVMeoFMaxGatewaysPerGroup (4->8) - NVMeoFMaxGatewaysPerCluster (4->32) - NVMeoFMaxNamespaces (1024->2048) - NVMeoFHighClientCount (32->128) Also update prometheus_alerts.yml and test_alerts.yml accordingly. Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
2025-01-09	rgw/lc: make lc worker thread name shorter	lightmelodies	1	-1/+1
	Fixes: https://tracker.ceph.com/issues/69459 Signed-off-by: lightmelodies <lightmelodies@outlook.com>
2025-01-09	doc/rados/operations/stretch-mode: Improve doc	Kamoltat Sirivadhna	1	-7/+23
	Added more content and rewrite some sections Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
2025-01-08	doc/radosgw/config-ref: fix lc worker thread tuning	Laimis Juzeliūnas	1	-1/+1
	This commit updates RGW Config Reference - Lifecycle Settings section. In particular it addresses an incorrect suggestion to decrease parallel threads in the workers pool for a more aggressive/accelerated per-bucket lifecycle processing. A more aggressive lifecycle processing for a bucket containing higher number of objects is achieved by increasing, not decreasing parallel threads. Current suggestion is miss-leading. Fixes: https://tracker.ceph.com/issues/63659 Signed-off-by: Laimis Juzeliunas <laimis.juzeliunas@oxylabs.io>