summaryrefslogtreecommitdiffstats
path: root/src/common/options.cc
diff options
context:
space:
mode:
authorMauricio Faria de Oliveira <mfo@canonical.com>2021-01-07 23:44:44 +0100
committerMauricio Faria de Oliveira <mfo@canonical.com>2021-01-26 16:56:59 +0100
commitc75734729764868c5c501722fc8de08dac9ebd4a (patch)
tree4acba6e0e8838616b02cf9cfcb45cd442fa45f04 /src/common/options.cc
parentMerge pull request #39064 from idryomov/wip-doc-osdtimeout (diff)
downloadceph-c75734729764868c5c501722fc8de08dac9ebd4a.tar.xz
ceph-c75734729764868c5c501722fc8de08dac9ebd4a.zip
osd: add osd_fast_shutdown_notify_mon option (default false)
The osd_fast_shutdown option may cause the cluster log to receive too many entries of 'osd.X reported immediately failed by osd.Y', depending on cluster scale. This might be an issue for LMA stacks/tools that check ceph logs for failed lines, and then require additional logic to filter on an intended OSD (fast) shutdown; might not be an option/possible, and require an admin to analyze. So, add osd_fast_shutdown_notify_mon option for OSD to also tell the monitor it is shutting down (done in slow/non-fast shutdown) under osd_fast_shutdown. This introduces minimal delay (the ack from the mon is required to prevent the messages), and addresses the cluster log issue. Note: the osd_mon_shutdown_timeout option can be used to control the maximum amount of time waiting for the monitor ack to arrive. Fixes: http://tracker.ceph.com/issues/46978 Signed-off-by: Mauricio Faria de Oliveira <mfo@canonical.com>
Diffstat (limited to 'src/common/options.cc')
-rw-r--r--src/common/options.cc6
1 files changed, 6 insertions, 0 deletions
diff --git a/src/common/options.cc b/src/common/options.cc
index 8e15babf9e3..e17822d16a7 100644
--- a/src/common/options.cc
+++ b/src/common/options.cc
@@ -3711,6 +3711,12 @@ std::vector<Option> get_global_options() {
.set_description("Fast, immediate shutdown")
.set_long_description("Setting this to false makes the OSD do a slower teardown of all state when it receives a SIGINT or SIGTERM or when shutting down for any other reason. That slow shutdown is primarilyy useful for doing memory leak checking with valgrind."),
+ Option("osd_fast_shutdown_notify_mon", Option::TYPE_BOOL, Option::LEVEL_ADVANCED)
+ .set_default(false)
+ .set_description("Tell mon about OSD shutdown on immediate shutdown")
+ .set_long_description("Tell the monitor the OSD is shutting down on immediate shutdown. This helps with cluster log messages from other OSDs reporting it immediately failed.")
+ .add_see_also({"osd_fast_shutdown", "osd_mon_shutdown_timeout"}),
+
Option("osd_fast_fail_on_connection_refused", Option::TYPE_BOOL, Option::LEVEL_ADVANCED)
.set_default(true)
.set_description(""),