ceph - ceph

	Commit message (Collapse)	Author	Age	Files	Lines
*	common/RefCountedObj: cleanup con/des	Patrick Donnelly	2019-09-16	1	-51/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Also, don't allow children to set nref (to 0). This is the more significant change as it required fixing various code to not do this: <reftype> ptr = new RefCountedObjectFoo(..., 0); as a way to create a starting reference with nref==1. This is a pretty bad code smell so I've converted all the code doing this to use the new factory method which produces the reference safely: auto ptr = ceph::make_ref<RefCountedObjectFoo>(...); libradosstriper was particularly egregious in its abuse of setting the starting nref. :( Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
*	include: convert FunctionContext usage to generic LambdaContext	Patrick Donnelly	2019-09-16	1	-2/+2
\| \| \| \| \| \| \| \|	The main motivation for this change is to avoid copies due to the use of boost::function/std::function where captures of std::unique_ptr (in subsequent commits) would fail to compile. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
*	journal: fix race between player shut down and cache rebalance	Mykola Golub	2019-08-21	1	-0/+6
\| \| \| \| \| \| \| \| \|	25a23364 was supposed to fix this race, but it was not enough: there was still a window between `prefetch` is queued for execution in handle_cache_rebalanced and is actually executed, during which shut_down can be called and completed. Signed-off-by: Mykola Golub <mgolub@suse.com>
*	journal: s/Mutex/ceph::mutex/	Kefu Chai	2019-08-03	1	-27/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	* FutureImpl::m_lock is an exception. as, before this change, the lock was initialized like `m_lock("FutureImpl::m_lock", false, false)`, see the declaration of `Mutex(const std::string &n, bool r = false, bool ld=true, bool bt=false)` so `m_lock` is actually not using the extra features offered by `Mutex` like runtime lockdeps check. and `mutex_debugging_base` does not allow us to disable lockdeps individually. but it does use the `is_locked()` method. so instead of using `ceph::mutex` directly, a cutomized `ceph::mutex` is added for `CEPH_DEBUG_MUTEX` build. Signed-off-by: Kefu Chai <kchai@redhat.com>
*	journal: fix race between player shut down and cache rebalance	Mykola Golub	2019-06-25	1	-1/+1
\| \| \| \|	Signed-off-by: Mykola Golub <mgolub@suse.com>
*	journal: auto-tune journal fetch params based on memory target	Mykola Golub	2019-06-11	1	-5/+81
\| \| \| \| \| \|	(if a cache manager is specified) Signed-off-by: Mykola Golub <mgolub@suse.com>
*	journal: Use ceph_assert for asserts.	Adam C. Emerson	2018-08-27	1	-37/+37
\| \| \| \|	Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
*	misc: fix various spelling errors	Shengjing Zhu	2018-03-10	1	-1/+1
\| \| \| \|	Signed-off-by: Shengjing Zhu <i@zhsj.me>
*	journal: assert(false)->ceph_abort()	Li Wang	2017-10-02	1	-2/+2
\| \| \| \|	Signed-off-by: Li Wang <laurence.liwang@gmail.com>
*	common: add override in header file	liuchang0812	2017-03-03	1	-2/+2
\| \| \| \|	Signed-off-by: liuchang0812 <liuchang0812@gmail.com>
*	common: add override for common submodule and misc	liuchang0812	2017-02-16	1	-4/+4
\| \| \| \| \| \|	Fixes: http://tracker.ceph.com/issues/18922 Signed-off-by: liuchang0812 <liuchang0812@gmail.com>
*	journal: optimize speed of live replay journal pruning	Jason Dillaman	2016-07-21	1	-6/+22
\| \| \| \| \| \| \|	When streaming playback, avoid the unnecessary watch delay when one or more entries have been pruned. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: improve debug log messages	Jason Dillaman	2016-07-21	1	-1/+1
\| \| \| \| \| \| \| \|	rbd-mirror debugging involved potentially thousands of journals concurrently running. The instance address will correlate log messages between journals. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: support streaming entry playback	Jason Dillaman	2016-07-21	1	-70/+100
\| \| \| \| \| \| \| \|	Now that it's possible for the ObjectPlayer to only read a partial subset of available entries, the JournalPlayer needs to detect that more entries might be available. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: replay should only read from a single object set	Jason Dillaman	2016-07-21	1	-61/+26
\| \| \| \| \| \| \|	Previously it was prefetching up to 2 object sets worth of journal data objects which consumed too much memory. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: optionally fetch entries in small chunks during replay	Jason Dillaman	2016-07-21	1	-1/+2
\| \| \| \| \| \| \|	Support fetching the full object or incremental chunks (with a minimum of at least a single decoded entry if available). Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: player shutdown is now handled asynchronously	Jason Dillaman	2016-05-25	1	-13/+40
\| \| \| \| \|	Fixes: http://tracker.ceph.com/issues/15949 Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: eliminate watch delay for object refetches	Jason Dillaman	2016-05-24	1	-1/+5
\| \| \| \| \| \| \| \| \|	The randomized write sizes of the modified rbd-mirror stress test results in a lot of journal object with few entries. Immediately fetch objects when performing a refetch check prior to closing an empty object. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: keep active tag to assist with pruning watched objects	Jason Dillaman	2016-05-24	1	-99/+113
\| \| \| \| \| \| \| \|	It's possible that there might be additional entries to prune in objects that haven't been prefetched yet. Keep the active tag to allow these entries to be pruned after they have been loaded. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: cleanup watch refetch flag handling	Jason Dillaman	2016-05-24	1	-11/+25
\| \| \| \| \| \| \| \| \| \| \|	Clear the refetch required flag while scheduling the watch and remove the stale object after the watch completes if still empty. Previously, it was possible for the flag to become out-of-sync with whether or not it was actually refreshed and pruned. Fixes: http://tracker.ceph.com/issues/15993 Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	Merge pull request #9211 from dillaman/wip-15938	Mykola Golub	2016-05-20	1	-1/+1
\|\ \| \| \| \| \| \| \| \|	librbd: write-after-write might result in an inconsistent replicated image Reviewed-by: Mykola Golub <mgolub@mirantis.com>
\| *	journal: replay position might change after pruning stale tags	Jason Dillaman	2016-05-20	1	-1/+1
\| \| \| \| \| \| \| \|	Signed-off-by: Jason Dillaman <dillaman@redhat.com>
* \|	journal: reset watch step after pruning expired tag	Jason Dillaman	2016-05-19	1	-0/+1
\|/ \| \| \|	Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: skip partially complete tag entries during playback	Jason Dillaman	2016-05-18	1	-58/+194
\| \| \| \| \| \| \| \| \|	If a journal client does not fully write out its buffered entries before quiting, replay should skip over all remaining out-of- sequence entries for the tag. Fixes: http://tracker.ceph.com/issues/15864 Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: re-fetch active object before advancing set during replay	Jason Dillaman	2016-05-18	1	-9/+8
\| \| \| \| \| \| \| \| \| \|	During a live replay, it's possible that an append and and overflow into the next object could race with the live playback of the same object. Re-fetch an "empty" object at least once before advancing to next set to ensure all records have been read. Fixes: http://tracker.ceph.com/issues/15665 Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	librbd: potential concurrent event processing during journal ↵	Josh Durgin	2016-05-10	1	-2/+3
\|\ \| \| \| \| \| \| \| \|	replayReviewed-by: Josh Durgin <jdurgin@redhat.com> librbd: potential concurrent event processing during journal replay
\| *	journal: suppress notifications if client still in try_pop_front loop	Jason Dillaman	2016-05-07	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	One such example is popping the last entry from an object. The next object will be automatically prefetched. When that object is received, we do not want to alert the user that entries are available since try_pop_front already indicated more records were available. Fixes: http://tracker.ceph.com/issues/15755 Signed-off-by: Jason Dillaman <dillaman@redhat.com>
* \|	journal: incorrectly computed object offset within set	Jason Dillaman	2016-05-09	1	-1/+1
\|/ \| \| \|	Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: potential for double-free of context on shut down	Jason Dillaman	2016-04-13	1	-0/+5
\| \| \| \| \| \| \|	The context associated with a scheduled watch might be freed by two ObjectPlayers. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: possible race condition during live replay	Jason Dillaman	2016-04-13	1	-4/+6
\| \| \| \| \| \| \| \| \|	When two objects are being actively watched, it was possible for the watch context to complete before the second watch was associated. Fixes: http://tracker.ceph.com/issues/15352 Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: refetch active object before defaulting to new tag	Jason Dillaman	2016-03-16	1	-0/+9
\| \| \| \| \| \| \| \|	If a live replay is in progress, it's possible that object offset 0 was pulled and a new tag is discovered before the current object is (re-)pulled to determine that the old tag still has entries remaining. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: reschedule watch if no entries available during live replay	Jason Dillaman	2016-03-15	1	-1/+3
\| \| \| \|	Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	Merge pull request #7906 from dillaman/wip-14869	Josh Durgin	2016-03-09	1	-3/+2
\|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	journal: re-use common threads between journalers Conflicts: src/journal/JournalPlayer.cc src/librbd/Journal.cc src/test/rbd_mirror/image_replay.cc src/tools/rbd_mirror/ImageReplayer.h src/tools/rbd_mirror/Mirror.cc (merged interface changes to ImageReplayer, and reduced scope for change to JournalPlayer due to pr #7884 (wip-14663)). Reviewed-by: Josh Durgin <jdurgin@redhat.com>
\| *	journal: use provided work queue and timer	Jason Dillaman	2016-03-08	1	-7/+6
\| \| \| \| \| \| \| \| \| \| \| \|	This avoids the need to open two threads per journaler. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
* \|	journal: possible race condition during fetch playback	Jason Dillaman	2016-03-08	1	-7/+13
\| \| \| \| \| \| \| \|	Signed-off-by: Jason Dillaman <dillaman@redhat.com>
* \|	journal: clean up playback notification handling	Jason Dillaman	2016-03-08	1	-20/+31
\| \| \| \| \| \| \| \|	Signed-off-by: Jason Dillaman <dillaman@redhat.com>
* \|	journal: properly handle tag transition	Jason Dillaman	2016-03-08	1	-57/+143
\|/ \| \| \| \| \| \| \| \|	Now that the tag concept has been re-used for delineating epochs for librbd, we need playback to properly handle the cases where the active playback tag abruptly ends and a newer tag is inserted in the first splay offset object. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: update JournalPlayer to support new commit tracking	Jason Dillaman	2016-02-26	1	-31/+58
\| \| \| \|	Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: track commit position for each object splay offset	Jason Dillaman	2016-02-26	1	-7/+11
\| \| \| \| \| \| \| \|	It's possible, when delaying appends to the journal, that the current commit position might be in object set X while future events for a different offset might be in an object set < X. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: differentiate corruption vs missing entry errors	Jason Dillaman	2016-02-26	1	-1/+1
\| \| \| \| \| \| \| \| \|	librbd should treat the corruption of the journal differently from missing journal entries. If entries are missing, it might be the result of a crash and the journal should just be replayed through the most recent, consistent entry. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: switched entry tags to use id instead of string	Jason Dillaman	2016-02-05	1	-15/+19
\| \| \| \| \| \| \|	Later commits will add the ability to allocate tags and associate them with registered clients. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	make ctors with one argument explicit	Danny Al-Gaaf	2016-01-29	1	-2/+2
\| \| \| \| \| \| \|	Use explicit keyword for constructors with one argument to prevent implicit usage as conversion functions. Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
*	journal: fire replay complete event after reading last object	Jason Dillaman	2015-12-02	1	-8/+18
\| \| \| \| \|	Fixes: #13924 Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: support replay passed skipped splay objects	Jason Dillaman	2015-11-23	1	-99/+122
\| \| \| \| \| \| \|	It's possible for a splay object within a set to be skipped if the set is closed due to a full object within the set. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: update allocated tid when skipping committed entry in player	Mykola Golub	2015-11-12	1	-0/+1
\| \| \| \| \| \| \| \|	Otherwise, if on image open, there are no any uncommitted entries in journal, allocated tid is not updated to the latest commited and recording always starts from tid=0. Signed-off-by: Mykola Golub <mgolub@mirantis.com>
*	journal: fix race condition with unwatch on shutdown	Jason Dillaman	2015-11-06	1	-0/+1
\| \| \| \|	Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: simplified commit position tracking	Jason Dillaman	2015-11-06	1	-6/+8
\| \| \| \| \| \| \| \| \|	Now the journal player and recorder will allocate a tid to represent the associated journal entry. The order of these allocations are tracked so that the commit position can be moved only when all prior commits are safely on disk. Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: fix issues discovered via valgrind	Jason Dillaman	2015-11-06	1	-10/+31
\| \| \| \|	Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: JournalPlayer::process_state should support positive result	Jason Dillaman	2015-11-06	1	-1/+1
\| \| \| \|	Signed-off-by: Jason Dillaman <dillaman@redhat.com>
*	journal: signal playback complete via finisher thread	Jason Dillaman	2015-11-06	1	-9/+23
\| \| \| \|	Signed-off-by: Jason Dillaman <dillaman@redhat.com>