[sender] refactor for a simpler multi-thread behavior #209

remeh · 2021-09-27T13:32:14Z

DRAFT

Idea behind this PR is to improved the Sender objects lifecycles (especially the message_queue and the sender_thread) in order to have a simpler implementation not having to always check for their existence.

On top of that, this PR is synchronizing the #stop/close mechanism for it to be blocking. The worse scenario with multiple calls to close/add in parallel would now be that metrics submitted after a close call would not be flushed.

ivoanjo

I very much like this! I like how simpler it looks once is_closed becomes a terminal state: once the instance gets there, there's no going back.

I also very much like how the mutex usage is kept off the main path of the code.

ivoanjo · 2021-10-11T11:20:25Z

lib/datadog/statsd/sender.rb

+  class FlushQueue < Queue
+  end
+  class CloseQueue < Queue
+  end


Since these aren't used elsewhere, I suggest putting them inside the Sender class itself.

Also (very minor) if you want a single-line definition, you can use:

FlushQueue = Class.new(Queue) # OR class FlushQueue < Queue; end

ivoanjo · 2021-10-11T11:26:23Z

lib/datadog/statsd/sender.rb

+             blocking_queue = FlushQueue.new
+             channel << blocking_queue
+             blocking_queue.pop # wait for the bg thread to finish its work
+             blocking_queue.close if CLOSEABLE_QUEUES


To be honest, I'm growing increasingly unconvinced this whole business with the CLOSEABLE_QUEUES is worth it. It effectively creates two code paths for different Ruby versions, but it's not like we're going to drop support for the old Rubies soon (#close is a Ruby 2.3 feature, and we're still fighting to drop 2.0).

Would it be simpler to just remove this entirely? It doesn't even seem that it would particularly improve performance either.

ivoanjo · 2021-10-11T13:13:31Z

lib/datadog/statsd/sender.rb

      end

-      def rendez_vous


The forwarder will probably need to be updated to not use #rendez_vous anymore, right? (To be honest, I don't quite understand the use-case of having both #sync_with_outbound_io and #flush at the top-level).

ivoanjo · 2021-10-11T13:15:32Z

lib/datadog/statsd/sender.rb

-        # Initialize and get the thread's sync queue
-        queue = (Thread.current[:statsd_sync_queue] ||= Queue.new)


We lost this caching-the-queue behavior in the refactoring, which doesn't seem like an issue, but just doublechecking by asking if this is ok from a performance pov (I actually am not sure how expensive it is to create queues, probably not a lot)

ivoanjo · 2021-10-11T13:16:14Z

lib/datadog/statsd/sender.rb

+      # Compatibility with `Sender`
+      def start()
      end


With this change, neither Sender nor SingleThreadedSender use #start, so perhaps it would make sense to just remove them?

ivoanjo · 2021-10-11T13:20:09Z

lib/datadog/statsd/sender.rb

+            channel << blocking_queue
+            blocking_queue.pop # wait for the bg thread to finish its work
+            blocking_queue.close if CLOSEABLE_QUEUES
+            sender_thread.join(3) # wait for completion, timeout after 3 seconds
+            # TODO(remy): should I close `channel` here?


As I suggested above, I think it'd be simpler to just not use close; if we do, calling close here may be problematic if two stops get called concurrently, but the background thread is taking long to finish. E.g. something like T1: acquire mutex -> tell background thread to stop -> timeout join -> call close -> release mutex; T2: acquire mutex -> does not see previous @is_closed -> tries to write to channel -> channel has been closed.

Also, should the stop behavior be a bit more flexible? E.g. configurable timeout, or optionally run a block to decide what to do.

[sender] refactor for a simpler multi-thread behavior

6582237

remeh added the WIP label Sep 27, 2021

Base automatically changed from remeh/fork-detect-v3 to master September 28, 2021 15:17

ivoanjo reviewed Oct 11, 2021

View reviewed changes

remeh added this to the 5.4.0 milestone Oct 18, 2021

djmitche marked this pull request as draft November 1, 2021 17:05

This was referenced Dec 20, 2024

Periodic subsystem skipping ticks sidekiq/sidekiq#6560

Closed

Sidekiq Pro dogstatsd should allow a singleton instance sidekiq/sidekiq#6561

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[sender] refactor for a simpler multi-thread behavior #209

[sender] refactor for a simpler multi-thread behavior #209

remeh commented Sep 27, 2021

ivoanjo left a comment

ivoanjo Oct 11, 2021

ivoanjo Oct 11, 2021

ivoanjo Oct 11, 2021

ivoanjo Oct 11, 2021

ivoanjo Oct 11, 2021

ivoanjo Oct 11, 2021

ivoanjo Oct 11, 2021

		# Initialize and get the thread's sync queue
		queue = (Thread.current[:statsd_sync_queue] \|\|= Queue.new)

[sender] refactor for a simpler multi-thread behavior #209

Are you sure you want to change the base?

[sender] refactor for a simpler multi-thread behavior #209

Conversation

remeh commented Sep 27, 2021

ivoanjo left a comment

Choose a reason for hiding this comment

ivoanjo Oct 11, 2021

Choose a reason for hiding this comment

ivoanjo Oct 11, 2021

Choose a reason for hiding this comment

ivoanjo Oct 11, 2021

Choose a reason for hiding this comment

ivoanjo Oct 11, 2021

Choose a reason for hiding this comment

ivoanjo Oct 11, 2021

Choose a reason for hiding this comment

ivoanjo Oct 11, 2021

Choose a reason for hiding this comment

ivoanjo Oct 11, 2021

Choose a reason for hiding this comment