This repository has been archived by the owner on Nov 5, 2018. It is now read-only.

Issue #26: Scope stats on running, completed and panicked threads #27

Open: oliver-giersch wants to merge 13 commits into master
Conversation

oliver-giersch (Contributor)

I implemented the three method signatures suggested in issue #26 using three atomic counters, which are updated each time a thread is spawned or exits via a guard/sentinel struct.
I also implemented two very basic tests to provide at least some assurance that they work as intended.
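
Roughly, the guard/sentinel idea looks like this (a simplified sketch; the names here are illustrative rather than the exact ones used in the PR):

use std::sync::atomic::{AtomicUsize, Ordering};

#[derive(Default)]
struct ScopeStats {
    running: AtomicUsize,
    completed: AtomicUsize,
    panicked: AtomicUsize,
}

// Guard whose `Drop` impl runs even if the closure panics, so every
// thread exit is recorded exactly once.
struct Sentinel<'s> {
    stats: &'s ScopeStats,
    completed: bool,
}

impl<'s> Drop for Sentinel<'s> {
    fn drop(&mut self) {
        self.stats.running.fetch_sub(1, Ordering::SeqCst);
        if self.completed {
            self.stats.completed.fetch_add(1, Ordering::SeqCst);
        } else {
            self.stats.panicked.fetch_add(1, Ordering::SeqCst);
        }
    }
}

// Runs the spawned closure while keeping the counters up to date.
fn run_counted<F: FnOnce()>(stats: &ScopeStats, f: F) {
    stats.running.fetch_add(1, Ordering::SeqCst);
    let mut sentinel = Sentinel { stats, completed: false };
    f();
    sentinel.completed = true; // only reached if `f` returned normally
}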

@ghost left a comment

Thank you for the PR!

Instead of introducing Sentinel<'s>, you could tweak the invocation of builder_spawn_unchecked like this:

builder_spawn_unchecked(self.builder, move || {
    self.scope.stats.running.fetch_add(1, Ordering::SeqCst);
    let mut result = Box::from_raw(result as *mut ManuallyDrop<T>);

    let res = panic::catch_unwind(panic::AssertUnwindSafe(f));
    self.scope.stats.running.fetch_sub(1, Ordering::SeqCst);

    match res {
        Ok(r) => {
            self.scope.stats.completed.fetch_add(1, Ordering::SeqCst);
            *result = r;
            mem::forget(result);
        }
        Err(e) => {
            self.scope.stats.panicked.fetch_add(1, Ordering::SeqCst);
            panic::resume_unwind(e);
        }
    }
})

But it works either way. :)


There seems to be a small window of time between decrementing running and incrementing panicked/completed, which means polling the counts might give inconsistent results.

An easy way of fixing this would be to put the stats inside a Mutex rather than have them as three independent atomic variables. Another solution might be to try squeezing the three counters into a single AtomicUsize.
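
For illustration only, the Mutex variant could look roughly like this (a sketch; the names are made up):

use std::sync::Mutex;

#[derive(Default, Clone, Copy)]
struct Stats {
    running: usize,
    completed: usize,
    panicked: usize,
}

// All three counters live behind one lock, so a reader always gets a
// mutually consistent snapshot.
#[derive(Default)]
struct ScopeStats {
    inner: Mutex<Stats>,
}

impl ScopeStats {
    fn snapshot(&self) -> Stats {
        *self.inner.lock().unwrap()
    }

    fn thread_spawned(&self) {
        self.inner.lock().unwrap().running += 1;
    }

    fn thread_completed(&self) {
        let mut s = self.inner.lock().unwrap();
        s.running -= 1;
        s.completed += 1;
    }

    fn thread_panicked(&self) {
        let mut s = self.inner.lock().unwrap();
        s.running -= 1;
        s.panicked += 1;
    }
}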

But maybe it'd be best to do the following. Have three atomic counters, panicked, completed, and spawned. They are only incremented, never decremented. Then we can compute the number of running threads like this:

fn running_threads(&self) -> usize {
    loop {
        let p1 = self.stats.panicked.load(SeqCst);
        let c1 = self.stats.completed.load(SeqCst);

        let s = self.stats.spawned.load(SeqCst);

        let p2 = self.stats.panicked.load(SeqCst);
        let c2 = self.stats.completed.load(SeqCst);

        if p1 == p2 && c1 == c2 {
            return s - p1 - c1;
        }
    }
}

This method would give fully consistent results: if panicked and completed are unchanged across the read of spawned, then no thread finished in that window, so the subtraction is an exact snapshot of the running count at the moment spawned was read.

@jeehoonkang (Collaborator)

I just wonder -- do the stats need to be very precise? I think the use case in #26 only requires the stats to be eventually consistent, i.e. they will eventually settle at the correct values.

@ghost commented Jul 25, 2018

Maybe not - perhaps I'm just being overly pedantic. :)
But the solution proposed above would be fairly simple and consistent.

@jeehoonkang (Collaborator)

@oliver-giersch First of all, thanks for the PR! It's very fast :)

I just want to say that I hope stats are implemented as an add-on rather than by default. Maintaining stats has some cost, however small, so I'd like users to be able to opt out.

@ghost commented Jul 25, 2018

@jeehoonkang I don't think stats have any measurable cost. :)

Spawning a thread is a very costly operation by itself (we also allocate some memory for the destructor chain and so on...) so the cost of three atomic operations per spawned thread is basically zero.

@oliver-giersch (Contributor, Author)

@jeehoonkang Coincidentally, I had been reading the scoped thread code over the last two days to get a better understanding, so when I saw the issue I just went ahead and tried my hand at it ;)

@stjepang
Thanks for the reply. I am not too experienced in this subject; the subtleties of parallelism still often elude me.

I could go ahead and drop the sentinel struct in favor of a catch_unwind() approach, though I'm not sure what the benefits would be.

I also had the following solution for truly consistent atomic stats in mind (not tested yet):

#[derive(Default)]
struct AtomicScopeStats(AtomicUsize);

#[cfg(target_pointer_width = "64")]
impl AtomicScopeStats {
    const COUNTER_SIZE: usize = 21;

    const RUNNING_MASK   : usize = 0x00000000001FFFFF;
    const COMPLETED_MASK : usize = 0x000003FFFFE00000;
    const PANICKED_MASK  : usize = 0x7FFFFC0000000000;

    const INC_RUNNING    : usize = 0x0000000000000001;
    const INC_COMPLETED  : usize = 0x0000000000200000;
    const INC_PANICKED   : usize = 0x0000040000000000;
}

impl AtomicScopeStats {
    #[inline]
    pub fn running_count(&self) -> usize {
        self.0.load(Ordering::SeqCst) & Self::RUNNING_MASK
    }

    #[inline]
    pub fn completed_count(&self) -> usize {
        self.0.load(Ordering::SeqCst) & Self::COMPLETED_MASK
    }

    #[inline]
    pub fn panicked_count(&self) -> usize {
        self.0.load(Ordering::SeqCst) & Self::PANICKED_MASK
    }

    #[inline]
    pub fn inc_running_count(&self) {
        self.0.fetch_add(Self::INC_RUNNING, Ordering::SeqCst);
    }

    #[inline]
    pub fn update_completed_count(&self) {
        self.0.fetch_add(Self::INC_COMPLETED - Self::INC_RUNNING, Ordering::SeqCst);
    }

    #[inline]
    pub fn update_panicked_count(&self) {
        self.0.fetch_add(Self::INC_PANICKED - Self::INC_RUNNING, Ordering::SeqCst);
    }
}

This has the obvious issue of not being portable, and the limit of pointer-width / 3 bits per counter might be too restrictive on non-64-bit architectures.
In that case, I would assume an RwLock would be preferable over the solution with the loop, no?

Also, I do have some questions regarding a bunch of details in the code for the scoped threads; would it be OK if I sent you an email or something?

@ghost commented Jul 25, 2018

I could go ahead and drop the sentinel struct in favor of a catch_unwind() approach, though I'm not sure what the benefits would be.

The benefit is somewhat simpler code, but in the end it's the same thing.

I also had the following solution for truly consistent atomic stats in mind (not tested yet):

This will work, except the result needs to be shifted right in fn completed_count and fn panicked_count. But as you say, this isn't portable and there might not be enough bits to squeeze three numbers into one atomic usize. Also it's kind of messy.
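
Concretely, the two getters could become something like this (just a sketch, reusing the constants and AtomicScopeStats from your snippet):

impl AtomicScopeStats {
    #[inline]
    pub fn completed_count(&self) -> usize {
        // mask out the middle 21 bits, then shift them down to a plain count
        (self.0.load(Ordering::SeqCst) & Self::COMPLETED_MASK) >> Self::COUNTER_SIZE
    }

    #[inline]
    pub fn panicked_count(&self) -> usize {
        // the panicked counter sits two counter-widths up
        (self.0.load(Ordering::SeqCst) & Self::PANICKED_MASK) >> (2 * Self::COUNTER_SIZE)
    }
}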

Mutexes are fine, but polling counters (using the three public methods) might be a bit slower than necessary due to locking. If we want consistency, I think it's best to just implement fn running_threads like in this comment.

Also, I do have some questions regarding a bunch of details in the code for the scoped threads; would it be OK if I sent you an email or something?

You can open an issue here, hop into #crossbeam at irc.mozilla.org, or send me an email. Whatever works for you. :)

@oliver-giersch (Contributor, Author) commented Jul 25, 2018

OK, I'll change these things accordingly then, probably tomorrow (and yeah, I forgot the right shift).

I'll also open a ticket for my questions/suggestions so others may chime in.

@ghost commented Jul 26, 2018

Why close this PR? I'm OK with merging the current changes. :)

@oliver-giersch (Contributor, Author)

Huh, I must have closed it by accident.

@ghost commented Jul 26, 2018

I see. Can you restore the deleted branch and reopen the PR?

@oliver-giersch (Contributor, Author)

I'll see what I can do tomorrow.

oliver-giersch restored the feature_scope_stats_#26 branch on July 27, 2018, 12:35
@jeehoonkang (Collaborator) commented Jul 27, 2018

I'm concerned about the generality of the stats. Different people have different requirements, and I think it's a little hard to justify one particular choice of stats. For example, I think it's enough to provide a query method that checks whether any threads have panicked; based on that, you can implement the feature #26 wants (and part of what this PR wants, too). In that case, I'd like to keep the scoped thread library fairly minimal and provide other functionality (such as stats) as add-ons.
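
For instance, such a query could be as small as a single flag (just a sketch; the names are illustrative):

use std::sync::atomic::{AtomicBool, Ordering};

// Set once by any panicking thread, queried by the scope afterwards.
#[derive(Default)]
struct PanicFlag(AtomicBool);

impl PanicFlag {
    fn record_panic(&self) {
        self.0.store(true, Ordering::SeqCst);
    }

    fn any_panicked(&self) -> bool {
        self.0.load(Ordering::SeqCst)
    }
}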

@ghost commented Jul 27, 2018

Also, I'd just like to point out that currently it isn't even possible to access &Scope from inside a scoped thread (the borrow checker rejects such code). Maybe we should solve that problem first. :)

@oliver-giersch (Contributor, Author)

I think I'll try to rewrite the code to allow for opt-in stats keeping. I have something like iterator adapters in mind: you turn the regular scope into a stats-keeping scope which exposes the same functionality as a scope, but with stats.

Also, I do have some concerns about the current running_threads() implementation with the loop, because I am not sure whether there might be any (unlikely/rare) circumstances where the loop doesn't terminate. It also seems a little icky. Maybe this stat could be discarded altogether, as I have no real idea what its use case would be.

@stjepang
Also, I'd just like to point out that currently it isn't even possible to access &Scope from inside a scoped thread (the borrow checker rejects such code). Maybe we should solve that problem first. :)

This is probably another issue altogether and would require Scope to implement Send + Sync, but it should be worth considering, though I am not entirely sure why you would want to use a scope in a spawned scoped thread instead of creating a new (inner) scope.

@ghost commented Jul 30, 2018

Just for comparison, the threadpool crate has similar methods for statistics: queued_count, active_count, panic_count.

The only two uses of panic_count I could find are:

@ghost commented Jul 30, 2018

though I am not entirely sure why you would want to use a scope in a spawned scoped thread instead of creating a new (inner) scope.

If you create a new inner scope, then all threads spawned inside it will be joined before the inner scope ends. Some users would like to spawn such inner threads without this restriction.
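
As a small illustration of the difference (this sketch uses a newer crossbeam_utils::thread API in which the spawn closure receives the scope as an argument, so the exact signatures are an assumption here):

use crossbeam_utils::thread;

fn main() {
    thread::scope(|outer| {
        outer.spawn(|_| {
            // A fresh inner scope joins all of its threads before this
            // closure is allowed to return...
            thread::scope(|inner| {
                inner.spawn(|_| println!("inner thread"));
            })
            .unwrap();
            println!("the inner thread has definitely finished here");
            // ...whereas a thread spawned on the outer scope would only be
            // joined when the outer scope itself ends.
        });
    })
    .unwrap();
}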

@oliver-giersch (Contributor, Author)

Summary of the latest commits:

  • stat counting is now entirely opt-in and separate in its own submodule
  • the code is entirely separate from the rest of the scoped thread module as well
  • the scope can keep track of spawned, running, completed and panicked threads

I have decided against the loop-based solution for the running count for now, because I didn't see what purpose it is meant to achieve. If the goal is to return the running count at the exact moment the function is called, a mutex around the entire ScopeStats struct seems to be the only really valid solution; the loop does not achieve a more accurate count than a simple subtraction, IMHO.
I'm willing to discuss this further, however.
