docs: clarify reschedule, migrate, and replacement terminology #24929

tgross · 2025-01-23T20:14:11Z

Our vocabulary around scheduler behaviors outside of the `reschedule` and `migrate` blocks leaves room for confusion around whether the reschedule tracker should be propagated between allocations. There are effectively five different behaviors we need to cover:

* restart: when the tasks of an allocation fail and we try to restart the tasks in place.
* reschedule: when the `restart` block runs out of attempts (or the allocation fails before tasks even start), and we need to move the allocation to another node to try again.
* migrate: when the user has asked to drain a node and we need to move the allocations. These are not failures, so we don't want to propagate the reschedule tracker.
* replacement: when a node is lost, we don't count that against the `reschedule` tracker for the allocations on the node (it's not the allocation's "fault", after all). We don't want to run the `migrate` machinery here either, as we can't contact the down node. To the scheduler, this is effectively the same as if we bumped the `group.count`.
* replacement for `disconnect.replace = true`: this is a replacement, but the replacement is intended to be temporary, so we propagate the reschedule tracker.
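The five behaviors above can be summarized as a small lookup. This is only a sketch to make the vocabulary concrete: the `Behavior` type and its methods are hypothetical names, not Nomad code, and the claim that a plain reschedule carries the tracker forward is my reading of the description rather than something stated outright.

```go
package main

import "fmt"

// Behavior is a hypothetical label for one of the five scheduler
// behaviors described in this PR. It does not exist in Nomad.
type Behavior int

const (
	Restart               Behavior = iota // tasks restarted in place
	Reschedule                            // restart attempts exhausted, try another node
	Migrate                               // user drained the node
	Replacement                           // node was lost
	DisconnectReplacement                 // temporary replacement, disconnect.replace = true
)

// String returns a short label for printing.
func (b Behavior) String() string {
	return [...]string{
		"restart",
		"reschedule",
		"migrate",
		"replacement",
		"replacement (disconnect.replace)",
	}[b]
}

// CreatesNewAllocation reports whether the behavior produces a new
// allocation. A restart happens in place, so it does not.
func (b Behavior) CreatesNewAllocation() bool {
	return b != Restart
}

// PropagatesRescheduleTracker reports whether the new allocation should
// inherit the reschedule tracker. Assumption: plain reschedules carry
// the tracker forward; the other cases follow the list above.
func (b Behavior) PropagatesRescheduleTracker() bool {
	switch b {
	case Reschedule, DisconnectReplacement:
		return true
	default:
		return false
	}
}

func main() {
	for b := Restart; b <= DisconnectReplacement; b++ {
		fmt.Printf("%-34s new alloc: %-5t propagate tracker: %t\n",
			b, b.CreatesNewAllocation(), b.PropagatesRescheduleTracker())
	}
}
```

The point of the table is that "creates a new allocation" and "counts against the reschedule tracker" are separate questions, which is where the wording in the docs tended to blur.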

Add a section to the `reschedule`, `migrate`, and `disconnect` blocks explaining when each item applies. Update the use of the word "reschedule" in several places where "replacement" is correct, and vice-versa.

Fixes: #24918

major preview links:


schmichael · 2025-01-24T01:26:18Z

command/job_restart.go

@@ -132,7 +132,7 @@ Usage: nomad job restart [options] <job>
  groups are restarted.

  When rescheduling, the current allocations are stopped triggering the Nomad
-  scheduler to create replacement allocations that may be placed in different
+  scheduler to create new allocations that may be placed in different


So this whole command text keeps using the term "reschedule" despite internally using the migrate infrastructure when -reschedule is specified:

nomad/nomad/alloc_endpoint.go

Lines 318 to 326 in c1dc9ed

    transitionReq := &structs.AllocUpdateDesiredTransitionRequest{
        Evals: []*structs.Evaluation{eval},
        Allocs: map[string]*structs.DesiredTransition{
            args.AllocID: {
                Migrate:         pointer.Of(true),
                NoShutdownDelay: pointer.Of(args.NoShutdownDelay),
            },
        },
    }

This is important for users to understand, as it means it is safe to run nomad job restart -reschedule without risking downtime, provided your migrate{} block is properly configured.

Sadly we had no rigor around our definitions of "reschedule" when this command was written: it uses the word in its most straightforward sense of "is scheduled again" and not in reference to the reschedule{} block or associated scheduling logic. For example, the RescheduleTracker is empty for jobs "rescheduled" with nomad job restart -reschedule.
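To make the distinction concrete, here is a toy model of why the tracker stays empty. Everything in it is a simplified stand-in: `DesiredTransition` keeps only the `Migrate` field from the snippet above, and `Allocation`, `nextAllocation`, and the string-slice tracker are hypothetical and do not reflect how the scheduler is actually implemented. It also assumes that any transition without `Migrate` set is a failure-driven reschedule, which is a simplification.

```go
package main

import "fmt"

// DesiredTransition is a simplified stand-in for the Nomad struct of
// the same name; only the Migrate field is modeled here.
type DesiredTransition struct {
	Migrate *bool
}

// Allocation is a hypothetical, minimal allocation. RescheduleTracker
// is modeled as a list of previous allocation IDs.
type Allocation struct {
	ID                string
	RescheduleTracker []string
}

// ptr returns a pointer to b, similar in spirit to pointer.Of.
func ptr(b bool) *bool { return &b }

// nextAllocation builds the allocation that follows prev. In this toy
// model a migrate transition starts with an empty tracker, and any
// other transition is treated as a failure-driven reschedule that
// copies the tracker and records prev.
func nextAllocation(prev Allocation, t DesiredTransition, newID string) Allocation {
	next := Allocation{ID: newID}
	if t.Migrate != nil && *t.Migrate {
		return next // migrations are not failures: tracker stays empty
	}
	next.RescheduleTracker = append([]string{}, prev.RescheduleTracker...)
	next.RescheduleTracker = append(next.RescheduleTracker, prev.ID)
	return next
}

func main() {
	prev := Allocation{ID: "alloc-1", RescheduleTracker: []string{"alloc-0"}}

	// What "job restart -reschedule" requests, per the snippet above.
	migrated := nextAllocation(prev, DesiredTransition{Migrate: ptr(true)}, "alloc-2")
	fmt.Println("after -reschedule (migrate):", len(migrated.RescheduleTracker), "events")

	// A failure-driven reschedule in this toy model.
	rescheduled := nextAllocation(prev, DesiredTransition{}, "alloc-3")
	fmt.Println("after failure reschedule:", len(rescheduled.RescheduleTracker), "events")
}
```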

So now what?

Good question. To enforce purity we'd have to break compatibility as the command flag itself is "-reschedule". Not only is that unacceptable, but what would we call it? -migrate would be "correct" but endlessly confusing to users.

So I think we're going to have to live with 2 definitions of "reschedule" in the Nomad code base:

reschedule - to schedule again

reschedule{} - the parameters and associated scheduling logic that occurs when restart{}s are exhausted.

So ... are you actually suggesting any changes?

...no. I think the original "replacement" was fine. The new "new" is obviously also accurate. None of this command text adheres to our nomenclature, but I'm not sure it's worth updating.

It's not ideal, but maybe we could at least explain that the command flag name is inaccurate? And/or we could make -reschedule an alias for -migrate/-replace(?) and mark -reschedule for deprecation?
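For what the alias idea could look like, here is a rough sketch using only the Go standard library `flag` package. It is not how the Nomad CLI wires up its flags, `parseRestartFlags` is a hypothetical helper, and `-migrate` is just one of the candidate names floated above, not an existing flag.

```go
package main

import (
	"flag"
	"fmt"
	"io"
)

// parseRestartFlags parses a hypothetical subset of "job restart"
// flags. Both -migrate and the deprecated -reschedule set the same
// value; usedDeprecated reports whether the old name was given.
func parseRestartFlags(args []string) (migrate bool, usedDeprecated bool, err error) {
	fs := flag.NewFlagSet("job restart", flag.ContinueOnError)
	fs.SetOutput(io.Discard) // keep the sketch quiet on parse errors

	// Two flag names bound to one variable make the old name an alias.
	fs.BoolVar(&migrate, "migrate", false, "Stop allocations and let the scheduler place new ones.")
	fs.BoolVar(&migrate, "reschedule", false, "Deprecated: alias for -migrate.")

	if err = fs.Parse(args); err != nil {
		return false, false, err
	}

	// Visit only walks flags that were actually set on the command line.
	fs.Visit(func(f *flag.Flag) {
		if f.Name == "reschedule" {
			usedDeprecated = true
		}
	})
	return migrate, usedDeprecated, nil
}

func main() {
	migrate, deprecated, err := parseRestartFlags([]string{"-reschedule"})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	if deprecated {
		fmt.Println("warning: -reschedule is deprecated, use -migrate")
	}
	fmt.Println("migrate:", migrate)
}
```

Keeping the old name as an alias avoids the compatibility break, at the cost of carrying both spellings until the deprecated one can be removed.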

vercel bot deployed to Preview – nomad-ui January 23, 2025 20:15 View deployment

tgross mentioned this pull request Jan 23, 2025

lost allocation drops reschedule tracker #24918

Open

vercel bot deployed to Preview – nomad January 23, 2025 20:20 View deployment

tgross force-pushed the docs-replacement-vs-reschedule branch from 39f153d to e3afb1f Compare January 23, 2025 20:24

vercel bot deployed to Preview – nomad-ui January 23, 2025 20:25 View deployment

vercel bot deployed to Preview – nomad January 23, 2025 20:30 View deployment

tgross marked this pull request as ready for review January 23, 2025 20:46

tgross requested review from a team as code owners January 23, 2025 20:46

tgross requested review from LeahMarieBush, pkazmierczak, schmichael, mismithhisler and aimeeu January 23, 2025 20:46

schmichael reviewed Jan 24, 2025
