Support configuration of num_workers and max_samples_per_sequence in llava_next_pretrain #12195

bernardhan33 · 2025-02-14T20:43:09Z

Is your feature request related to a problem? Please describe.

The llava_next_pretrain has been the go-to pretraining script for my LLaVA model pretraining because of #11931 .

It would be great to support modifying the num_workers instead of the hardcoded 32 for Energon data loader. ref.

Additionally, we noticed that it would be important to pass in the max_samples_per_sequence parameter (ref). Otherwise, there are no limits to allocating shards to the workers and there seems to be an issue with Energon (that I will file a bug to Energon repo and share here) that there are duplicate shard ranges allocated to the same worker.

Describe the solution you'd like

Support configuration of num_workers and max_samples_per_sequence through flags.

Describe alternatives you've considered

Modify / Patch the source code directly.

Additional context

Creating this issue and I can follow up with a CL.

Related Energon issue: NVIDIA/Megatron-Energon#70

The text was updated successfully, but these errors were encountered:

bernardhan33 assigned okuchaiev Feb 14, 2025

This was referenced Feb 14, 2025

Allocation of duplicate shard_range to a single worker NVIDIA/Megatron-Energon#70

Closed

Support customization of a few parameters in scripts/vlm/llava_next_pretrain #12218

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support configuration of num_workers and max_samples_per_sequence in llava_next_pretrain #12195

Support configuration of num_workers and max_samples_per_sequence in llava_next_pretrain #12195

bernardhan33 commented Feb 14, 2025 •

edited

Loading

Support configuration of num_workers and max_samples_per_sequence in llava_next_pretrain #12195

Support configuration of num_workers and max_samples_per_sequence in llava_next_pretrain #12195

Comments

bernardhan33 commented Feb 14, 2025 • edited Loading

bernardhan33 commented Feb 14, 2025 •

edited

Loading