Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Workflow checkpoints #73

Open
LouisCarpentier42 opened this issue Jan 14, 2025 · 1 comment
Open

Workflow checkpoints #73

LouisCarpentier42 opened this issue Jan 14, 2025 · 1 comment
Labels
Workflow Improvements regarding the workflow

Comments

@LouisCarpentier42
Copy link
Collaborator

Currently, the workflow executes all the pipelines, scores the performance, and returns the results. However, if there are many algorithms and many datasets, then this might take a long time.

It would be benificial to do some checkpointing in the workflow, for example every 100 jobs are the results-so-far saved, and then the workflow is continued. How often to be saved can be passed as an argument to the constructor of the Workflow. In this regard, it might also be good to save the results automatically in the workflow (maybe decided on a hyperparameter), instead of only returning the results, to ensure a unified format.

In addition, some methods to decide which jobs to still execute can help to restart the process if some problem occurred. In addition, some methods can be created to obtain all the jobs with an error in the workflow.

@LouisCarpentier42 LouisCarpentier42 added the Workflow Improvements regarding the workflow label Jan 14, 2025
@LouisCarpentier42
Copy link
Collaborator Author

A further improvement of this would be to keep track of some internal state of the workflow (i.e., which jobs have already been executed, which jobs should still be executed?) Then, if something happens while running the workflow (e.g., an interrupt signal is send), the workflow can save the current state. This would allow to continue running the workflow from the current state, without having to redo all passed experiments.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Workflow Improvements regarding the workflow
Projects
None yet
Development

No branches or pull requests

1 participant