
Make MlflowMetricsDataSet capable of saving a Dict[str, float] object, as mlflow.log_metrics() does #440

Open
yury-fedotov opened this issue Aug 2, 2023 · 4 comments
Labels: enhancement (New feature or request)

Comments

yury-fedotov commented Aug 2, 2023

Description

At the moment, MlflowMetricsDataSet can only save an object that is a nested Dict. For example:

{
    "metric1": {"value": 1.1, "step": 1},
    "metric2": [{"value": 1.1, "step": 1}, {"value": 1.2, "step": 2}],
}

However, it cannot save an object like this:

{
    "metric1": 1.1,
    "metric2": 1.2,
}

This flat format is frequently used, and mlflow's native log_metrics() API handles it without issue.
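
For comparison, here is a minimal sketch of mlflow's native API handling the flat format (it assumes an active run against a local or configured tracking server):

import mlflow

# mlflow's own API accepts a flat Dict[str, float] directly
with mlflow.start_run():
    mlflow.log_metrics({"metric1": 1.1, "metric2": 1.2})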

Context

Having this support would be great to:

  • Make MlflowMetricsDataSet more compatible with mlflow.log_metrics()
  • Allow users to save a simple, non-nested Dict without the need to convert it to the nested format.

Possible Implementation

Either add a new dataset or handle this within the existing MlflowMetricsDataSet.
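
As a rough illustration only (not the project's actual code), the save logic could detect which of the two formats it receives and dispatch accordingly, along these lines:

from typing import Any, Dict

from mlflow.tracking import MlflowClient


def save_metrics(run_id: str, data: Dict[str, Any]) -> None:
    """Sketch: log metrics passed in either the flat or the nested format."""
    client = MlflowClient()
    for key, value in data.items():
        if isinstance(value, dict):
            # nested format: {"metric1": {"value": 1.1, "step": 1}}
            client.log_metric(run_id, key, value["value"], step=value.get("step"))
        elif isinstance(value, list):
            # history format: {"metric2": [{"value": 1.1, "step": 1}, ...]}
            for point in value:
                client.log_metric(run_id, key, point["value"], step=point.get("step"))
        else:
            # flat format: {"metric1": 1.1}, as mlflow.log_metrics() accepts
            client.log_metric(run_id, key, float(value))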

@yury-fedotov (Author)

P.S. I read about the flatten and recursive settings in mlflow.yml, but to my understanding those serve purposes other than the one I described above.

@Galileo-Galilei (Owner)

Hi, this is indeed something really needed, and it's been on my to-do list for a long time. As with MlflowMetricDataSet, we should likely modify MlflowMetricsDataSet to have the suggested behaviour and move the existing behaviour to a new MlflowMetricsHistoryDataSet.

The reason it is currently done like this is to maintain consistency between the save and load methods, but I acknowledge that the user experience is a pain.

Feel free to open a PR to help make this real ;)


lvijnck commented Aug 6, 2024

I'd be interested in helping with this one if needed. Has any work already been done on this, @Galileo-Galilei?

For the moment, I've implemented my own dataset, which I'd like to see replaced soon.

"""Custom Mlflow datasets."""
import mlflow

from copy import deepcopy
from typing import Any, Dict, Union

from mlflow.tracking import MlflowClient

from kedro_mlflow.io.metrics.mlflow_abstract_metric_dataset import (
    MlflowAbstractMetricDataset,
)


class MlflowMetricsDataset(MlflowAbstractMetricDataset):
    """Custom Mlflow dataset to save multiple metrics.

    The current kedro-mlflow implementation does not support storing multiple metrics
    through a single dataset. To be decommissioned once the following issue is addressed:

    https://github.com/Galileo-Galilei/kedro-mlflow/issues/440
    """

    SUPPORTED_SAVE_MODES = {"overwrite", "append"}
    DEFAULT_SAVE_MODE = "overwrite"

    def __init__(
        self,
        run_id: str = None,
        load_args: Dict[str, Any] = None,
        save_args: Dict[str, Any] = None,
        key_prefix: str = None,
    ):
        """Initialise MlflowMetricDataset.

        Args:
            run_id (str): The ID of the mlflow run where the metrics should be logged
            load_args: dataset load arguments
            save_args: dataset save arguments
            key_prefix: key prefix used for all metrics
        """
        self.run_id = run_id
        self._load_args = load_args or {}
        self._save_args = save_args or {}
        self._logging_activated = True
        self._key_prefix = key_prefix

        # We add an extra argument mode="overwrite" / "append" to enable updating an existing metric.
        # This is not an official mlflow argument for log_metric, so we separate it from the others.
        # "overwrite" corresponds to the default mlflow behaviour.
        self.mode = self._save_args.pop("mode", self.DEFAULT_SAVE_MODE)

    @property
    def run_id(self) -> Union[str, None]:
        """Get run id."""
        run = mlflow.active_run()
        if (self._run_id is None) and (run is not None):
            # if no run_id is specified, we try to retrieve the current run
            # this is useful because during a kedro run, we want to be able to retrieve
            # the metric from the active run to be able to reload a metric
            # without specifying the (unknown) run id
            return run.info.run_id

        # otherwise we return _run_id, which may be None.
        # In this case, saving will work (a new run will be created)
        # but loading will fail, according to mlflow's behaviour
        return self._run_id

    @run_id.setter
    def run_id(self, run_id: str):
        self._run_id = run_id

    def _load(self):
        # loading metrics back from mlflow is not implemented for this write-only dataset
        raise NotImplementedError()

    def _exists(self) -> bool:
        """Check whether the metrics exist in the mlflow tracking store.

        Returns:
            bool: Always False for now; existence checking is not implemented.
        """
        return False

    def _describe(self) -> Dict[str, Any]:
        """Describe MLflow metrics dataset.

        Returns:
            Dict[str, Any]: Dictionary with MLflow metrics dataset description.
        """
        return {
            "run_id": self.run_id,
        }

    def _save(self, data: Dict[str, Any]):
        if self._logging_activated:
            self._validate_run_id()
            run_id = self.run_id  # access it once instead of calling self.run_id everywhere, to avoid looking for an active run each time

            mlflow_client = MlflowClient()

            save_args = deepcopy(self._save_args)
            if self.mode != "overwrite":
                raise NotImplementedError(
                    f"Save mode '{self.mode}' is not implemented yet; only 'overwrite' is supported."
                )

            for key, value in data.items():
                # only prepend the prefix if one was provided, otherwise log the key as-is
                metric_key = f"{self._key_prefix}_{key}" if self._key_prefix else key
                mlflow_client.log_metric(
                    run_id=run_id,
                    key=metric_key,
                    value=value,
                    step=None,
                    **save_args,
                )
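
For reference, a minimal usage sketch of the custom dataset above, assuming it lives in a project-local module (my_project.datasets is a made-up path, and the base class is assumed to expose Kedro's usual save() entry point):

import mlflow

from my_project.datasets import MlflowMetricsDataset

with mlflow.start_run():
    # keys end up logged as "training_accuracy" and "training_f1"
    dataset = MlflowMetricsDataset(key_prefix="training")
    dataset.save({"accuracy": 0.93, "f1": 0.88})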

@Galileo-Galilei (Owner)

I don't think it was ever tackled properly. I'd be glad to get a PR if you can, @lvijnck!
