Commit 1d1ed78

yoonhyejin and hsheth2 authored
docs: update mlflow ingestion docs to include new concept mappings (#12791)
Co-authored-by: Harshal Sheth
1 parent cf0dc3a commit 1d1ed78

1 file changed, +7 -5 lines changed

metadata-ingestion/docs/sources/mlflow/mlflow_pre.md

@@ -2,8 +2,10 @@
 
 This ingestion source maps the following MLflow Concepts to DataHub Concepts:
 
-| Source Concept | DataHub Concept | Notes |
-|:---------------------------------------------------------------------------------:|:-------------------------------------------------------------------------------------------:|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
-| [`Registered Model`](https://mlflow.org/docs/latest/model-registry.html#concepts) | [`MlModelGroup`](https://datahubproject.io/docs/generated/metamodel/entities/mlmodelgroup/) | The name of a Model Group is the same as a Registered Model's name (e.g. my_mlflow_model) |
-| [`Model Version`](https://mlflow.org/docs/latest/model-registry.html#concepts) | [`MlModel`](https://datahubproject.io/docs/generated/metamodel/entities/mlmodel/) | The name of a Model is `{registered_model_name}{model_name_separator}{model_version}` (e.g. my_mlflow_model_1 for Registered Model named my_mlflow_model and Version 1, my_mlflow_model_2, etc.) |
-| [`Model Stage`](https://mlflow.org/docs/latest/model-registry.html#concepts) | [`Tag`](https://datahubproject.io/docs/generated/metamodel/entities/tag/) | The mapping between Model Stages and generated Tags is the following:<br/>- Production: mlflow_production<br/>- Staging: mlflow_staging<br/>- Archived: mlflow_archived<br/>- None: mlflow_none |
+| Source Concept | DataHub Concept | Notes |
+|:-----------------------------------------------------------------------------------:|:---------------------------------------------------------------------------------------------------------:|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| [`Registered Model`](https://mlflow.org/docs/latest/model-registry/#registered-model) | [`MlModelGroup`](https://datahubproject.io/docs/generated/metamodel/entities/mlmodelgroup/) | The name of a Model Group is the same as a Registered Model's name (e.g. my_mlflow_model). Registered Models serve as containers for multiple versions of the same model in MLflow. |
+| [`Model Version`](https://mlflow.org/docs/latest/model-registry/#model-version) | [`MlModel`](https://datahubproject.io/docs/generated/metamodel/entities/mlmodel/) | The name of a Model is `{registered_model_name}{model_name_separator}{model_version}` (e.g. my_mlflow_model_1 for Registered Model named my_mlflow_model and Version 1, my_mlflow_model_2, etc.). Each Model Version represents a specific iteration of a model with its own artifacts and metadata. |
+| [`Experiment`](https://mlflow.org/docs/latest/tracking/#experiments) | [`Container`](https://datahubproject.io/docs/generated/metamodel/entities/container/) | Each Experiment in MLflow is mapped to a Container in DataHub. Experiments organize related runs and serve as logical groupings for model development iterations, allowing tracking of parameters, metrics, and artifacts. |
+| [`Run`](https://mlflow.org/docs/latest/tracking/#runs) | [`DataProcessInstance`](https://datahubproject.io/docs/generated/metamodel/entities/dataprocessinstance/) | Captures the run's execution details, parameters, metrics, and lineage to a model. |
+| [`Model Stage`](https://mlflow.org/docs/latest/model-registry/#deprecated-using-model-stages) | [`Tag`](https://datahubproject.io/docs/generated/metamodel/entities/tag/) | The mapping between Model Stages and generated Tags is the following:<br/>- Production: mlflow_production<br/>- Staging: mlflow_staging<br/>- Archived: mlflow_archived<br/>- None: mlflow_none. Model Stages indicate the deployment status of each version. |
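
The naming rule in the Model Version row and the stage-to-tag mapping in the Model Stage row can be illustrated with a small Python sketch. This is not the DataHub source implementation: the function names `datahub_model_name` and `datahub_stage_tag` are hypothetical, and the default separator `"_"` is an assumption inferred from the `my_mlflow_model_1` example in the table.

```python
# Illustrative sketch only; it mirrors the conventions documented in the
# table above and is NOT taken from the DataHub MLflow source code.

# Stage -> tag mapping exactly as listed in the "Model Stage" row.
STAGE_TO_TAG = {
    "Production": "mlflow_production",
    "Staging": "mlflow_staging",
    "Archived": "mlflow_archived",
    "None": "mlflow_none",
}


def datahub_model_name(
    registered_model_name: str,
    model_version: str,
    model_name_separator: str = "_",  # assumed default, inferred from the example
) -> str:
    """Compose {registered_model_name}{model_name_separator}{model_version}."""
    return f"{registered_model_name}{model_name_separator}{model_version}"


def datahub_stage_tag(stage: str) -> str:
    """Return the tag name documented for a given MLflow Model Stage."""
    return STAGE_TO_TAG[stage]


if __name__ == "__main__":
    print(datahub_model_name("my_mlflow_model", "1"))  # my_mlflow_model_1
    print(datahub_stage_tag("Staging"))                # mlflow_staging
```

With these assumptions, Version 2 of the same Registered Model would be named `my_mlflow_model_2`, matching the example given in the Notes column.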
