MLFlow#
MLFlow is an open-source platform for managing the end-to-end machine learning lifecycle, including experiment tracking, model versioning, and deployment. With ArchiTXT’s integration, you can effortlessly log experiment executions, traces, and key metrics directly to MLFlow for streamlined monitoring and analysis.
ArchiTXT creates a dedicated MLFlow experiment for logging and every simplification process is recorded as a separate run in this experiment. This detailed logging allows you to:
Track the progress and performance of each simplification step.
Compare different runs to identify the most effective parameters.
Easily navigate through experiment histories using the MLFlow UI.
Configure MLFlow#
Tip
By default, MLFlow logs experiments to a local directory. It is the recommended solution if you just want to try MLFlow.
To connect ArchiTXT to a remote MLFlow tracking server, set the environment variable MLFLOW_TRACKING_URI:
$ export MLFLOW_TRACKING_URI=http://127.0.0.1:5000 # Replace with your remote host
You can also set the tracking URI in your Python code:
import mlflow
mlflow.set_tracking_uri("http://127.0.0.1:5000") # Replace with your remote host
Run Experiments#
ArchiTXT can log experiments to MLFlow if it is executed within an active MLFlow run. In Python, you can create a run as follows:
import mlflow
with mlflow.start_run():
... # <- Your code here
Once the run is started, execute your experiments as usual, and ArchiTXT will automatically handle the logging.
You can also enable MLFlow logging whe using the CLI by using the –log option.
Explore Data#
Visualize your logged data using the MLFlow web interface. If running locally, you can start the MLFlow UI by running:
$ mlflow ui
Open your browser and navigate to the default URL (usually http://127.0.0.1:5000). In the web UI, you can review your experiment details and performance metrics.
See also
MLFlow Documentation for more details on MLFlow and its capabilities.