Model Inference Step#
To run locally: model_inference
An ML lifecycle can be broken into two distinct phases. The first is the training phase, in which the model is created, or "trained", by running a specified subset of the dataset through it. ML inference is the second phase, in which the trained model is deployed to make predictions on live data, producing actionable outputs.
The inference step follows this workflow, with each stage corresponding to a section of the inference config file:

- Load an existing model: `model_specs`
- Load live test data: `input_data_specs`
- Perform model inference at intervals: `run_specs`
- Save live predictions: `output_data_specs`
The following is an example of a config file together with descriptions of its parts.
Step config example#
```yaml
name: model_inference

input_data_specs:
  datastore_type: influxdb
  query_type: dataframe
  query_template_path: ./configs/data/inference_query.txt
  query_values:
    start: 5d
    bucket: live-metrics
    measurement: def-metrics
    tags:
      MY_TAG: value_0
  data_converter: {}

output_data_specs:
  - datastore_type: influxdb
    settings:
      bucket: "test-write"
      measurement: "model-outputs"
  - datastore_type: local
    settings:
      path: "./logs/results"

model_specs:
  name: Machine_RUL
  type: ridge_reg
  version: "2.0"

run_specs:
  prediction_period: 10s
  save_results: True
  onnx_pred: false
```
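Once parsed, the config is an ordinary dictionary, so its sections can be sanity-checked before the step runs. The sketch below is a hypothetical helper, not the OctaiPipe validator; `missing_sections` and the inline config dict are illustrative only:

```python
# Hypothetical sketch: verify that a parsed inference config contains the
# four sections the step expects. Not part of the OctaiPipe API.
REQUIRED_SECTIONS = ("input_data_specs", "output_data_specs", "model_specs", "run_specs")

def missing_sections(config: dict) -> list:
    """Return the names of required sections absent from a parsed config."""
    return [s for s in REQUIRED_SECTIONS if s not in config]

# A trimmed stand-in for the parsed YAML config above.
config = {
    "name": "model_inference",
    "input_data_specs": {"datastore_type": "influxdb"},
    "output_data_specs": [{"datastore_type": "local"}],
    "model_specs": {"name": "Machine_RUL", "type": "ridge_reg", "version": "2.0"},
    "run_specs": {"prediction_period": "10s", "save_results": True, "onnx_pred": False},
}

print(missing_sections(config))  # []
```

Note that quoting `version: "2.0"` keeps it a string when the YAML is parsed; unquoted it would load as a float.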
Input and Output Data Specs#
`input_data_specs` and `output_data_specs` follow a standard format across all the pipeline steps; see Octaipipe Steps.
Model Specs#
```yaml
model_specs:
  name: Machine_RUL
  type: ridge_reg
  version: '2.0'
```
Specifications of the model to be loaded for inference. See Model Training Step.
Run Specs#
```yaml
run_specs:
  prediction_period: 10s
  save_results: True
  onnx_pred: false
```
- `prediction_period`: the sleep interval between consecutive predictions.
- `save_results`: whether to save the results both locally and to the output targets defined in `output_data_specs` (InfluxDB or the SQL database).
- `onnx_pred`: whether to use the ONNX file of the trained model for inference. If false, the joblib model file is used.
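Taken together, `run_specs` drives a simple predict-and-sleep loop. The following is an illustrative sketch only: `parse_period`, `run_inference`, and the `fetch_live_data`/`save_predictions` callables are hypothetical stand-ins, not OctaiPipe's actual API.

```python
import time

def parse_period(period: str) -> float:
    """Convert a duration string like '10s', '5m', '2h' or '5d' to seconds."""
    units = {"s": 1, "m": 60, "h": 3600, "d": 86400}
    return float(period[:-1]) * units[period[-1]]

def run_inference(model, run_specs, fetch_live_data, save_predictions, n_iters=3):
    """Predict on live data every prediction_period seconds.

    onnx_pred would select the model artifact upstream: the real step loads
    the .onnx file (e.g. via onnxruntime) when True, the joblib file otherwise.
    """
    interval = parse_period(run_specs["prediction_period"])
    for _ in range(n_iters):          # the real step loops until stopped
        batch = fetch_live_data()     # data described by input_data_specs
        preds = model.predict(batch)
        if run_specs["save_results"]:
            save_predictions(preds)   # targets from output_data_specs
        time.sleep(interval)

print(parse_period("10s"))  # 10.0
```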