Inference API Deployment
Launches a REST API that accepts data and serves inferences from the loaded model (defined in deployment_specs.model_specs). By default, it also launches a Grafana instance to monitor model performance.
class InferenceAPIDeployment
Kubernetes Objects:
ConfigMap
Deployment
Service
HorizontalPodAutoscaler
Fields:

type: inferenceAPIDeployment
name: string, required - unique name of the deployment
template: string, default: src/octaipipe/configs/cloud_deployment/inference_api_deployment_template.yml
kubernetes_namespace: string, default: default
monitor_model: bool, default: True - sets up a Grafana dashboard which monitors model performance
deployment_specs: dict - see Placeholders below
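The fields above can be combined into a single deployment config. The following is an illustrative sketch only, assuming the field names from the reference above; the exact layout of the shipped template file may differ, and the name and model_specs values here are hypothetical placeholders:

```yaml
# Illustrative sketch of an InferenceAPIDeployment config.
# Field names follow the reference above; values are example
# placeholders, not taken from the shipped template.
type: inferenceAPIDeployment
name: my-inference-api          # required, unique deployment name (hypothetical)
kubernetes_namespace: default
monitor_model: True             # provisions the Grafana monitoring dashboard
deployment_specs:
  api_http_port: 8222           # default port for the REST API
  model_specs: |                # multi-line scalar value, required
    ...                         # model definition goes here
```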
Placeholders (deployment_specs or env variables):

deployment_name: string, default = deployment's name
api_http_port: int, default = 8222
MONITOR_DB_URL: string, default = monitor DB client URL
deployment_id: string, default = deployment id
model_specs: multi-line scalar value, required
e.g.: