
# Model Deployment & Inference

## Overview

Introduction

  • Model Management
  • Inference Service

Features

  • Model Management
  • Inference Service

## Inference Service

  • Introduction
  • Guides
  • How To
  • Troubleshooting

## Model Management

  • Introduction
  • Guides
