English

English

Overview

Install

Pre-installation Configuration

Install Alauda AI Essentials

Install Alauda AI

Upgrade

Upgrade from AI 1.3

Uninstall

Infrastructure Management

Device Management

About Alauda Build of Hami

About Alauda Build of NVIDIA GPU Device Plugin

Multi-Tenant

Guides

Namespace Management

Workbench

Overview

How To

Create WorkspaceKind

Create Workbench

Model Deployment & Inference

Overview

Inference Service

Guides

Inference Service

How To

Extend Inference Runtimes

Configure External Access for Inference Services

Configure Scaling for Inference Services

Troubleshooting

Experiencing Inference Service Timeouts with MLServer Runtime

Inference Service Fails to Enter Running State

Model Management

Guides

Model Repository

Monitoring & Ops

Overview

Features Overview

Logging & Tracing

Guides

Resource Monitoring

Guides

Resource Monitoring

API Reference

Kubernetes APIs

Inference Service APIs

ClusterServingRuntime [serving.kserve.io/v1alpha1]

InferenceService [serving.kserve.io/v1beta1]

Workbench APIs

Workspace Kind [kubeflow.org/v1beta1]

Workspace [kubeflow.org/v1beta1]

Manage APIs

AmlNamespace [manage.aml.dev/v1alpha1]

Operator APIs

AmlCluster [amlclusters.aml.dev/v1alpha1]

Previous PageFeatures

Next PageIntroduction

Inference Service

Introduction

Introduction

Guides

Inference Service

Advantages
Core Features
Create inference service
Inference Service Template Management
Inference service update
Calling the published inference service

How To

Extend Inference Runtimes

Introduction
Scenarios
Prerequisites
Steps

Configure External Access for Inference Services

Introduction
Steps

Configure Scaling for Inference Services

Introduction
Steps

Troubleshooting

Experiencing Inference Service Timeouts with MLServer Runtime

Problem Description
Root Cause Analysis
Solutions
Summary

Inference Service Fails to Enter Running State

Problem Description
Root Cause Analysis
Solutions
Summary