Skip to content

Azure online endpoint is Scaling taking long time #3515

@newstar85

Description

@newstar85

Operating System

Linux

Version Information

Using the latest version of ML online endpoint

Steps to reproduce

I have a project to run an AI model using an online endpoint as a backend service, the endpoint is configured (manually set in the portal) to be auto-scale based on the number of requests.

Image

Expected behavior

Expect the endpoint will scale up between 1-2 minutes like other services such as virtual machine scaleset, etc...

Actual behavior

With endpoint, scaling takes a long time, about 12-18 minutes.

Addition information

Do you have suggestions for speeding up the scaling time?

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions