[Feature]: Add the ability to schedule a workload to different clusters depending on capacity #51

@ritazh

Description

Is there an existing feature request for this?

  • I have searched the existing issues

Problem or Motivation

Because of GPU shortages, users may bring multiple clusters in different regions. When deploying a model for serving or another AI workload, the workload should be scheduled to one of the provided clusters based on its required capacity and each cluster's available capacity.

Proposed Solution

Enable a multicluster fleet to manage multiple clusters and select the appropriate cluster based on the workload's required capacity and each cluster's available capacity.
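
To make the selection step concrete, here is a minimal sketch of capacity-based cluster selection. It is not an existing API of this project or of any fleet manager: the `Cluster` type, its fields, and `select_cluster` are hypothetical names, and the best-fit policy (pick the cluster with the least free GPU capacity that still fits the request) is only one possible assumption about what "appropriate" could mean.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Cluster:
    """Hypothetical view of a member cluster; fields are assumptions."""
    name: str
    region: str
    available_gpus: int  # GPUs currently free in this cluster


def select_cluster(
    clusters: Sequence[Cluster], required_gpus: int
) -> Optional[Cluster]:
    """Return a cluster that can fit the workload, or None if none can.

    Assumed policy: best fit. Among clusters with enough free GPUs,
    choose the one with the least free capacity, so larger clusters
    stay available for larger workloads. Ties are broken by name to
    keep the result deterministic.
    """
    candidates = [c for c in clusters if c.available_gpus >= required_gpus]
    if not candidates:
        return None
    return min(candidates, key=lambda c: (c.available_gpus, c.name))


if __name__ == "__main__":
    fleet = [
        Cluster("cluster-a", "eastus", 2),
        Cluster("cluster-b", "westus", 8),
        Cluster("cluster-c", "northeurope", 4),
    ]
    chosen = select_cluster(fleet, required_gpus=4)
    print(chosen.name if chosen else "no cluster has enough capacity")
```

A real implementation would likely read available capacity from the fleet's member clusters and might also weigh region, cost, or GPU SKU; those inputs are out of scope for this sketch.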

Alternatives Considered

No response

Feature Area

Deployments / Model Management

How important is this feature to you?

Nice to have

Mockups or Examples

No response

Additional Context

No response

Metadata

Assignees

No one assigned

    Labels

    enhancement: New feature or request
