Error: Qwen 2.5 Coder 32b - 400 #10020

@gary-guchm

Description

Error Details

Model: Qwen 2.5 Coder 32b
Provider: siliconflow
Status Code: 400

Error Output

"max_total_tokens (33298) must be less than or equal to max_seq_len (32768)"
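The message says the request asked for 530 more tokens than the 32768-token window allows (33298 - 32768 = 530). A minimal sketch of a client-side guard follows, assuming `max_total_tokens` is the sum of the prompt tokens and the requested output budget. The issue does not confirm that breakdown. The helper `clamp_max_tokens` and the 8192-token output request are hypothetical, chosen only to illustrate the arithmetic.

```python
def clamp_max_tokens(prompt_tokens: int, requested_max_tokens: int,
                     max_seq_len: int = 32768) -> int:
    """Return the largest output budget that keeps prompt + output within max_seq_len.

    Hypothetical helper: assumes the provider counts
    max_total_tokens = prompt_tokens + requested_max_tokens.
    """
    if prompt_tokens < 0 or requested_max_tokens < 0:
        raise ValueError("token counts must be non-negative")
    available = max_seq_len - prompt_tokens
    if available <= 0:
        # The prompt alone fills the window; it has to be shortened first.
        raise ValueError(
            f"prompt ({prompt_tokens} tokens) leaves no room within "
            f"max_seq_len ({max_seq_len})"
        )
    return min(requested_max_tokens, available)


# Numbers taken from the error message.
MAX_SEQ_LEN = 32768
MAX_TOTAL_TOKENS = 33298
overshoot = MAX_TOTAL_TOKENS - MAX_SEQ_LEN

# Hypothetical split of the 33298 total: an 8192-token output request,
# which would imply a 25106-token prompt.
requested = 8192
prompt = MAX_TOTAL_TOKENS - requested
safe = clamp_max_tokens(prompt, requested, MAX_SEQ_LEN)

print(overshoot, safe)  # prints "530 7662"
```

Under that assumed split, lowering the output budget from 8192 to 7662 would bring the total to exactly 32768. If the prompt itself is the oversized part, trimming the context is the only fix.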


Metadata

Assignees: No one assigned
Labels: kind:bug (Indicates an unexpected problem or unintended behavior)
Status: Todo