
Conversation

@realliujiaxu
Collaborator

What this PR does / why we need it?

Fixes greedy temperature detection by porting the fix from vllm-project/vllm#27077.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Signed-off-by: realliujiaxu <[email protected]>
Contributor

@gemini-code-assist bot left a comment

Code Review

This pull request addresses a bug in greedy sampling detection by changing the GREEDY_TEMPERATURE constant from -1 to 0. A temperature of 0 is the standard for greedy decoding, ensuring the model selects the highest probability token. This change aligns the implementation with common practice and the project's own unit tests, and is a correct and necessary fix for speculative decoding functionality.
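For illustration, here is a minimal sketch (not the actual vllm-ascend source) of how greedy detection built on such a constant might look. The `is_greedy` helper and the example values are assumptions; only the change of `GREEDY_TEMPERATURE` from -1 to 0 comes from this PR.

```python
import torch

# Constant mirroring the change described above: greedy sampling is
# signalled by a temperature of 0 rather than the previous sentinel of -1.
GREEDY_TEMPERATURE = 0.0


def is_greedy(temperature: torch.Tensor) -> torch.Tensor:
    """Return a boolean mask marking requests that use greedy decoding.

    With GREEDY_TEMPERATURE = 0.0 this follows the common convention that
    temperature == 0 means "always pick the argmax token", which is what
    speculative-decoding verification relies on.
    """
    return temperature == GREEDY_TEMPERATURE


if __name__ == "__main__":
    # Hypothetical per-request temperatures for a batch of four requests.
    temps = torch.tensor([0.0, 0.7, 1.0, 0.0])
    print(is_greedy(temps))  # tensor([ True, False, False,  True])
```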

@realliujiaxu marked this pull request as draft on December 27, 2025 at 04:51