[Feature] Add flag to disable FlashInfer autotune #31875
Conversation
Signed-off-by: Mohammad Miadh Angkad <[email protected]>
Code Review
This pull request introduces a feature flag to disable FlashInfer autotuning, which is a valuable addition for users prioritizing faster startup times. The implementation is clean and correctly propagates the new configuration from the command-line interface to the core logic. I have one suggestion regarding the robustness of accessing this new configuration parameter in vllm/model_executor/warmup/kernel_warmup.py to prevent potential silent failures in the future.
```python
disable_flashinfer_autotune = getattr(
    worker.vllm_config, "disable_flashinfer_autotune", False
)
```
Since disable_flashinfer_autotune is now a defined attribute in the VllmConfig dataclass, it will always be present. Using getattr with a default value can mask potential configuration issues. For example, if the attribute name is misspelled or refactored in the future, this code would silently default to False instead of raising an AttributeError, making debugging more difficult. Direct attribute access is safer and clearer in this case.
```python
disable_flashinfer_autotune = worker.vllm_config.disable_flashinfer_autotune
```
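To illustrate the failure mode described above, here is a minimal, self-contained sketch (using a stand-in `VllmConfigStub` dataclass, not the real `VllmConfig`) showing how `getattr` with a default silently swallows a misspelled attribute name, while direct access raises immediately:

```python
from dataclasses import dataclass


@dataclass
class VllmConfigStub:
    # Stand-in for the real VllmConfig field added by this PR.
    disable_flashinfer_autotune: bool = True


cfg = VllmConfigStub()

# Note the missing final "e" in the attribute name: getattr hides the
# typo and quietly returns the default instead of the configured value.
value = getattr(cfg, "disable_flashinfer_autotun", False)
print(value)  # False, even though the real field is True

# Direct attribute access surfaces the same mistake as an error.
try:
    cfg.disable_flashinfer_autotun
except AttributeError as exc:
    print("caught:", exc)
```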
Purpose
FlashInfer autotuning can take a long time to complete during engine initialization. This PR introduces a flag to disable it, letting users trade the autotuning optimization for faster startup.
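The gating logic can be sketched as follows. This is a simplified, runnable illustration of the behavior described in the PR, not the actual code in `vllm/model_executor/warmup/kernel_warmup.py`; the `Worker` and `VllmConfig` classes here are minimal stand-ins, and the field name `disable_flashinfer_autotune` is taken from the diff:

```python
from dataclasses import dataclass


@dataclass
class VllmConfig:
    # Minimal stand-in: only the field relevant to this PR.
    disable_flashinfer_autotune: bool = False


@dataclass
class Worker:
    vllm_config: VllmConfig


def flashinfer_autotune_warmup(worker: Worker) -> str:
    # Skip the (potentially slow) autotune pass when the flag is set,
    # accepting unoptimized kernels in exchange for faster startup.
    if worker.vllm_config.disable_flashinfer_autotune:
        return "skipped"
    # ... FlashInfer autotuning would run here ...
    return "autotuned"


print(flashinfer_autotune_warmup(Worker(VllmConfig(disable_flashinfer_autotune=True))))
print(flashinfer_autotune_warmup(Worker(VllmConfig())))
```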
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.