[fix] improve Sglang kt-kernel detect time duration (#1887)

* Increase the timeout for the check that --kt-gpu-prefill-token-threshold appears in the help output from 30 to 90 seconds. In cloud environments, CUDA initialization and Python module loading can easily exceed 30 seconds.
* Update kt-kernel/python/cli/utils/sglang_checker.py: add a comment about the change.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
2026-04-28 20:00:06 +00:00 · 2026-03-18 23:07:40 +08:00 · 2026-03-18 23:07:40 +08:00 · 8561a71dd1
commit 8561a71dd1
parent 7a4b9b0e87
1 changed file with 1 addition and 1 deletion
--- a/kt-kernel/python/cli/utils/sglang_checker.py
+++ b/kt-kernel/python/cli/utils/sglang_checker.py
@@ -324,7 +324,7 @@ def check_sglang_kt_kernel_support(use_cache: bool = True, silent: bool = False)
            [sys.executable, "-m", "sglang.launch_server", "--help"],
            capture_output=True,
            text=True,
-            timeout=30,
+            timeout=90,  # Increased for slow CUDA init and module loading in some environments
        )

        help_output = result.stdout + result.stderr
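
For context, the pattern this hunk adjusts can be sketched as a standalone helper. This is a minimal illustration, not the code in sglang_checker.py: the function name help_output_mentions is hypothetical, and treating a timeout as "flag not found" is an assumption, since the diff does not show how the real checker handles subprocess.TimeoutExpired.

```python
import subprocess
import sys


def help_output_mentions(cmd, flag, timeout=90):
    """Return True if `flag` appears in the command's stdout or stderr.

    Hypothetical helper for illustration only. A timeout or a failure to
    start the process is reported as False (an assumed policy).
    """
    try:
        result = subprocess.run(
            cmd,
            capture_output=True,
            text=True,
            timeout=timeout,  # seconds; the child is killed once this elapses
        )
    except (subprocess.TimeoutExpired, OSError):
        return False
    # Help text may be written to either stream, so search both.
    return flag in (result.stdout + result.stderr)
```

Under these assumptions, a call such as help_output_mentions([sys.executable, "-m", "sglang.launch_server", "--help"], "--kt-gpu-prefill-token-threshold") would mirror the check in the diff. A timeout that is too short looks the same as missing support here, which may be why the commit raises the limit rather than changing the logic.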