kvcache-ai-ktransformers/kt-kernel/python
ErvinXie 3903c9afcc
(kt-kernel): add numa_nodes parameter for explicit NUMA node mapping (#1891)
Add numa_nodes parameter to BaseMoEWrapper and all subclasses, allowing
users to explicitly specify which NUMA node IDs to use for subpool
mapping instead of always defaulting to sequential [0, 1, ..., N-1].

This enables running multiple KTransformers instances on different NUMA
nodes of the same machine, e.g. --kt-threadpool-count 1 --kt-numa-nodes 1
to bind to NUMA node 1. Previously this required external numactl
workarounds since subpool_numa_map was hardcoded to start from 0.
2026-03-31 10:27:50 +08:00
..
cli [fix] improve Sglang kt-kernel detect time duration (#1887) 2026-03-18 23:07:40 +08:00
utils (kt-kernel): add numa_nodes parameter for explicit NUMA node mapping (#1891) 2026-03-31 10:27:50 +08:00
__init__.py [feat](kt-kernel): CPU-GPU experts sched (#1796) 2026-01-16 17:01:15 +08:00
_cpu_detect.py [feat](kt-kernel): Fix CPU instruction set variants for build & install (#1746) 2025-12-24 18:57:45 +08:00
experts.py (kt-kernel): add numa_nodes parameter for explicit NUMA node mapping (#1891) 2026-03-31 10:27:50 +08:00
experts_base.py (kt-kernel): add numa_nodes parameter for explicit NUMA node mapping (#1891) 2026-03-31 10:27:50 +08:00