mirror of
https://github.com/kvcache-ai/ktransformers.git
synced 2025-09-10 15:29:39 +00:00
The numa node location would be calculated based on the total number of worker threads. So we should always use the actual number of threads instead of using a min() op. |
||
---|---|---|
.. | ||
bench | ||
cmake | ||
cpu_backend | ||
cuda | ||
examples | ||
operators | ||
triton | ||
CMakeLists.txt | ||
ext_bindings.cpp |