Mirror of https://github.com/kvcache-ai/ktransformers.git, synced 2026-04-28 11:49:51 +00:00.
V4-Flash routed experts ship as native MXFP4 (E2M1 nibble + ue8m0 group
scale). Expose AMXFP4_KGroup_MOE through NativeMoEWrapper, add a loader
that handles V4's `layers.{L}.ffn.experts.{i}.{w1,w3,w2}.{weight,scale}`
naming and converts ue8m0 → bf16 via a lossless bit-cast, register the
model entry, and ship an end-to-end numerical validation script.
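A minimal sketch of the decode path described above, assuming the OCP MX definition of E2M1 (1 sign, 2 exponent, 1 mantissa bit; magnitudes 0, 0.5, 1, 1.5, 2, 3, 4, 6) and ue8m0 scales of the form 2**(e − 127). Function names, the nibble packing order (low nibble first), and the group size are illustrative assumptions, not taken from the repo:

```python
import numpy as np

# E2M1 magnitude table per the OCP MX spec (index = 3-bit magnitude field).
_E2M1_MAG = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0], dtype=np.float32)


def ue8m0_to_bf16_bits(scale_u8: np.ndarray) -> np.ndarray:
    """Lossless bit-cast of ue8m0 (value = 2**(e - 127)) to bf16 bit patterns.

    A ue8m0 byte is exactly a bf16 exponent field, so shifting it into
    bits [14:7] (sign = 0, mantissa = 0) reproduces the same power of two
    bit-exactly -- no rounding, hence "lossless".
    """
    return scale_u8.astype(np.uint16) << 7


def dequant_mxfp4(packed: np.ndarray, scale_u8: np.ndarray, group: int = 32) -> np.ndarray:
    """Unpack E2M1 nibbles and apply per-group ue8m0 scales.

    Uses float32 for clarity; a real kernel would stay in bf16.
    Assumes the low nibble of each byte is the earlier element.
    """
    lo, hi = packed & 0xF, packed >> 4
    nibbles = np.stack([lo, hi], axis=-1).reshape(-1)      # two values per byte
    vals = np.where(nibbles & 0x8, -1.0, 1.0) * _E2M1_MAG[nibbles & 0x7]
    scales = np.exp2(scale_u8.astype(np.float32) - 127.0)  # one scale per group
    return (vals.reshape(-1, group) * scales[:, None]).reshape(-1)
```

The bit-cast is why the conversion costs nothing numerically: every representable ue8m0 value is a power of two, and every power of two in that range is exactly representable in bf16.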
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
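The loader's handling of V4's tensor naming could be sketched as a single regex over checkpoint keys; the pattern and helper below are hypothetical illustrations of the `layers.{L}.ffn.experts.{i}.{w1,w3,w2}.{weight,scale}` scheme, not the repo's actual loader code:

```python
import re

# Matches V4-Flash routed-expert tensors, e.g.
# "layers.3.ffn.experts.17.w1.weight" or "layers.3.ffn.experts.17.w2.scale".
_EXPERT_RE = re.compile(
    r"layers\.(?P<layer>\d+)\.ffn\.experts\.(?P<expert>\d+)"
    r"\.(?P<proj>w[123])\.(?P<kind>weight|scale)$"
)


def parse_expert_tensor(name: str):
    """Return (layer, expert, proj, kind) for an expert tensor, else None."""
    m = _EXPERT_RE.match(name)
    if m is None:
        return None
    return int(m["layer"]), int(m["expert"]), m["proj"], m["kind"]
```

Keys that don't match (attention weights, router gates, norms) fall through to the regular loading path, so the MXFP4 conversion only touches expert tensors.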
| Name |
|---|
| __init__.py |
| analyze_moe_model.py |
| console.py |
| debug_configs.py |
| download_helper.py |
| environment.py |
| input_validators.py |
| kv_cache_calculator.py |
| model_discovery.py |
| model_registry.py |
| model_scanner.py |
| model_table_builder.py |
| model_verifier.py |
| port_checker.py |
| quant_interactive.py |
| repo_detector.py |
| run_configs.py |
| run_interactive.py |
| sglang_checker.py |
| tuna_engine.py |
| user_model_registry.py |