kvcache-ai-ktransformers/ktransformers/tests
2025-04-22 12:50:39 +00:00
..
AIME_2024 update compile option for avx512vpopcntdq 2025-03-06 12:18:04 +08:00
humaneval fix flashinfer precision 2025-03-07 14:07:00 +00:00
.gitignore release v0.2.3 2025-03-05 20:21:04 +08:00
dequant_gpu.py [fix] format classes and files name 2024-08-15 10:44:59 +08:00
dequant_gpu_t.py [fix] format classes and files name 2024-08-15 10:44:59 +08:00
function_call_test.py roll back ktransformers backend, add max_tokens, max_completion_tokens param 2025-04-21 12:55:37 +00:00
mmlu_pro_test.py add humaneval support 2025-03-04 20:54:49 +08:00
mmlu_test.py change test 2025-04-22 12:50:39 +00:00
mmlu_test_multi.py change test 2025-04-22 12:50:39 +00:00
score.py fix params 2025-03-20 18:48:51 +08:00
test_client.py change test 2025-04-22 12:50:39 +00:00
test_pytorch_q8.py merge main; Add torch q8 linear 2025-03-14 05:52:07 -04:00
test_speed.py change test 2025-04-22 12:50:39 +00:00
triton_fp8gemm_test.py Add data loader to read special weights for fp8; Add special weight process script 2025-02-24 11:34:17 +00:00