vrr/kvcache-ai-ktransformers
Mirror of https://github.com/kvcache-ai/ktransformers.git, synced 2025-09-05 20:19:51 +00:00
kvcache-ai-ktransformers/ktransformers/ktransformers_ext/examples @ ca7366d2db
Latest commit: 4d1d561d28 by chenxl, "[feature] release 0.1.3", 2024-08-28 16:11:43 +00:00
File | Last commit message | Date
test_attention.py | [feature] release 0.1.3 | 2024-08-28 16:11:43 +00:00
test_linear.py | 1) Linear and MLP operators support qlen>1; 2) All operators now share a single memory buffer; 3) Refactor CPUInfer submit/sync logic. | 2024-08-08 09:04:36 +00:00
test_mlp.py | 1) Linear and MLP operators support qlen>1; 2) All operators now share a single memory buffer; 3) Refactor CPUInfer submit/sync logic. | 2024-08-08 09:04:36 +00:00
test_moe.py | 1) Linear and MLP operators support qlen>1; 2) All operators now share a single memory buffer; 3) Refactor CPUInfer submit/sync logic. | 2024-08-08 09:04:36 +00:00