Lizonghang
|
a5ba34169a
|
add f32, f16, q4k_f32, q6k_f32 flops test and fix duplicate inp_embd in subgraphs
|
2024-11-23 21:36:34 +04:00 |
|
Zonghang Li
|
7ee1423006
|
add model_flops
|
2024-11-21 20:06:16 +04:00 |
|
Zonghang Li
|
80f6b72e71
|
remove device_flops from profiler api
|
2024-11-21 08:37:57 +04:00 |
|
Lizonghang
|
10f6f92c7e
|
add f32, f16, q8, q4k speed test for cuda
|
2024-11-10 23:41:13 +04:00 |
|
Lizonghang
|
f4260bb346
|
add device_flops() for cpu, metal, and cuda
|
2024-11-10 23:11:05 +04:00 |
|
Lizonghang
|
5fae6ac36f
|
add cpu flops test
|
2024-11-09 20:53:42 +04:00 |
|
Lizonghang
|
2bd4d03aa8
|
add automatic layer window size assignment workflow
|
2024-11-08 18:21:03 +04:00 |
|
Lizonghang
|
53cb3a6069
|
synchronize device info
|
2024-11-07 22:02:01 +04:00 |
|
Lizonghang
|
ef7fdf70cc
|
add LLAMA_API llama_profile_device
|
2024-11-07 09:30:39 +04:00 |
|
Lizonghang
|
407c71ae52
|
add cpu and gpu profile
|
2024-11-06 20:42:28 +04:00 |
|
Lizonghang
|
4e1be1065d
|
add memory speed test
|
2024-11-06 10:57:30 +04:00 |
|
Zonghang Li
|
9a03b52785
|
fix device get name on linux
|
2024-11-05 22:07:09 +04:00 |
|
Lizonghang
|
a7f3d917a1
|
add device get name
|
2024-11-05 22:04:14 +04:00 |
|
Zonghang Li
|
6657885816
|
fix swap detect on linux
|
2024-11-05 21:57:09 +04:00 |
|
Lizonghang
|
2d447266e9
|
add swap capacity test
|
2024-11-05 21:42:45 +04:00 |
|
Lizonghang
|
9eed6b14bf
|
add disk read speed test
|
2024-11-05 21:12:02 +04:00 |
|
Lizonghang
|
9cd66f2145
|
add profiler
|
2024-11-05 20:29:09 +04:00 |
|
Lizonghang
|
766ec7862b
|
test
|
2024-11-05 17:22:24 +04:00 |
|