| model | size | params | backend | threads | test | t/s |
| ------------------------------ | ---------: | ---------: | ---------- | ------: | ------------: | -------------------: |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 1 | pp32 | 30.90 ± 0.61 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 1 | pp64 | 31.02 ± 0.45 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 1 | pp128 | 30.57 ± 0.11 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 1 | tg32 | 9.44 ± 0.01 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 2 | pp32 | 61.34 ± 0.91 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 2 | pp64 | 62.83 ± 0.01 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 2 | pp128 | 61.99 ± 0.04 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 2 | tg32 | 17.86 ± 0.01 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 3 | pp32 | 90.36 ± 1.40 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 3 | pp64 | 92.84 ± 0.04 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 3 | pp128 | 91.63 ± 0.01 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 3 | tg32 | 22.50 ± 0.05 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 4 | pp32 | 116.23 ± 0.02 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 4 | pp64 | 118.64 ± 0.17 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 4 | pp128 | 117.15 ± 0.15 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 4 | tg32 | 23.16 ± 0.03 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 1 | pp32 | 26.66 ± 0.11 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 1 | pp64 | 26.72 ± 0.08 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 1 | pp128 | 26.56 ± 0.01 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 1 | tg32 | 8.23 ± 0.14 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 2 | pp32 | 54.21 ± 0.01 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 2 | pp64 | 54.60 ± 0.02 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 2 | pp128 | 53.84 ± 0.00 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 2 | tg32 | 15.97 ± 0.08 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 3 | pp32 | 80.30 ± 0.07 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 3 | pp64 | 80.96 ± 0.06 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 3 | pp128 | 79.27 ± 0.02 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 3 | tg32 | 22.13 ± 0.13 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 4 | pp32 | 104.02 ± 0.23 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 4 | pp64 | 105.38 ± 0.04 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 4 | pp128 | 103.86 ± 0.04 |
| llama 1B IQ4_NL_4_4 - 4.5 bpw | 727.75 MiB | 1.24 B | CPU | 4 | tg32 | 22.92 ± 0.04 |

build: 32e0862a (4037)

| model | size | params | backend | threads | test | t/s |
| ------------------------------ | ---------: | ---------: | ---------- | ------: | ------------: | -------------------: |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 1 | pp32 | 11.52 ± 0.00 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 1 | pp64 | 11.86 ± 0.00 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 1 | pp128 | 12.11 ± 0.00 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 1 | tg32 | 5.81 ± 0.02 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 2 | pp32 | 22.85 ± 0.05 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 2 | pp64 | 23.45 ± 0.00 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 2 | pp128 | 23.91 ± 0.00 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 2 | tg32 | 11.15 ± 0.00 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 3 | pp32 | 33.93 ± 0.12 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 3 | pp64 | 34.54 ± 0.02 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 3 | pp128 | 35.39 ± 0.01 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 3 | tg32 | 16.00 ± 0.12 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 4 | pp32 | 43.46 ± 0.31 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 4 | pp64 | 44.95 ± 0.07 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 4 | pp128 | 46.19 ± 0.00 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 4 | tg32 | 20.43 ± 0.05 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 1 | pp32 | 8.70 ± 0.00 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 1 | pp64 | 8.72 ± 0.00 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 1 | pp128 | 8.68 ± 0.00 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 1 | tg32 | 5.81 ± 0.02 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 2 | pp32 | 17.42 ± 0.01 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 2 | pp64 | 17.52 ± 0.00 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 2 | pp128 | 17.41 ± 0.00 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 2 | tg32 | 11.15 ± 0.05 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 3 | pp32 | 25.68 ± 0.03 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 3 | pp64 | 25.95 ± 0.01 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 3 | pp128 | 25.83 ± 0.00 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 3 | tg32 | 16.09 ± 0.02 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 4 | pp32 | 33.72 ± 0.05 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 4 | pp64 | 34.11 ± 0.02 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 4 | pp128 | 33.92 ± 0.02 |
| llama 1B IQ4_NL - 4.5 bpw | 733.75 MiB | 1.24 B | CPU | 4 | tg32 | 20.17 ± 0.34 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 1 | pp32 | 36.64 ± 0.18 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 1 | pp64 | 36.66 ± 0.08 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 1 | pp128 | 36.46 ± 0.01 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 1 | tg32 | 9.60 ± 0.10 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 2 | pp32 | 71.99 ± 3.23 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 2 | pp64 | 72.51 ± 2.17 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 2 | pp128 | 70.95 ± 0.81 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 2 | tg32 | 18.14 ± 0.06 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 3 | pp32 | 110.57 ± 3.03 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 3 | pp64 | 114.50 ± 0.04 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 3 | pp128 | 112.56 ± 0.02 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 3 | tg32 | 22.41 ± 0.07 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 4 | pp32 | 146.28 ± 0.74 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 4 | pp64 | 149.91 ± 0.04 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 4 | pp128 | 146.72 ± 0.05 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 4 | tg32 | 22.43 ± 1.04 |

build: a9e8a9a0 (4033)
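
To compare quantization variants at matching thread counts and tests, the rows above can be parsed and turned into throughput ratios. The following is a minimal sketch, not part of llama.cpp: `parse_rows` and `speedup` are hypothetical helper names, the parser assumes the seven-column layout shown in the tables, and `SAMPLE` holds four rows copied from the build 4033 table.

```python
# Hypothetical helpers for reading llama-bench markdown rows; not a llama.cpp API.

SAMPLE = """\
| model | size | params | backend | threads | test | t/s |
| ------------------------------ | ---------: | ---------: | ---------- | ------: | ------------: | -------------------: |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 4 | pp128 | 46.19 ± 0.00 |
| llama 1B Q4_0 | 727.75 MiB | 1.24 B | CPU | 4 | tg32 | 20.43 ± 0.05 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 4 | pp128 | 146.72 ± 0.05 |
| llama 1B Q4_0_4_4 | 727.75 MiB | 1.24 B | CPU | 4 | tg32 | 22.43 ± 1.04 |
"""


def parse_rows(text):
    """Parse table rows into dicts; tolerates a leading '- ' list marker."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if line.startswith("- "):
            line = line[2:]
        if not line.startswith("|"):
            continue  # e.g. 'build: ...' lines
        cells = [c.strip() for c in line.strip("|").split("|")]
        if len(cells) != 7:
            continue
        # Skip the header row and the '---' separator row.
        if cells[0] == "model" or set(cells[0]) <= set("-: "):
            continue
        mean, _, std = cells[6].partition("±")
        rows.append({
            "model": cells[0],
            "threads": int(cells[4]),
            "test": cells[5],
            "tps": float(mean),
            "std": float(std),
        })
    return rows


def speedup(rows, baseline, candidate):
    """Ratio of candidate t/s to baseline t/s for each shared (threads, test)."""
    base = {(r["threads"], r["test"]): r["tps"]
            for r in rows if r["model"] == baseline}
    return {(r["threads"], r["test"]): r["tps"] / base[(r["threads"], r["test"])]
            for r in rows
            if r["model"] == candidate and (r["threads"], r["test"]) in base}


if __name__ == "__main__":
    ratios = speedup(parse_rows(SAMPLE), "llama 1B Q4_0", "llama 1B Q4_0_4_4")
    for (threads, test), ratio in sorted(ratios.items()):
        print(f"{threads} threads, {test}: {ratio:.2f}x")
```

Ratios are only meaningful between rows from the same build and machine, so comparing across the two tables would need the same care as any cross-run comparison.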