- slot release: id 50 | task 27194 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 50 | task 27194 |
- prompt eval time = 2243.34 ms / 199 tokens ( 11.27 ms per token, 88.71 tokens per second)
- eval time = 8538.73 ms / 60 tokens ( 142.31 ms per token, 7.03 tokens per second)
- total time = 10782.06 ms / 259 tokens
- slot release: id 57 | task 27198 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 57 | task 27198 |
- prompt eval time = 1037.84 ms / 199 tokens ( 5.22 ms per token, 191.74 tokens per second)
- eval time = 7501.38 ms / 59 tokens ( 127.14 ms per token, 7.87 tokens per second)
- total time = 8539.22 ms / 258 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot launch_slot_: id 36 | task 27382 | processing task
- slot launch_slot_: id 40 | task 27383 | processing task
- slot launch_slot_: id 43 | task 26653 | processing task
- slot launch_slot_: id 46 | task 26654 | processing task
- slot launch_slot_: id 50 | task 26655 | processing task
- slot launch_slot_: id 57 | task 26656 | processing task
- slot update_slots: id 36 | task 27382 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 36 | task 27382 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 36 | task 27382 | kv cache rm [0, end)
- slot update_slots: id 36 | task 27382 | prompt processing progress, n_past = 199, n_tokens = 257, progress = 1.000000
- slot update_slots: id 36 | task 27382 | prompt done, n_past = 199, n_tokens = 257
- slot update_slots: id 40 | task 27383 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 40 | task 27383 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 40 | task 27383 | kv cache rm [0, end)
- slot update_slots: id 40 | task 27383 | prompt processing progress, n_past = 199, n_tokens = 456, progress = 1.000000
- slot update_slots: id 40 | task 27383 | prompt done, n_past = 199, n_tokens = 456
- slot update_slots: id 43 | task 26653 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 43 | task 26653 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 43 | task 26653 | kv cache rm [0, end)
- slot update_slots: id 43 | task 26653 | prompt processing progress, n_past = 199, n_tokens = 655, progress = 1.000000
- slot update_slots: id 43 | task 26653 | prompt done, n_past = 199, n_tokens = 655
- slot update_slots: id 46 | task 26654 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 46 | task 26654 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 46 | task 26654 | kv cache rm [0, end)
- slot update_slots: id 46 | task 26654 | prompt processing progress, n_past = 199, n_tokens = 854, progress = 1.000000
- slot update_slots: id 46 | task 26654 | prompt done, n_past = 199, n_tokens = 854
- slot update_slots: id 50 | task 26655 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 50 | task 26655 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 50 | task 26655 | kv cache rm [0, end)
- slot update_slots: id 50 | task 26655 | prompt processing progress, n_past = 199, n_tokens = 1053, progress = 1.000000
- slot update_slots: id 50 | task 26655 | prompt done, n_past = 199, n_tokens = 1053
- slot update_slots: id 57 | task 26656 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 57 | task 26656 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 57 | task 26656 | kv cache rm [0, end)
- slot update_slots: id 57 | task 26656 | prompt processing progress, n_past = 199, n_tokens = 1252, progress = 1.000000
- slot update_slots: id 57 | task 26656 | prompt done, n_past = 199, n_tokens = 1252
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27192
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- slot release: id 31 | task 26486 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 31 | task 26486 |
- prompt eval time = 1696.60 ms / 199 tokens ( 8.53 ms per token, 117.29 tokens per second)
- eval time = 24149.39 ms / 123 tokens ( 196.34 ms per token, 5.09 tokens per second)
- total time = 25845.99 ms / 322 tokens
- slot release: id 51 | task 26611 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 51 | task 26611 |
- prompt eval time = 3280.27 ms / 199 tokens ( 16.48 ms per token, 60.67 tokens per second)
- eval time = 8780.20 ms / 60 tokens ( 146.34 ms per token, 6.83 tokens per second)
- total time = 12060.47 ms / 259 tokens
- slot release: id 56 | task 27196 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 56 | task 27196 |
- prompt eval time = 1036.79 ms / 199 tokens ( 5.21 ms per token, 191.94 tokens per second)
- eval time = 8780.95 ms / 60 tokens ( 146.35 ms per token, 6.83 tokens per second)
- total time = 9817.73 ms / 259 tokens
- slot release: id 60 | task 26612 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 60 | task 26612 |
- prompt eval time = 1038.43 ms / 199 tokens ( 5.22 ms per token, 191.64 tokens per second)
- eval time = 8779.57 ms / 60 tokens ( 146.33 ms per token, 6.83 tokens per second)
- total time = 9818.00 ms / 259 tokens
- slot release: id 61 | task 26613 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 61 | task 26613 |
- prompt eval time = 1038.51 ms / 199 tokens ( 5.22 ms per token, 191.62 tokens per second)
- eval time = 8779.53 ms / 60 tokens ( 146.33 ms per token, 6.83 tokens per second)
- total time = 9818.05 ms / 259 tokens
- slot release: id 63 | task 26619 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 63 | task 26619 |
- prompt eval time = 1038.68 ms / 199 tokens ( 5.22 ms per token, 191.59 tokens per second)
- eval time = 8779.40 ms / 60 tokens ( 146.32 ms per token, 6.83 tokens per second)
- total time = 9818.08 ms / 259 tokens
- slot release: id 49 | task 27192 | stop processing: n_past = 132, truncated = 1
- slot launch_slot_: id 31 | task 27386 | processing task
- slot launch_slot_: id 51 | task 27385 | processing task
- slot launch_slot_: id 56 | task 27388 | processing task
- slot launch_slot_: id 60 | task 26660 | processing task
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot launch_slot_: id 61 | task 26661 | processing task
- slot launch_slot_: id 63 | task 26663 | processing task
- slot launch_slot_: id 49 | task 26665 | processing task
- slot update_slots: id 31 | task 27386 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 31 | task 27386 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 31 | task 27386 | kv cache rm [0, end)
- slot update_slots: id 31 | task 27386 | prompt processing progress, n_past = 199, n_tokens = 256, progress = 1.000000
- slot update_slots: id 31 | task 27386 | prompt done, n_past = 199, n_tokens = 256
- slot update_slots: id 49 | task 26665 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 49 | task 26665 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 49 | task 26665 | kv cache rm [0, end)
- slot update_slots: id 49 | task 26665 | prompt processing progress, n_past = 199, n_tokens = 455, progress = 1.000000
- slot update_slots: id 49 | task 26665 | prompt done, n_past = 199, n_tokens = 455
- slot update_slots: id 51 | task 27385 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 51 | task 27385 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 51 | task 27385 | kv cache rm [0, end)
- slot update_slots: id 51 | task 27385 | prompt processing progress, n_past = 199, n_tokens = 654, progress = 1.000000
- slot update_slots: id 51 | task 27385 | prompt done, n_past = 199, n_tokens = 654
- slot update_slots: id 56 | task 27388 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 56 | task 27388 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 56 | task 27388 | kv cache rm [0, end)
- slot update_slots: id 56 | task 27388 | prompt processing progress, n_past = 199, n_tokens = 853, progress = 1.000000
- slot update_slots: id 56 | task 27388 | prompt done, n_past = 199, n_tokens = 853
- slot update_slots: id 60 | task 26660 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 60 | task 26660 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 60 | task 26660 | kv cache rm [0, end)
- slot update_slots: id 60 | task 26660 | prompt processing progress, n_past = 199, n_tokens = 1052, progress = 1.000000
- slot update_slots: id 60 | task 26660 | prompt done, n_past = 199, n_tokens = 1052
- slot update_slots: id 61 | task 26661 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 61 | task 26661 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 61 | task 26661 | kv cache rm [0, end)
- slot update_slots: id 61 | task 26661 | prompt processing progress, n_past = 199, n_tokens = 1251, progress = 1.000000
- slot update_slots: id 61 | task 26661 | prompt done, n_past = 199, n_tokens = 1251
- slot update_slots: id 63 | task 26663 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 63 | task 26663 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 63 | task 26663 | kv cache rm [0, end)
- slot update_slots: id 63 | task 26663 | prompt processing progress, n_past = 199, n_tokens = 1450, progress = 1.000000
- slot update_slots: id 63 | task 26663 | prompt done, n_past = 199, n_tokens = 1450
- srv params_from_: Chat format: Content-only
- slot release: id 52 | task 26528 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 52 | task 26528 |
- prompt eval time = 2344.41 ms / 1 tokens ( 2344.41 ms per token, 0.43 tokens per second)
- eval time = 21578.66 ms / 123 tokens ( 175.44 ms per token, 5.70 tokens per second)
- total time = 23923.07 ms / 124 tokens
- slot launch_slot_: id 52 | task 27390 | processing task
- slot update_slots: id 52 | task 27390 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 52 | task 27390 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 52 | task 27390 | kv cache rm [0, end)
- slot update_slots: id 52 | task 27390 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 52 | task 27390 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 58 | task 26479 | stop processing: n_past = 195, truncated = 1
- slot print_timing: id 58 | task 26479 |
- prompt eval time = 2344.51 ms / 199 tokens ( 11.78 ms per token, 84.88 tokens per second)
- eval time = 22039.12 ms / 124 tokens ( 177.73 ms per token, 5.63 tokens per second)
- total time = 24383.62 ms / 323 tokens
- slot launch_slot_: id 58 | task 26677 | processing task
- slot update_slots: id 58 | task 26677 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 58 | task 26677 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 58 | task 26677 | kv cache rm [0, end)
- slot update_slots: id 58 | task 26677 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 58 | task 26677 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 20 | task 26497 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 20 | task 26497 |
- prompt eval time = 1696.32 ms / 1 tokens ( 1696.32 ms per token, 0.59 tokens per second)
- eval time = 26303.38 ms / 132 tokens ( 199.27 ms per token, 5.02 tokens per second)
- total time = 27999.69 ms / 133 tokens
- slot release: id 32 | task 26505 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 32 | task 26505 |
- prompt eval time = 1696.63 ms / 1 tokens ( 1696.63 ms per token, 0.59 tokens per second)
- eval time = 26305.10 ms / 132 tokens ( 199.28 ms per token, 5.02 tokens per second)
- total time = 28001.73 ms / 133 tokens
- slot release: id 47 | task 26526 | stop processing: n_past = 202, truncated = 1
- slot print_timing: id 47 | task 26526 |
- prompt eval time = 5188.07 ms / 199 tokens ( 26.07 ms per token, 38.36 tokens per second)
- eval time = 22815.74 ms / 131 tokens ( 174.17 ms per token, 5.74 tokens per second)
- total time = 28003.81 ms / 330 tokens
- slot release: id 53 | task 26536 | stop processing: n_past = 202, truncated = 1
- slot print_timing: id 53 | task 26536 |
- prompt eval time = 2344.44 ms / 1 tokens ( 2344.44 ms per token, 0.43 tokens per second)
- eval time = 22816.78 ms / 131 tokens ( 174.17 ms per token, 5.74 tokens per second)
- total time = 25161.22 ms / 132 tokens
- slot launch_slot_: id 20 | task 26678 | processing task
- slot launch_slot_: id 32 | task 26679 | processing task
- slot launch_slot_: id 47 | task 26681 | processing task
- slot launch_slot_: id 53 | task 26683 | processing task
- slot update_slots: id 20 | task 26678 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 20 | task 26678 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 20 | task 26678 | kv cache rm [0, end)
- slot update_slots: id 20 | task 26678 | prompt processing progress, n_past = 199, n_tokens = 259, progress = 1.000000
- slot update_slots: id 20 | task 26678 | prompt done, n_past = 199, n_tokens = 259
- slot update_slots: id 32 | task 26679 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 32 | task 26679 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 32 | task 26679 | kv cache rm [0, end)
- slot update_slots: id 32 | task 26679 | prompt processing progress, n_past = 199, n_tokens = 458, progress = 1.000000
- slot update_slots: id 32 | task 26679 | prompt done, n_past = 199, n_tokens = 458
- slot update_slots: id 47 | task 26681 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 47 | task 26681 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 47 | task 26681 | kv cache rm [0, end)
- slot update_slots: id 47 | task 26681 | prompt processing progress, n_past = 199, n_tokens = 657, progress = 1.000000
- slot update_slots: id 47 | task 26681 | prompt done, n_past = 199, n_tokens = 657
- slot update_slots: id 53 | task 26683 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 53 | task 26683 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 53 | task 26683 | kv cache rm [0, end)
- slot update_slots: id 53 | task 26683 | prompt processing progress, n_past = 199, n_tokens = 856, progress = 1.000000
- slot update_slots: id 53 | task 26683 | prompt done, n_past = 199, n_tokens = 856
- slot update_slots: id 24 | task 26621 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 4 | task 26622 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 8 | task 26623 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 55 | task 26624 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 59 | task 26625 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 4 | task 26622 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 4 | task 26622 |
- prompt eval time = 231.84 ms / 199 tokens ( 1.17 ms per token, 858.34 tokens per second)
- eval time = 11049.72 ms / 60 tokens ( 184.16 ms per token, 5.43 tokens per second)
- total time = 11281.56 ms / 259 tokens
- slot launch_slot_: id 4 | task 26685 | processing task
- slot update_slots: id 7 | task 26626 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 14 | task 26627 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 4 | task 26685 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 4 | task 26685 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 4 | task 26685 | kv cache rm [0, end)
- slot update_slots: id 4 | task 26685 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 4 | task 26685 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 55 | task 26624 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 55 | task 26624 |
- prompt eval time = 243.30 ms / 199 tokens ( 1.22 ms per token, 817.91 tokens per second)
- eval time = 10981.61 ms / 60 tokens ( 183.03 ms per token, 5.46 tokens per second)
- total time = 11224.91 ms / 259 tokens
- slot launch_slot_: id 55 | task 26686 | processing task
- slot update_slots: id 55 | task 26686 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 55 | task 26686 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 55 | task 26686 | kv cache rm [0, end)
- slot update_slots: id 55 | task 26686 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 55 | task 26686 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 39 | task 26545 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 39 | task 26545 |
- prompt eval time = 746.72 ms / 199 tokens ( 3.75 ms per token, 266.50 tokens per second)
- eval time = 21145.67 ms / 101 tokens ( 209.36 ms per token, 4.78 tokens per second)
- total time = 21892.39 ms / 300 tokens
- slot launch_slot_: id 39 | task 26687 | processing task
- slot update_slots: id 39 | task 26687 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 39 | task 26687 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 39 | task 26687 | kv cache rm [0, end)
- slot update_slots: id 39 | task 26687 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 39 | task 26687 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 7 | task 26626 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 7 | task 26626 |
- prompt eval time = 241.90 ms / 199 tokens ( 1.22 ms per token, 822.66 tokens per second)
- eval time = 11175.53 ms / 60 tokens ( 186.26 ms per token, 5.37 tokens per second)
- total time = 11417.42 ms / 259 tokens
- slot release: id 11 | task 26575 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 11 | task 26575 |
- prompt eval time = 1891.10 ms / 199 tokens ( 9.50 ms per token, 105.23 tokens per second)
- eval time = 19414.38 ms / 101 tokens ( 192.22 ms per token, 5.20 tokens per second)
- total time = 21305.48 ms / 300 tokens
- slot launch_slot_: id 7 | task 26691 | processing task
- slot launch_slot_: id 11 | task 26693 | processing task
- slot update_slots: id 7 | task 26691 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 7 | task 26691 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 7 | task 26691 | kv cache rm [0, end)
- slot update_slots: id 7 | task 26691 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 7 | task 26691 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 11 | task 26693 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 11 | task 26693 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 11 | task 26693 | kv cache rm [0, end)
- slot update_slots: id 11 | task 26693 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 11 | task 26693 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 28 | task 26594 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 28 | task 26594 |
- prompt eval time = 2019.73 ms / 199 tokens ( 10.15 ms per token, 98.53 tokens per second)
- eval time = 17650.91 ms / 101 tokens ( 174.76 ms per token, 5.72 tokens per second)
- total time = 19670.64 ms / 300 tokens
- slot launch_slot_: id 28 | task 26652 | processing task
- slot update_slots: id 28 | task 26652 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 28 | task 26652 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 28 | task 26652 | kv cache rm [0, end)
- slot update_slots: id 28 | task 26652 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 28 | task 26652 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 33 | task 26544 | stop processing: n_past = 176, truncated = 1
- slot print_timing: id 33 | task 26544 |
- prompt eval time = 746.18 ms / 199 tokens ( 3.75 ms per token, 266.69 tokens per second)
- eval time = 22121.90 ms / 105 tokens ( 210.68 ms per token, 4.75 tokens per second)
- total time = 22868.08 ms / 304 tokens
- slot launch_slot_: id 33 | task 26694 | processing task
- slot update_slots: id 33 | task 26694 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 33 | task 26694 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 33 | task 26694 | kv cache rm [0, end)
- slot update_slots: id 33 | task 26694 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 33 | task 26694 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 9 | task 26628 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 30 | task 26629 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 26630 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 0 | task 26632 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 26633 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 2 | task 26638 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 6 | task 26605 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 12 | task 26639 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 13 | task 26640 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 26 | task 26641 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 9 | task 26628 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 9 | task 26628 |
- prompt eval time = 595.47 ms / 199 tokens ( 2.99 ms per token, 334.19 tokens per second)
- eval time = 11013.37 ms / 59 tokens ( 186.67 ms per token, 5.36 tokens per second)
- total time = 11608.84 ms / 258 tokens
- slot release: id 41 | task 26630 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 41 | task 26630 |
- prompt eval time = 598.28 ms / 199 tokens ( 3.01 ms per token, 332.62 tokens per second)
- eval time = 11017.27 ms / 59 tokens ( 186.73 ms per token, 5.36 tokens per second)
- total time = 11615.55 ms / 258 tokens
- slot launch_slot_: id 9 | task 26696 | processing task
- slot launch_slot_: id 41 | task 26697 | processing task
- slot update_slots: id 15 | task 27335 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 17 | task 27336 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 18 | task 27337 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 21 | task 27338 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 27 | task 27339 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 29 | task 27340 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 34 | task 27341 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 45 | task 27342 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 9 | task 26696 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 9 | task 26696 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 9 | task 26696 | kv cache rm [0, end)
- slot update_slots: id 9 | task 26696 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 9 | task 26696 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 41 | task 26697 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 41 | task 26697 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 41 | task 26697 | kv cache rm [0, end)
- slot update_slots: id 41 | task 26697 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 41 | task 26697 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 12 | task 26639 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 12 | task 26639 |
- prompt eval time = 1084.13 ms / 199 tokens ( 5.45 ms per token, 183.56 tokens per second)
- eval time = 10490.95 ms / 59 tokens ( 177.81 ms per token, 5.62 tokens per second)
- total time = 11575.08 ms / 258 tokens
- slot launch_slot_: id 12 | task 26698 | processing task
- slot update_slots: id 36 | task 27382 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 40 | task 27383 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 43 | task 26653 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 46 | task 26654 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 50 | task 26655 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 57 | task 26656 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 12 | task 26698 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 12 | task 26698 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 12 | task 26698 | kv cache rm [0, end)
- slot update_slots: id 12 | task 26698 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 12 | task 26698 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 0 | task 26632 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 0 | task 26632 |
- prompt eval time = 1083.47 ms / 199 tokens ( 5.44 ms per token, 183.67 tokens per second)
- eval time = 11004.40 ms / 60 tokens ( 183.41 ms per token, 5.45 tokens per second)
- total time = 12087.86 ms / 259 tokens
- slot release: id 1 | task 26633 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 1 | task 26633 |
- prompt eval time = 1083.46 ms / 199 tokens ( 5.44 ms per token, 183.67 tokens per second)
- eval time = 11004.42 ms / 60 tokens ( 183.41 ms per token, 5.45 tokens per second)
- total time = 12087.88 ms / 259 tokens
- slot release: id 13 | task 26640 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 13 | task 26640 |
- prompt eval time = 1084.12 ms / 199 tokens ( 5.45 ms per token, 183.56 tokens per second)
- eval time = 11004.58 ms / 60 tokens ( 183.41 ms per token, 5.45 tokens per second)
- total time = 12088.70 ms / 259 tokens
- slot release: id 18 | task 27337 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 18 | task 27337 |
- prompt eval time = 1152.52 ms / 199 tokens ( 5.79 ms per token, 172.67 tokens per second)
- eval time = 9846.38 ms / 59 tokens ( 166.89 ms per token, 5.99 tokens per second)
- total time = 10998.90 ms / 258 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- slot launch_slot_: id 0 | task 26700 | processing task
- slot launch_slot_: id 1 | task 26701 | processing task
- slot launch_slot_: id 13 | task 26703 | processing task
- slot launch_slot_: id 18 | task 26705 | processing task
- slot update_slots: id 31 | task 27386 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 49 | task 26665 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 51 | task 27385 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 56 | task 27388 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 60 | task 26660 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 61 | task 26661 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 63 | task 26663 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 0 | task 26700 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 0 | task 26700 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 0 | task 26700 | kv cache rm [0, end)
- slot update_slots: id 0 | task 26700 | prompt processing progress, n_past = 199, n_tokens = 259, progress = 1.000000
- slot update_slots: id 0 | task 26700 | prompt done, n_past = 199, n_tokens = 259
- slot update_slots: id 1 | task 26701 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 1 | task 26701 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 1 | task 26701 | kv cache rm [0, end)
- slot update_slots: id 1 | task 26701 | prompt processing progress, n_past = 199, n_tokens = 458, progress = 1.000000
- slot update_slots: id 1 | task 26701 | prompt done, n_past = 199, n_tokens = 458
- slot update_slots: id 13 | task 26703 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 13 | task 26703 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 13 | task 26703 | kv cache rm [0, end)
- slot update_slots: id 13 | task 26703 | prompt processing progress, n_past = 199, n_tokens = 657, progress = 1.000000
- slot update_slots: id 13 | task 26703 | prompt done, n_past = 199, n_tokens = 657
- slot update_slots: id 18 | task 26705 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 18 | task 26705 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 18 | task 26705 | kv cache rm [0, end)
- slot update_slots: id 18 | task 26705 | prompt processing progress, n_past = 199, n_tokens = 856, progress = 1.000000
- slot update_slots: id 18 | task 26705 | prompt done, n_past = 199, n_tokens = 856
- slot release: id 15 | task 27335 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 15 | task 27335 |
- prompt eval time = 1151.27 ms / 199 tokens ( 5.79 ms per token, 172.85 tokens per second)
- eval time = 10588.47 ms / 60 tokens ( 176.47 ms per token, 5.67 tokens per second)
- total time = 11739.74 ms / 259 tokens
- srv cancel_tasks: cancel task, id_task = 27335
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 17 | task 27336 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 17 | task 27336 |
- prompt eval time = 1152.33 ms / 199 tokens ( 5.79 ms per token, 172.69 tokens per second)
- eval time = 10587.56 ms / 60 tokens ( 176.46 ms per token, 5.67 tokens per second)
- total time = 11739.89 ms / 259 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 29 | task 27340 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 29 | task 27340 |
- prompt eval time = 1154.84 ms / 199 tokens ( 5.80 ms per token, 172.32 tokens per second)
- eval time = 10587.88 ms / 60 tokens ( 176.46 ms per token, 5.67 tokens per second)
- total time = 11742.72 ms / 259 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 34 | task 27341 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 34 | task 27341 |
- prompt eval time = 1155.73 ms / 199 tokens ( 5.81 ms per token, 172.19 tokens per second)
- eval time = 10588.86 ms / 60 tokens ( 176.48 ms per token, 5.67 tokens per second)
- total time = 11744.58 ms / 259 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- slot release: id 50 | task 26655 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 50 | task 26655 |
- prompt eval time = 1275.57 ms / 199 tokens ( 6.41 ms per token, 156.01 tokens per second)
- eval time = 9311.77 ms / 59 tokens ( 157.83 ms per token, 6.34 tokens per second)
- total time = 10587.33 ms / 258 tokens
- srv params_from_: Chat format: Content-only
- slot launch_slot_: id 15 | task 26706 | processing task
- slot launch_slot_: id 17 | task 26570 | processing task
- slot launch_slot_: id 29 | task 26707 | processing task
- slot launch_slot_: id 34 | task 26708 | processing task
- slot launch_slot_: id 50 | task 27450 | processing task
- slot update_slots: id 52 | task 27390 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 15 | task 26706 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 15 | task 26706 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 15 | task 26706 | kv cache rm [0, end)
- slot update_slots: id 15 | task 26706 | prompt processing progress, n_past = 199, n_tokens = 258, progress = 1.000000
- slot update_slots: id 15 | task 26706 | prompt done, n_past = 199, n_tokens = 258
- slot update_slots: id 17 | task 26570 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 17 | task 26570 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 17 | task 26570 | kv cache rm [0, end)
- slot update_slots: id 17 | task 26570 | prompt processing progress, n_past = 199, n_tokens = 457, progress = 1.000000
- slot update_slots: id 17 | task 26570 | prompt done, n_past = 199, n_tokens = 457
- slot update_slots: id 29 | task 26707 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 29 | task 26707 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 29 | task 26707 | kv cache rm [0, end)
- slot update_slots: id 29 | task 26707 | prompt processing progress, n_past = 199, n_tokens = 656, progress = 1.000000
- slot update_slots: id 29 | task 26707 | prompt done, n_past = 199, n_tokens = 656
- slot update_slots: id 34 | task 26708 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 34 | task 26708 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 34 | task 26708 | kv cache rm [0, end)
- slot update_slots: id 34 | task 26708 | prompt processing progress, n_past = 199, n_tokens = 855, progress = 1.000000
- slot update_slots: id 34 | task 26708 | prompt done, n_past = 199, n_tokens = 855
- slot update_slots: id 50 | task 27450 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 50 | task 27450 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 50 | task 27450 | kv cache rm [0, end)
- slot update_slots: id 50 | task 27450 | prompt processing progress, n_past = 199, n_tokens = 1054, progress = 1.000000
- slot update_slots: id 50 | task 27450 | prompt done, n_past = 199, n_tokens = 1054
- slot release: id 31 | task 27386 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 31 | task 27386 |
- prompt eval time = 909.63 ms / 199 tokens ( 4.57 ms per token, 218.77 tokens per second)
- eval time = 9180.13 ms / 59 tokens ( 155.60 ms per token, 6.43 tokens per second)
- total time = 10089.77 ms / 258 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 40 | task 27383 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 40 | task 27383 |
- prompt eval time = 1273.51 ms / 199 tokens ( 6.40 ms per token, 156.26 tokens per second)
- eval time = 10097.20 ms / 60 tokens ( 168.29 ms per token, 5.94 tokens per second)
- total time = 11370.71 ms / 259 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27342
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- slot release: id 57 | task 26656 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 57 | task 26656 |
- prompt eval time = 1277.18 ms / 199 tokens ( 6.42 ms per token, 155.81 tokens per second)
- eval time = 10098.36 ms / 60 tokens ( 168.31 ms per token, 5.94 tokens per second)
- total time = 11375.55 ms / 259 tokens
- slot release: id 60 | task 26660 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 60 | task 26660 |
- prompt eval time = 916.52 ms / 199 tokens ( 4.61 ms per token, 217.12 tokens per second)
- eval time = 9180.74 ms / 59 tokens ( 155.61 ms per token, 6.43 tokens per second)
- total time = 10097.26 ms / 258 tokens
- slot release: id 45 | task 27342 | stop processing: n_past = 132, truncated = 1
- slot launch_slot_: id 31 | task 26709 | processing task
- slot launch_slot_: id 40 | task 26710 | processing task
- slot launch_slot_: id 57 | task 26711 | processing task
- slot launch_slot_: id 60 | task 27456 | processing task
- slot launch_slot_: id 45 | task 27457 | processing task
- slot update_slots: id 58 | task 26677 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 31 | task 26709 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 31 | task 26709 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 31 | task 26709 | kv cache rm [0, end)
- slot update_slots: id 31 | task 26709 | prompt processing progress, n_past = 199, n_tokens = 258, progress = 1.000000
- slot update_slots: id 31 | task 26709 | prompt done, n_past = 199, n_tokens = 258
- slot update_slots: id 40 | task 26710 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 40 | task 26710 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 40 | task 26710 | kv cache rm [0, end)
- slot update_slots: id 40 | task 26710 | prompt processing progress, n_past = 199, n_tokens = 457, progress = 1.000000
- slot update_slots: id 40 | task 26710 | prompt done, n_past = 199, n_tokens = 457
- slot update_slots: id 45 | task 27457 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 45 | task 27457 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 45 | task 27457 | kv cache rm [0, end)
- slot update_slots: id 45 | task 27457 | prompt processing progress, n_past = 199, n_tokens = 656, progress = 1.000000
- slot update_slots: id 45 | task 27457 | prompt done, n_past = 199, n_tokens = 656
- slot update_slots: id 57 | task 26711 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 57 | task 26711 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 57 | task 26711 | kv cache rm [0, end)
- slot update_slots: id 57 | task 26711 | prompt processing progress, n_past = 199, n_tokens = 855, progress = 1.000000
- slot update_slots: id 57 | task 26711 | prompt done, n_past = 199, n_tokens = 855
- slot update_slots: id 60 | task 27456 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 60 | task 27456 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 60 | task 27456 | kv cache rm [0, end)
- slot update_slots: id 60 | task 27456 | prompt processing progress, n_past = 199, n_tokens = 1054, progress = 1.000000
- slot update_slots: id 60 | task 27456 | prompt done, n_past = 199, n_tokens = 1054
- srv params_from_: Chat format: Content-only
- slot release: id 5 | task 26572 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 5 | task 26572 |
- prompt eval time = 1890.88 ms / 199 tokens ( 9.50 ms per token, 105.24 tokens per second)
- eval time = 24977.55 ms / 123 tokens ( 203.07 ms per token, 4.92 tokens per second)
- total time = 26868.43 ms / 322 tokens
- slot release: id 49 | task 26665 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 49 | task 26665 |
- prompt eval time = 913.93 ms / 199 tokens ( 4.59 ms per token, 217.74 tokens per second)
- eval time = 9981.71 ms / 60 tokens ( 166.36 ms per token, 6.01 tokens per second)
- total time = 10895.65 ms / 259 tokens
- slot release: id 61 | task 26661 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 61 | task 26661 |
- prompt eval time = 916.63 ms / 199 tokens ( 4.61 ms per token, 217.10 tokens per second)
- eval time = 9981.62 ms / 60 tokens ( 166.36 ms per token, 6.01 tokens per second)
- total time = 10898.26 ms / 259 tokens
- slot release: id 63 | task 26663 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 63 | task 26663 |
- prompt eval time = 916.73 ms / 199 tokens ( 4.61 ms per token, 217.08 tokens per second)
- eval time = 9981.92 ms / 60 tokens ( 166.37 ms per token, 6.01 tokens per second)
- total time = 10898.65 ms / 259 tokens
- slot launch_slot_: id 5 | task 27459 | processing task
- slot launch_slot_: id 49 | task 26713 | processing task
- slot launch_slot_: id 61 | task 26715 | processing task
- slot launch_slot_: id 63 | task 26714 | processing task
- slot update_slots: id 5 | task 27459 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 5 | task 27459 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 5 | task 27459 | kv cache rm [0, end)
- slot update_slots: id 5 | task 27459 | prompt processing progress, n_past = 199, n_tokens = 259, progress = 1.000000
- slot update_slots: id 5 | task 27459 | prompt done, n_past = 199, n_tokens = 259
- slot update_slots: id 49 | task 26713 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 49 | task 26713 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 49 | task 26713 | kv cache rm [0, end)
- slot update_slots: id 49 | task 26713 | prompt processing progress, n_past = 199, n_tokens = 458, progress = 1.000000
- slot update_slots: id 49 | task 26713 | prompt done, n_past = 199, n_tokens = 458
- slot update_slots: id 61 | task 26715 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 61 | task 26715 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 61 | task 26715 | kv cache rm [0, end)
- slot update_slots: id 61 | task 26715 | prompt processing progress, n_past = 199, n_tokens = 657, progress = 1.000000
- slot update_slots: id 61 | task 26715 | prompt done, n_past = 199, n_tokens = 657
- slot update_slots: id 63 | task 26714 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 63 | task 26714 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 63 | task 26714 | kv cache rm [0, end)
- slot update_slots: id 63 | task 26714 | prompt processing progress, n_past = 199, n_tokens = 856, progress = 1.000000
- slot update_slots: id 63 | task 26714 | prompt done, n_past = 199, n_tokens = 856
- srv cancel_tasks: cancel task, id_task = 27338
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- slot release: id 21 | task 27338 | stop processing: n_past = 134, truncated = 1
- slot launch_slot_: id 21 | task 27462 | processing task
- slot update_slots: id 19 | task 26483 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 22 | task 26488 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 23 | task 26535 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 25 | task 26512 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 35 | task 26522 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 37 | task 26484 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 21 | task 27462 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 21 | task 27462 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 21 | task 27462 | kv cache rm [0, end)
- slot update_slots: id 21 | task 27462 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 21 | task 27462 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 58 | task 26677 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 58 | task 26677 |
- prompt eval time = 422.19 ms / 199 tokens ( 2.12 ms per token, 471.35 tokens per second)
- eval time = 10198.08 ms / 60 tokens ( 169.97 ms per token, 5.88 tokens per second)
- total time = 10620.27 ms / 259 tokens
- slot launch_slot_: id 58 | task 26719 | processing task
- slot update_slots: id 48 | task 26498 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 54 | task 26525 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 58 | task 26719 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 58 | task 26719 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 58 | task 26719 | kv cache rm [0, end)
- slot update_slots: id 58 | task 26719 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 58 | task 26719 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27339
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- slot release: id 27 | task 27339 | stop processing: n_past = 137, truncated = 1
- slot launch_slot_: id 27 | task 27467 | processing task
- slot update_slots: id 27 | task 27467 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 27 | task 27467 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 27 | task 27467 | kv cache rm [0, end)
- slot update_slots: id 27 | task 27467 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 27 | task 27467 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 20 | task 26678 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 32 | task 26679 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 47 | task 26681 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 53 | task 26683 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 3 | task 26567 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 3 | task 26567 |
- prompt eval time = 1890.78 ms / 199 tokens ( 9.50 ms per token, 105.25 tokens per second)
- eval time = 27444.28 ms / 132 tokens ( 207.91 ms per token, 4.81 tokens per second)
- total time = 29335.06 ms / 331 tokens
- slot release: id 10 | task 26576 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 10 | task 26576 |
- prompt eval time = 1891.11 ms / 199 tokens ( 9.50 ms per token, 105.23 tokens per second)
- eval time = 27445.45 ms / 132 tokens ( 207.92 ms per token, 4.81 tokens per second)
- total time = 29336.56 ms / 331 tokens
- slot release: id 20 | task 26678 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 20 | task 26678 |
- prompt eval time = 432.17 ms / 199 tokens ( 2.17 ms per token, 460.47 tokens per second)
- eval time = 10770.12 ms / 60 tokens ( 179.50 ms per token, 5.57 tokens per second)
- total time = 11202.29 ms / 259 tokens
- slot release: id 32 | task 26679 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 32 | task 26679 |
- prompt eval time = 434.98 ms / 199 tokens ( 2.19 ms per token, 457.50 tokens per second)
- eval time = 10769.85 ms / 60 tokens ( 179.50 ms per token, 5.57 tokens per second)
- total time = 11204.82 ms / 259 tokens
- slot release: id 47 | task 26681 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 47 | task 26681 |
- prompt eval time = 438.53 ms / 199 tokens ( 2.20 ms per token, 453.79 tokens per second)
- eval time = 10769.50 ms / 60 tokens ( 179.49 ms per token, 5.57 tokens per second)
- total time = 11208.03 ms / 259 tokens
- slot launch_slot_: id 3 | task 26721 | processing task
- slot launch_slot_: id 10 | task 26722 | processing task
- slot launch_slot_: id 20 | task 26723 | processing task
- slot launch_slot_: id 32 | task 26724 | processing task
- slot launch_slot_: id 47 | task 26725 | processing task
- slot update_slots: id 3 | task 26721 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 3 | task 26721 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 3 | task 26721 | kv cache rm [0, end)
- slot update_slots: id 3 | task 26721 | prompt processing progress, n_past = 199, n_tokens = 258, progress = 1.000000
- slot update_slots: id 3 | task 26721 | prompt done, n_past = 199, n_tokens = 258
- slot update_slots: id 10 | task 26722 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 10 | task 26722 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 10 | task 26722 | kv cache rm [0, end)
- slot update_slots: id 10 | task 26722 | prompt processing progress, n_past = 199, n_tokens = 457, progress = 1.000000
- slot update_slots: id 10 | task 26722 | prompt done, n_past = 199, n_tokens = 457
- slot update_slots: id 20 | task 26723 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 20 | task 26723 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 20 | task 26723 | kv cache rm [0, end)
- slot update_slots: id 20 | task 26723 | prompt processing progress, n_past = 199, n_tokens = 656, progress = 1.000000
- slot update_slots: id 20 | task 26723 | prompt done, n_past = 199, n_tokens = 656
- slot update_slots: id 32 | task 26724 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 32 | task 26724 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 32 | task 26724 | kv cache rm [0, end)
- slot update_slots: id 32 | task 26724 | prompt processing progress, n_past = 199, n_tokens = 855, progress = 1.000000
- slot update_slots: id 32 | task 26724 | prompt done, n_past = 199, n_tokens = 855
- slot update_slots: id 47 | task 26725 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 47 | task 26725 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 47 | task 26725 | kv cache rm [0, end)
- slot update_slots: id 47 | task 26725 | prompt processing progress, n_past = 199, n_tokens = 1054, progress = 1.000000
- slot update_slots: id 47 | task 26725 | prompt done, n_past = 199, n_tokens = 1054
- slot release: id 59 | task 26625 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 59 | task 26625 |
- prompt eval time = 244.69 ms / 199 tokens ( 1.23 ms per token, 813.28 tokens per second)
- eval time = 20624.75 ms / 101 tokens ( 204.21 ms per token, 4.90 tokens per second)
- total time = 20869.44 ms / 300 tokens
- slot launch_slot_: id 59 | task 26726 | processing task
- slot update_slots: id 59 | task 26726 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 59 | task 26726 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 59 | task 26726 | kv cache rm [0, end)
- slot update_slots: id 59 | task 26726 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 59 | task 26726 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 42 | task 26606 | stop processing: n_past = 214, truncated = 1
- slot print_timing: id 42 | task 26606 |
- prompt eval time = 2241.95 ms / 199 tokens ( 11.27 ms per token, 88.76 tokens per second)
- eval time = 24904.63 ms / 143 tokens ( 174.16 ms per token, 5.74 tokens per second)
- total time = 27146.59 ms / 342 tokens
- slot launch_slot_: id 42 | task 26727 | processing task
- slot update_slots: id 42 | task 26727 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 42 | task 26727 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 42 | task 26727 | kv cache rm [0, end)
- slot update_slots: id 42 | task 26727 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 42 | task 26727 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 4 | task 26685 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 55 | task 26686 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 4 | task 26685 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 4 | task 26685 |
- prompt eval time = 162.65 ms / 199 tokens ( 0.82 ms per token, 1223.52 tokens per second)
- eval time = 11102.70 ms / 59 tokens ( 188.18 ms per token, 5.31 tokens per second)
- total time = 11265.34 ms / 258 tokens
- slot launch_slot_: id 4 | task 26573 | processing task
- slot update_slots: id 39 | task 26687 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 4 | task 26573 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 4 | task 26573 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 4 | task 26573 | kv cache rm [0, end)
- slot update_slots: id 4 | task 26573 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 4 | task 26573 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 7 | task 26691 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 11 | task 26693 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 28 | task 26652 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 7 | task 26691 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 7 | task 26691 |
- prompt eval time = 244.22 ms / 199 tokens ( 1.23 ms per token, 814.85 tokens per second)
- eval time = 10941.39 ms / 59 tokens ( 185.45 ms per token, 5.39 tokens per second)
- total time = 11185.60 ms / 258 tokens
- slot release: id 39 | task 26687 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 39 | task 26687 |
- prompt eval time = 164.31 ms / 199 tokens ( 0.83 ms per token, 1211.15 tokens per second)
- eval time = 11196.96 ms / 60 tokens ( 186.62 ms per token, 5.36 tokens per second)
- total time = 11361.27 ms / 259 tokens
- slot launch_slot_: id 7 | task 26728 | processing task
- slot launch_slot_: id 39 | task 26729 | processing task
- slot update_slots: id 7 | task 26728 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 7 | task 26728 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 7 | task 26728 | kv cache rm [0, end)
- slot update_slots: id 7 | task 26728 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 7 | task 26728 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 39 | task 26729 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 39 | task 26729 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 39 | task 26729 | kv cache rm [0, end)
- slot update_slots: id 39 | task 26729 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 39 | task 26729 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 11 | task 26693 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 11 | task 26693 |
- prompt eval time = 244.47 ms / 199 tokens ( 1.23 ms per token, 814.00 tokens per second)
- eval time = 11192.37 ms / 60 tokens ( 186.54 ms per token, 5.36 tokens per second)
- total time = 11436.84 ms / 259 tokens
- slot launch_slot_: id 11 | task 26732 | processing task
- slot update_slots: id 33 | task 26694 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 11 | task 26732 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 11 | task 26732 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 11 | task 26732 | kv cache rm [0, end)
- slot update_slots: id 11 | task 26732 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 11 | task 26732 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 8 | task 26623 | stop processing: n_past = 195, truncated = 1
- slot print_timing: id 8 | task 26623 |
- prompt eval time = 232.12 ms / 199 tokens ( 1.17 ms per token, 857.33 tokens per second)
- eval time = 23508.00 ms / 124 tokens ( 189.58 ms per token, 5.27 tokens per second)
- total time = 23740.12 ms / 323 tokens
- slot release: id 28 | task 26652 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 28 | task 26652 |
- prompt eval time = 167.24 ms / 199 tokens ( 0.84 ms per token, 1189.89 tokens per second)
- eval time = 11348.07 ms / 60 tokens ( 189.13 ms per token, 5.29 tokens per second)
- total time = 11515.31 ms / 259 tokens
- slot launch_slot_: id 8 | task 26734 | processing task
- slot launch_slot_: id 28 | task 26735 | processing task
- slot update_slots: id 8 | task 26734 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 8 | task 26734 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 8 | task 26734 | kv cache rm [0, end)
- slot update_slots: id 8 | task 26734 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 8 | task 26734 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 28 | task 26735 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 28 | task 26735 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 28 | task 26735 | kv cache rm [0, end)
- slot update_slots: id 28 | task 26735 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 28 | task 26735 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 36 | task 27382 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 36 | task 27382 |
- prompt eval time = 1272.89 ms / 199 tokens ( 6.40 ms per token, 156.34 tokens per second)
- eval time = 17369.87 ms / 101 tokens ( 171.98 ms per token, 5.81 tokens per second)
- total time = 18642.76 ms / 300 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot launch_slot_: id 36 | task 26736 | processing task
- slot update_slots: id 36 | task 26736 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 36 | task 26736 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 36 | task 26736 | kv cache rm [0, end)
- slot update_slots: id 36 | task 26736 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 36 | task 26736 | prompt done, n_past = 199, n_tokens = 262
- srv params_from_: Chat format: Content-only
- slot release: id 33 | task 26694 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 33 | task 26694 |
- prompt eval time = 166.50 ms / 199 tokens ( 0.84 ms per token, 1195.20 tokens per second)
- eval time = 11383.52 ms / 60 tokens ( 189.73 ms per token, 5.27 tokens per second)
- total time = 11550.02 ms / 259 tokens
- slot release: id 56 | task 27388 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 56 | task 27388 |
- prompt eval time = 915.36 ms / 199 tokens ( 4.60 ms per token, 217.40 tokens per second)
- eval time = 16781.24 ms / 101 tokens ( 166.15 ms per token, 6.02 tokens per second)
- total time = 17696.60 ms / 300 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot launch_slot_: id 33 | task 27505 | processing task
- slot launch_slot_: id 56 | task 26737 | processing task
- slot update_slots: id 33 | task 27505 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 33 | task 27505 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 33 | task 27505 | kv cache rm [0, end)
- slot update_slots: id 33 | task 27505 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 33 | task 27505 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 56 | task 26737 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 56 | task 26737 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 56 | task 26737 | kv cache rm [0, end)
- slot update_slots: id 56 | task 26737 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 56 | task 26737 | prompt done, n_past = 199, n_tokens = 460
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27385
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 51 | task 27385 | stop processing: n_past = 173, truncated = 1
- slot launch_slot_: id 51 | task 27507 | processing task
- slot update_slots: id 51 | task 27507 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 51 | task 27507 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 51 | task 27507 | kv cache rm [0, end)
- slot update_slots: id 51 | task 27507 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 51 | task 27507 | prompt done, n_past = 199, n_tokens = 262
- srv params_from_: Chat format: Content-only
- slot update_slots: id 9 | task 26696 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 26697 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 12 | task 26698 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 0 | task 26700 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 26701 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 13 | task 26703 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 18 | task 26705 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 9 | task 26696 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 9 | task 26696 |
- prompt eval time = 558.39 ms / 199 tokens ( 2.81 ms per token, 356.38 tokens per second)
- eval time = 11177.75 ms / 60 tokens ( 186.30 ms per token, 5.37 tokens per second)
- total time = 11736.14 ms / 259 tokens
- slot release: id 41 | task 26697 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 41 | task 26697 |
- prompt eval time = 566.21 ms / 199 tokens ( 2.85 ms per token, 351.46 tokens per second)
- eval time = 11175.86 ms / 60 tokens ( 186.26 ms per token, 5.37 tokens per second)
- total time = 11742.07 ms / 259 tokens
- srv cancel_tasks: cancel task, id_task = 27390
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 52 | task 27390 | stop processing: n_past = 185, truncated = 1
- slot launch_slot_: id 9 | task 26740 | processing task
- slot launch_slot_: id 41 | task 26741 | processing task
- slot launch_slot_: id 52 | task 26742 | processing task
- slot update_slots: id 15 | task 26706 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 17 | task 26570 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 29 | task 26707 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 34 | task 26708 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 50 | task 27450 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 9 | task 26740 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 9 | task 26740 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 9 | task 26740 | kv cache rm [0, end)
- slot update_slots: id 9 | task 26740 | prompt processing progress, n_past = 199, n_tokens = 260, progress = 1.000000
- slot update_slots: id 9 | task 26740 | prompt done, n_past = 199, n_tokens = 260
- slot update_slots: id 41 | task 26741 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 41 | task 26741 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 41 | task 26741 | kv cache rm [0, end)
- slot update_slots: id 41 | task 26741 | prompt processing progress, n_past = 199, n_tokens = 459, progress = 1.000000
- slot update_slots: id 41 | task 26741 | prompt done, n_past = 199, n_tokens = 459
- slot update_slots: id 52 | task 26742 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 52 | task 26742 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 52 | task 26742 | kv cache rm [0, end)
- slot update_slots: id 52 | task 26742 | prompt processing progress, n_past = 199, n_tokens = 658, progress = 1.000000
- slot update_slots: id 52 | task 26742 | prompt done, n_past = 199, n_tokens = 658
- srv params_from_: Chat format: Content-only
- slot release: id 12 | task 26698 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 12 | task 26698 |
- prompt eval time = 501.51 ms / 199 tokens ( 2.52 ms per token, 396.80 tokens per second)
- eval time = 11213.89 ms / 60 tokens ( 186.90 ms per token, 5.35 tokens per second)
- total time = 11715.40 ms / 259 tokens
- slot launch_slot_: id 12 | task 27525 | processing task
- slot update_slots: id 31 | task 26709 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 40 | task 26710 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 45 | task 27457 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 57 | task 26711 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 60 | task 27456 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 12 | task 27525 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 12 | task 27525 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 12 | task 27525 | kv cache rm [0, end)
- slot update_slots: id 12 | task 27525 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 12 | task 27525 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 13 | task 26703 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 13 | task 26703 |
- prompt eval time = 727.69 ms / 199 tokens ( 3.66 ms per token, 273.47 tokens per second)
- eval time = 10817.63 ms / 60 tokens ( 180.29 ms per token, 5.55 tokens per second)
- total time = 11545.32 ms / 259 tokens
- slot release: id 18 | task 26705 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 18 | task 26705 |
- prompt eval time = 729.38 ms / 199 tokens ( 3.67 ms per token, 272.83 tokens per second)
- eval time = 10816.39 ms / 60 tokens ( 180.27 ms per token, 5.55 tokens per second)
- total time = 11545.76 ms / 259 tokens
- slot release: id 29 | task 26707 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 29 | task 26707 |
- prompt eval time = 775.95 ms / 199 tokens ( 3.90 ms per token, 256.46 tokens per second)
- eval time = 10027.16 ms / 59 tokens ( 169.95 ms per token, 5.88 tokens per second)
- total time = 10803.11 ms / 258 tokens
- srv cancel_tasks: cancel task, id_task = 27450
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 50 | task 27450 | stop processing: n_past = 130, truncated = 1
- slot launch_slot_: id 13 | task 26744 | processing task
- slot launch_slot_: id 18 | task 26745 | processing task
- slot launch_slot_: id 29 | task 26746 | processing task
- slot launch_slot_: id 50 | task 26747 | processing task
- slot update_slots: id 5 | task 27459 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 49 | task 26713 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 61 | task 26715 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 63 | task 26714 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 13 | task 26744 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 13 | task 26744 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 13 | task 26744 | kv cache rm [0, end)
- slot update_slots: id 13 | task 26744 | prompt processing progress, n_past = 199, n_tokens = 259, progress = 1.000000
- slot update_slots: id 13 | task 26744 | prompt done, n_past = 199, n_tokens = 259
- slot update_slots: id 18 | task 26745 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 18 | task 26745 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 18 | task 26745 | kv cache rm [0, end)
- slot update_slots: id 18 | task 26745 | prompt processing progress, n_past = 199, n_tokens = 458, progress = 1.000000
- slot update_slots: id 18 | task 26745 | prompt done, n_past = 199, n_tokens = 458
- slot update_slots: id 29 | task 26746 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 29 | task 26746 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 29 | task 26746 | kv cache rm [0, end)
- slot update_slots: id 29 | task 26746 | prompt processing progress, n_past = 199, n_tokens = 657, progress = 1.000000
- slot update_slots: id 29 | task 26746 | prompt done, n_past = 199, n_tokens = 657
- slot update_slots: id 50 | task 26747 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 50 | task 26747 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 50 | task 26747 | kv cache rm [0, end)
- slot update_slots: id 50 | task 26747 | prompt processing progress, n_past = 199, n_tokens = 856, progress = 1.000000
- slot update_slots: id 50 | task 26747 | prompt done, n_past = 199, n_tokens = 856
- srv params_from_: Chat format: Content-only
- slot release: id 15 | task 26706 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 15 | task 26706 |
- prompt eval time = 773.16 ms / 199 tokens ( 3.89 ms per token, 257.39 tokens per second)
- eval time = 10671.38 ms / 60 tokens ( 177.86 ms per token, 5.62 tokens per second)
- total time = 11444.53 ms / 259 tokens
- slot release: id 17 | task 26570 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 17 | task 26570 |
- prompt eval time = 773.23 ms / 199 tokens ( 3.89 ms per token, 257.36 tokens per second)
- eval time = 10671.41 ms / 60 tokens ( 177.86 ms per token, 5.62 tokens per second)
- total time = 11444.63 ms / 259 tokens
- slot launch_slot_: id 15 | task 27529 | processing task
- slot launch_slot_: id 17 | task 26748 | processing task
- slot update_slots: id 21 | task 27462 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 15 | task 27529 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 15 | task 27529 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 15 | task 27529 | kv cache rm [0, end)
- slot update_slots: id 15 | task 27529 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 15 | task 27529 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 17 | task 26748 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 17 | task 26748 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 17 | task 26748 | kv cache rm [0, end)
- slot update_slots: id 17 | task 26748 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 17 | task 26748 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 40 | task 26710 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 40 | task 26710 |
- prompt eval time = 795.32 ms / 199 tokens ( 4.00 ms per token, 250.21 tokens per second)
- eval time = 10308.62 ms / 60 tokens ( 171.81 ms per token, 5.82 tokens per second)
- total time = 11103.94 ms / 259 tokens
- slot release: id 45 | task 27457 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 45 | task 27457 |
- prompt eval time = 795.71 ms / 199 tokens ( 4.00 ms per token, 250.09 tokens per second)
- eval time = 10309.56 ms / 60 tokens ( 171.83 ms per token, 5.82 tokens per second)
- total time = 11105.27 ms / 259 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 57 | task 26711 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 57 | task 26711 |
- prompt eval time = 799.07 ms / 199 tokens ( 4.02 ms per token, 249.04 tokens per second)
- eval time = 10307.66 ms / 60 tokens ( 171.79 ms per token, 5.82 tokens per second)
- total time = 11106.73 ms / 259 tokens
- slot launch_slot_: id 40 | task 26750 | processing task
- slot launch_slot_: id 45 | task 26751 | processing task
- slot launch_slot_: id 57 | task 26752 | processing task
- slot update_slots: id 58 | task 26719 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 40 | task 26750 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 40 | task 26750 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 40 | task 26750 | kv cache rm [0, end)
- slot update_slots: id 40 | task 26750 | prompt processing progress, n_past = 199, n_tokens = 260, progress = 1.000000
- slot update_slots: id 40 | task 26750 | prompt done, n_past = 199, n_tokens = 260
- slot update_slots: id 45 | task 26751 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 45 | task 26751 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 45 | task 26751 | kv cache rm [0, end)
- slot update_slots: id 45 | task 26751 | prompt processing progress, n_past = 199, n_tokens = 459, progress = 1.000000
- slot update_slots: id 45 | task 26751 | prompt done, n_past = 199, n_tokens = 459
- slot update_slots: id 57 | task 26752 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 57 | task 26752 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 57 | task 26752 | kv cache rm [0, end)
- slot update_slots: id 57 | task 26752 | prompt processing progress, n_past = 199, n_tokens = 658, progress = 1.000000
- slot update_slots: id 57 | task 26752 | prompt done, n_past = 199, n_tokens = 658
- srv params_from_: Chat format: Content-only
- slot release: id 2 | task 26638 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 2 | task 26638 |
- prompt eval time = 1083.46 ms / 199 tokens ( 5.44 ms per token, 183.67 tokens per second)
- eval time = 24183.77 ms / 123 tokens ( 196.62 ms per token, 5.09 tokens per second)
- total time = 25267.23 ms / 322 tokens
- slot release: id 5 | task 27459 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 5 | task 27459 |
- prompt eval time = 631.25 ms / 199 tokens ( 3.17 ms per token, 315.25 tokens per second)
- eval time = 10203.12 ms / 60 tokens ( 170.05 ms per token, 5.88 tokens per second)
- total time = 10834.36 ms / 259 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- slot release: id 63 | task 26714 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 63 | task 26714 |
- prompt eval time = 641.80 ms / 199 tokens ( 3.23 ms per token, 310.07 tokens per second)
- eval time = 10204.05 ms / 60 tokens ( 170.07 ms per token, 5.88 tokens per second)
- total time = 10845.85 ms / 259 tokens
- slot launch_slot_: id 2 | task 27532 | processing task
- slot launch_slot_: id 5 | task 26753 | processing task
- slot launch_slot_: id 63 | task 26754 | processing task
- slot update_slots: id 2 | task 27532 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 2 | task 27532 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 2 | task 27532 | kv cache rm [0, end)
- slot update_slots: id 2 | task 27532 | prompt processing progress, n_past = 199, n_tokens = 260, progress = 1.000000
- slot update_slots: id 2 | task 27532 | prompt done, n_past = 199, n_tokens = 260
- slot update_slots: id 5 | task 26753 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 5 | task 26753 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 5 | task 26753 | kv cache rm [0, end)
- slot update_slots: id 5 | task 26753 | prompt processing progress, n_past = 199, n_tokens = 459, progress = 1.000000
- slot update_slots: id 5 | task 26753 | prompt done, n_past = 199, n_tokens = 459
- slot update_slots: id 63 | task 26754 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 63 | task 26754 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 63 | task 26754 | kv cache rm [0, end)
- slot update_slots: id 63 | task 26754 | prompt processing progress, n_past = 199, n_tokens = 658, progress = 1.000000
- slot update_slots: id 63 | task 26754 | prompt done, n_past = 199, n_tokens = 658
- slot release: id 21 | task 27462 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 21 | task 27462 |
- prompt eval time = 447.23 ms / 199 tokens ( 2.25 ms per token, 444.96 tokens per second)
- eval time = 10285.98 ms / 60 tokens ( 171.43 ms per token, 5.83 tokens per second)
- total time = 10733.21 ms / 259 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 58 | task 26719 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 58 | task 26719 |
- prompt eval time = 362.73 ms / 199 tokens ( 1.82 ms per token, 548.61 tokens per second)
- eval time = 9919.69 ms / 59 tokens ( 168.13 ms per token, 5.95 tokens per second)
- total time = 10282.42 ms / 258 tokens
- srv params_from_: Chat format: Content-only
- slot launch_slot_: id 21 | task 26756 | processing task
- slot launch_slot_: id 58 | task 26757 | processing task
- slot update_slots: id 27 | task 27467 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 21 | task 26756 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 21 | task 26756 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 21 | task 26756 | kv cache rm [0, end)
- slot update_slots: id 21 | task 26756 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 21 | task 26756 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 58 | task 26757 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 58 | task 26757 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 58 | task 26757 | kv cache rm [0, end)
- slot update_slots: id 58 | task 26757 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 58 | task 26757 | prompt done, n_past = 199, n_tokens = 460
- srv cancel_tasks: cancel task, id_task = 27456
- slot update_slots: id 16 | task 26582 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27467
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27505
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- slot release: id 33 | task 27505 | stop processing: n_past = 220, truncated = 1
- slot release: id 27 | task 27467 | stop processing: n_past = 130, truncated = 1
- slot release: id 60 | task 27456 | stop processing: n_past = 135, truncated = 1
- slot launch_slot_: id 33 | task 27539 | processing task
- slot launch_slot_: id 27 | task 27542 | processing task
- srv params_from_: Chat format: Content-only
- slot launch_slot_: id 60 | task 26758 | processing task
- slot update_slots: id 38 | task 26598 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 44 | task 26608 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 27 | task 27542 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 27 | task 27542 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 27 | task 27542 | kv cache rm [0, end)
- slot update_slots: id 27 | task 27542 | prompt processing progress, n_past = 199, n_tokens = 260, progress = 1.000000
- slot update_slots: id 27 | task 27542 | prompt done, n_past = 199, n_tokens = 260
- slot update_slots: id 33 | task 27539 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 33 | task 27539 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 33 | task 27539 | need to evaluate at least 1 token to generate logits, n_past = 199, n_prompt_tokens = 199
- slot update_slots: id 33 | task 27539 | kv cache rm [198, end)
- slot update_slots: id 33 | task 27539 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 0.005025
- slot update_slots: id 33 | task 27539 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 60 | task 26758 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 60 | task 26758 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 60 | task 26758 | kv cache rm [0, end)
- slot update_slots: id 60 | task 26758 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 60 | task 26758 | prompt done, n_past = 199, n_tokens = 460
- slot update_slots: id 62 | task 26614 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27507
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 51 | task 27507 | stop processing: n_past = 221, truncated = 1
- slot launch_slot_: id 51 | task 26761 | processing task
- slot update_slots: id 51 | task 26761 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 51 | task 26761 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 51 | task 26761 | need to evaluate at least 1 token to generate logits, n_past = 199, n_prompt_tokens = 199
- slot update_slots: id 51 | task 26761 | kv cache rm [198, end)
- slot update_slots: id 51 | task 26761 | prompt processing progress, n_past = 199, n_tokens = 64, progress = 0.005025
- slot update_slots: id 51 | task 26761 | prompt done, n_past = 199, n_tokens = 64
- srv params_from_: Chat format: Content-only
- slot update_slots: id 3 | task 26721 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 10 | task 26722 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 20 | task 26723 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 32 | task 26724 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 47 | task 26725 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 3 | task 26721 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 3 | task 26721 |
- prompt eval time = 501.75 ms / 199 tokens ( 2.52 ms per token, 396.62 tokens per second)
- eval time = 10465.31 ms / 60 tokens ( 174.42 ms per token, 5.73 tokens per second)
- total time = 10967.05 ms / 259 tokens
- slot release: id 20 | task 26723 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 20 | task 26723 |
- prompt eval time = 504.14 ms / 199 tokens ( 2.53 ms per token, 394.73 tokens per second)
- eval time = 10466.52 ms / 60 tokens ( 174.44 ms per token, 5.73 tokens per second)
- total time = 10970.66 ms / 259 tokens
- slot release: id 26 | task 26641 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 26 | task 26641 |
- prompt eval time = 1085.37 ms / 199 tokens ( 5.45 ms per token, 183.35 tokens per second)
- eval time = 26787.91 ms / 132 tokens ( 202.94 ms per token, 4.93 tokens per second)
- total time = 27873.28 ms / 331 tokens
- slot release: id 32 | task 26724 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 32 | task 26724 |
- prompt eval time = 506.59 ms / 199 tokens ( 2.55 ms per token, 392.82 tokens per second)
- eval time = 10466.03 ms / 60 tokens ( 174.43 ms per token, 5.73 tokens per second)
- total time = 10972.62 ms / 259 tokens
- slot release: id 47 | task 26725 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 47 | task 26725 |
- prompt eval time = 509.36 ms / 199 tokens ( 2.56 ms per token, 390.69 tokens per second)
- eval time = 10465.44 ms / 60 tokens ( 174.42 ms per token, 5.73 tokens per second)
- total time = 10974.80 ms / 259 tokens
- slot launch_slot_: id 3 | task 26762 | processing task
- slot launch_slot_: id 20 | task 26763 | processing task
- slot launch_slot_: id 26 | task 26764 | processing task
- slot launch_slot_: id 32 | task 26767 | processing task
- slot launch_slot_: id 47 | task 26768 | processing task
- slot update_slots: id 3 | task 26762 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 3 | task 26762 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 3 | task 26762 | kv cache rm [0, end)
- slot update_slots: id 3 | task 26762 | prompt processing progress, n_past = 199, n_tokens = 258, progress = 1.000000
- slot update_slots: id 3 | task 26762 | prompt done, n_past = 199, n_tokens = 258
- slot update_slots: id 20 | task 26763 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 20 | task 26763 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 20 | task 26763 | kv cache rm [0, end)
- slot update_slots: id 20 | task 26763 | prompt processing progress, n_past = 199, n_tokens = 457, progress = 1.000000
- slot update_slots: id 20 | task 26763 | prompt done, n_past = 199, n_tokens = 457
- slot update_slots: id 26 | task 26764 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 26 | task 26764 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 26 | task 26764 | kv cache rm [0, end)
- slot update_slots: id 26 | task 26764 | prompt processing progress, n_past = 199, n_tokens = 656, progress = 1.000000
- slot update_slots: id 26 | task 26764 | prompt done, n_past = 199, n_tokens = 656
- slot update_slots: id 32 | task 26767 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 32 | task 26767 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 32 | task 26767 | kv cache rm [0, end)
- slot update_slots: id 32 | task 26767 | prompt processing progress, n_past = 199, n_tokens = 855, progress = 1.000000
- slot update_slots: id 32 | task 26767 | prompt done, n_past = 199, n_tokens = 855
- slot update_slots: id 47 | task 26768 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 47 | task 26768 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 47 | task 26768 | kv cache rm [0, end)
- slot update_slots: id 47 | task 26768 | prompt processing progress, n_past = 199, n_tokens = 1054, progress = 1.000000
- slot update_slots: id 47 | task 26768 | prompt done, n_past = 199, n_tokens = 1054
- srv cancel_tasks: cancel task, id_task = 27525
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 46 | task 26654 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 46 | task 26654 |
- prompt eval time = 1275.27 ms / 199 tokens ( 6.41 ms per token, 156.05 tokens per second)
- eval time = 25284.77 ms / 132 tokens ( 191.55 ms per token, 5.22 tokens per second)
- total time = 26560.04 ms / 331 tokens
- srv params_from_: Chat format: Content-only
- slot release: id 12 | task 27525 | stop processing: n_past = 213, truncated = 1
- slot launch_slot_: id 46 | task 26775 | processing task
- slot launch_slot_: id 12 | task 27555 | processing task
- slot update_slots: id 12 | task 27555 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 12 | task 27555 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 12 | task 27555 | need to evaluate at least 1 token to generate logits, n_past = 199, n_prompt_tokens = 199
- slot update_slots: id 12 | task 27555 | kv cache rm [198, end)
- slot update_slots: id 12 | task 27555 | prompt processing progress, n_past = 199, n_tokens = 63, progress = 0.005025
- slot update_slots: id 12 | task 27555 | prompt done, n_past = 199, n_tokens = 63
- slot update_slots: id 46 | task 26775 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 46 | task 26775 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 46 | task 26775 | kv cache rm [0, end)
- slot update_slots: id 46 | task 26775 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 46 | task 26775 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 53 | task 26683 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 53 | task 26683 |
- prompt eval time = 440.08 ms / 199 tokens ( 2.21 ms per token, 452.19 tokens per second)
- eval time = 23202.50 ms / 123 tokens ( 188.64 ms per token, 5.30 tokens per second)
- total time = 23642.58 ms / 322 tokens
- slot launch_slot_: id 53 | task 26776 | processing task
- slot update_slots: id 53 | task 26776 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 53 | task 26776 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 53 | task 26776 | kv cache rm [0, end)
- slot update_slots: id 53 | task 26776 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 53 | task 26776 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27532
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27529
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- slot release: id 15 | task 27529 | stop processing: n_past = 213, truncated = 1
- slot release: id 2 | task 27532 | stop processing: n_past = 211, truncated = 1
- slot launch_slot_: id 15 | task 27560 | processing task
- slot launch_slot_: id 2 | task 27561 | processing task
- slot update_slots: id 2 | task 27561 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 2 | task 27561 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 2 | task 27561 | need to evaluate at least 1 token to generate logits, n_past = 199, n_prompt_tokens = 199
- slot update_slots: id 2 | task 27561 | kv cache rm [198, end)
- slot update_slots: id 2 | task 27561 | prompt processing progress, n_past = 199, n_tokens = 63, progress = 0.005025
- slot update_slots: id 2 | task 27561 | prompt done, n_past = 199, n_tokens = 63
- slot update_slots: id 15 | task 27560 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 15 | task 27560 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 15 | task 27560 | need to evaluate at least 1 token to generate logits, n_past = 199, n_prompt_tokens = 199
- slot update_slots: id 15 | task 27560 | kv cache rm [198, end)
- slot update_slots: id 15 | task 27560 | prompt processing progress, n_past = 199, n_tokens = 64, progress = 0.005025
- slot update_slots: id 15 | task 27560 | prompt done, n_past = 199, n_tokens = 64
- slot update_slots: id 59 | task 26726 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 59 | task 26726 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 59 | task 26726 |
- prompt eval time = 160.42 ms / 199 tokens ( 0.81 ms per token, 1240.49 tokens per second)
- eval time = 11730.88 ms / 60 tokens ( 195.51 ms per token, 5.11 tokens per second)
- total time = 11891.30 ms / 259 tokens
- slot launch_slot_: id 59 | task 26787 | processing task
- slot update_slots: id 59 | task 26787 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 59 | task 26787 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 59 | task 26787 | kv cache rm [0, end)
- slot update_slots: id 59 | task 26787 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 59 | task 26787 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 42 | task 26727 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27539
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 42 | task 26727 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 42 | task 26727 |
- prompt eval time = 156.80 ms / 199 tokens ( 0.79 ms per token, 1269.17 tokens per second)
- eval time = 11525.93 ms / 60 tokens ( 192.10 ms per token, 5.21 tokens per second)
- total time = 11682.73 ms / 259 tokens
- slot release: id 33 | task 27539 | stop processing: n_past = 217, truncated = 1
- slot launch_slot_: id 42 | task 26789 | processing task
- slot launch_slot_: id 33 | task 26790 | processing task
- slot update_slots: id 33 | task 26790 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 33 | task 26790 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 33 | task 26790 | need to evaluate at least 1 token to generate logits, n_past = 199, n_prompt_tokens = 199
- slot update_slots: id 33 | task 26790 | kv cache rm [198, end)
- slot update_slots: id 33 | task 26790 | prompt processing progress, n_past = 199, n_tokens = 63, progress = 0.005025
- slot update_slots: id 33 | task 26790 | prompt done, n_past = 199, n_tokens = 63
- slot update_slots: id 42 | task 26789 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 42 | task 26789 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 42 | task 26789 | kv cache rm [0, end)
- slot update_slots: id 42 | task 26789 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 42 | task 26789 | prompt done, n_past = 199, n_tokens = 262
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27542
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 27 | task 27542 | stop processing: n_past = 223, truncated = 1
- slot launch_slot_: id 27 | task 26791 | processing task
- slot update_slots: id 27 | task 26791 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 27 | task 26791 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 27 | task 26791 | need to evaluate at least 1 token to generate logits, n_past = 199, n_prompt_tokens = 199
- slot update_slots: id 27 | task 26791 | kv cache rm [198, end)
- slot update_slots: id 27 | task 26791 | prompt processing progress, n_past = 199, n_tokens = 64, progress = 0.005025
- slot update_slots: id 27 | task 26791 | prompt done, n_past = 199, n_tokens = 64
- srv params_from_: Chat format: Content-only
- slot update_slots: id 4 | task 26573 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 4 | task 26573 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 4 | task 26573 |
- prompt eval time = 160.19 ms / 199 tokens ( 0.80 ms per token, 1242.24 tokens per second)
- eval time = 11820.64 ms / 60 tokens ( 197.01 ms per token, 5.08 tokens per second)
- total time = 11980.84 ms / 259 tokens
- srv cancel_tasks: cancel task, id_task = 27560
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27555
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 12 | task 27555 | stop processing: n_past = 221, truncated = 1
- slot release: id 15 | task 27560 | stop processing: n_past = 219, truncated = 1
- slot launch_slot_: id 4 | task 26792 | processing task
- slot launch_slot_: id 12 | task 26793 | processing task
- slot launch_slot_: id 15 | task 26795 | processing task
- slot update_slots: id 7 | task 26728 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 39 | task 26729 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 4 | task 26792 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 4 | task 26792 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 4 | task 26792 | kv cache rm [0, end)
- slot update_slots: id 4 | task 26792 | prompt processing progress, n_past = 199, n_tokens = 260, progress = 1.000000
- slot update_slots: id 4 | task 26792 | prompt done, n_past = 199, n_tokens = 260
- slot update_slots: id 12 | task 26793 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 12 | task 26793 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 12 | task 26793 | need to evaluate at least 1 token to generate logits, n_past = 199, n_prompt_tokens = 199
- slot update_slots: id 12 | task 26793 | kv cache rm [198, end)
- slot update_slots: id 12 | task 26793 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 0.005025
- slot update_slots: id 12 | task 26793 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 15 | task 26795 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 15 | task 26795 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 15 | task 26795 | need to evaluate at least 1 token to generate logits, n_past = 199, n_prompt_tokens = 199
- slot update_slots: id 15 | task 26795 | kv cache rm [198, end)
- slot update_slots: id 15 | task 26795 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 0.005025
- slot update_slots: id 15 | task 26795 | prompt done, n_past = 199, n_tokens = 262
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- slot update_slots: id 11 | task 26732 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 7 | task 26728 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 7 | task 26728 |
- prompt eval time = 238.69 ms / 199 tokens ( 1.20 ms per token, 833.71 tokens per second)
- eval time = 11545.66 ms / 59 tokens ( 195.69 ms per token, 5.11 tokens per second)
- total time = 11784.35 ms / 258 tokens
- slot release: id 39 | task 26729 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 39 | task 26729 |
- prompt eval time = 243.93 ms / 199 tokens ( 1.23 ms per token, 815.81 tokens per second)
- eval time = 11543.04 ms / 59 tokens ( 195.64 ms per token, 5.11 tokens per second)
- total time = 11786.97 ms / 258 tokens
- slot launch_slot_: id 7 | task 26802 | processing task
- slot launch_slot_: id 39 | task 26805 | processing task
- slot update_slots: id 8 | task 26734 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 28 | task 26735 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 7 | task 26802 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 7 | task 26802 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 7 | task 26802 | kv cache rm [0, end)
- slot update_slots: id 7 | task 26802 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 7 | task 26802 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 39 | task 26805 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 39 | task 26805 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 39 | task 26805 | kv cache rm [0, end)
- slot update_slots: id 39 | task 26805 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 39 | task 26805 | prompt done, n_past = 199, n_tokens = 460
- srv cancel_tasks: cancel task, id_task = 27561
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 2 | task 27561 | stop processing: n_past = 222, truncated = 1
- srv params_from_: Chat format: Content-only
- slot launch_slot_: id 2 | task 26807 | processing task
- slot update_slots: id 24 | task 26621 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 36 | task 26736 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 2 | task 26807 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 2 | task 26807 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 2 | task 26807 | need to evaluate at least 1 token to generate logits, n_past = 199, n_prompt_tokens = 199
- slot update_slots: id 2 | task 26807 | kv cache rm [198, end)
- slot update_slots: id 2 | task 26807 | prompt processing progress, n_past = 199, n_tokens = 64, progress = 0.005025
- slot update_slots: id 2 | task 26807 | prompt done, n_past = 199, n_tokens = 64
- slot release: id 11 | task 26732 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 11 | task 26732 |
- prompt eval time = 321.99 ms / 199 tokens ( 1.62 ms per token, 618.04 tokens per second)
- eval time = 11707.64 ms / 60 tokens ( 195.13 ms per token, 5.12 tokens per second)
- total time = 12029.62 ms / 259 tokens
- slot launch_slot_: id 11 | task 26813 | processing task
- slot update_slots: id 56 | task 26737 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 11 | task 26813 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 11 | task 26813 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 11 | task 26813 | kv cache rm [0, end)
- slot update_slots: id 11 | task 26813 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 11 | task 26813 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 34 | task 26708 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 34 | task 26708 |
- prompt eval time = 777.04 ms / 199 tokens ( 3.90 ms per token, 256.10 tokens per second)
- eval time = 19166.72 ms / 101 tokens ( 189.77 ms per token, 5.27 tokens per second)
- total time = 19943.76 ms / 300 tokens
- slot launch_slot_: id 34 | task 26816 | processing task
- slot update_slots: id 34 | task 26816 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 34 | task 26816 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 34 | task 26816 | kv cache rm [0, end)
- slot update_slots: id 34 | task 26816 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 34 | task 26816 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 31 | task 26709 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 31 | task 26709 |
- prompt eval time = 793.57 ms / 199 tokens ( 3.99 ms per token, 250.77 tokens per second)
- eval time = 18526.44 ms / 101 tokens ( 183.43 ms per token, 5.45 tokens per second)
- total time = 19320.01 ms / 300 tokens
- slot release: id 36 | task 26736 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 36 | task 26736 |
- prompt eval time = 323.19 ms / 199 tokens ( 1.62 ms per token, 615.74 tokens per second)
- eval time = 11726.21 ms / 60 tokens ( 195.44 ms per token, 5.12 tokens per second)
- total time = 12049.40 ms / 259 tokens
- slot launch_slot_: id 31 | task 26818 | processing task
- slot launch_slot_: id 36 | task 26819 | processing task
- slot update_slots: id 31 | task 26818 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 31 | task 26818 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 31 | task 26818 | kv cache rm [0, end)
- slot update_slots: id 31 | task 26818 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 31 | task 26818 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 36 | task 26819 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 36 | task 26819 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 36 | task 26819 | kv cache rm [0, end)
- slot update_slots: id 36 | task 26819 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 36 | task 26819 | prompt done, n_past = 199, n_tokens = 460
- srv cancel_tasks: cancel task, id_task = 27248
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27358
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27348
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27262
- srv cancel_tasks: cancel task, id_task = 27548
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27356
- srv cancel_tasks: cancel task, id_task = 27266
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27213
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27201
- srv cancel_tasks: cancel task, id_task = 27264
- srv cancel_tasks: cancel task, id_task = 27215
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27345
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27250
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27265
- srv cancel_tasks: cancel task, id_task = 27451
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27224
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27377
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27452
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27232
- srv cancel_tasks: cancel task, id_task = 27355
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27206
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27251
- srv params_from_: Chat format: Content-only
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27365
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27249
- srv params_from_: Chat format: Content-only
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27234
- srv cancel_tasks: cancel task, id_task = 27573
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27261
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27230
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27218
- srv cancel_tasks: cancel task, id_task = 27360
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27543
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27376
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27208
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27357
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27453
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27367
- srv cancel_tasks: cancel task, id_task = 27214
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27535
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27223
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27368
- srv cancel_tasks: cancel task, id_task = 27202
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27510
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27581
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27591
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27359
- srv params_from_: Chat format: Content-only
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27216
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27238
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27346
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27533
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27375
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27236
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27447
- srv cancel_tasks: cancel task, id_task = 27343
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27253
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27344
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27240
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27378
- srv cancel_tasks: cancel task, id_task = 27226
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27263
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27370
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27207
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27366
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27225
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27247
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- slot update_slots: id 14 | task 26627 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 10 | task 26722 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 10 | task 26722 |
- prompt eval time = 503.29 ms / 199 tokens ( 2.53 ms per token, 395.40 tokens per second)
- eval time = 16565.51 ms / 101 tokens ( 164.01 ms per token, 6.10 tokens per second)
- total time = 17068.80 ms / 300 tokens
- slot launch_slot_: id 10 | task 26821 | processing task
- slot update_slots: id 10 | task 26821 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 10 | task 26821 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 10 | task 26821 | kv cache rm [0, end)
- slot update_slots: id 10 | task 26821 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 10 | task 26821 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27625
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27590
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27595
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- slot update_slots: id 9 | task 26740 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 26741 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 52 | task 26742 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 41 | task 26741 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 41 | task 26741 |
- prompt eval time = 544.01 ms / 199 tokens ( 2.73 ms per token, 365.80 tokens per second)
- eval time = 11285.63 ms / 59 tokens ( 191.28 ms per token, 5.23 tokens per second)
- total time = 11829.64 ms / 258 tokens
- slot launch_slot_: id 41 | task 26822 | processing task
- slot update_slots: id 13 | task 26744 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 18 | task 26745 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 29 | task 26746 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 50 | task 26747 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 26822 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 41 | task 26822 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 41 | task 26822 | kv cache rm [0, end)
- slot update_slots: id 41 | task 26822 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 41 | task 26822 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 9 | task 26740 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 9 | task 26740 |
- prompt eval time = 538.37 ms / 199 tokens ( 2.71 ms per token, 369.64 tokens per second)
- eval time = 11459.73 ms / 60 tokens ( 191.00 ms per token, 5.24 tokens per second)
- total time = 11998.10 ms / 259 tokens
- slot launch_slot_: id 9 | task 26823 | processing task
- slot update_slots: id 17 | task 26748 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 9 | task 26823 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 9 | task 26823 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 9 | task 26823 | kv cache rm [0, end)
- slot update_slots: id 9 | task 26823 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 9 | task 26823 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27627
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- slot update_slots: id 40 | task 26750 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 45 | task 26751 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 57 | task 26752 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 13 | task 26744 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 13 | task 26744 |
- prompt eval time = 633.49 ms / 199 tokens ( 3.18 ms per token, 314.13 tokens per second)
- eval time = 11182.17 ms / 60 tokens ( 186.37 ms per token, 5.37 tokens per second)
- total time = 11815.66 ms / 259 tokens
- slot launch_slot_: id 13 | task 26824 | processing task
- slot update_slots: id 5 | task 26753 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 63 | task 26754 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 13 | task 26824 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 13 | task 26824 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 13 | task 26824 | kv cache rm [0, end)
- slot update_slots: id 13 | task 26824 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 13 | task 26824 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 17 | task 26748 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 17 | task 26748 |
- prompt eval time = 431.34 ms / 199 tokens ( 2.17 ms per token, 461.35 tokens per second)
- eval time = 10907.48 ms / 60 tokens ( 181.79 ms per token, 5.50 tokens per second)
- total time = 11338.82 ms / 259 tokens
- slot launch_slot_: id 17 | task 26825 | processing task
- slot update_slots: id 21 | task 26756 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 58 | task 26757 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 17 | task 26825 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 17 | task 26825 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 17 | task 26825 | kv cache rm [0, end)
- slot update_slots: id 17 | task 26825 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 17 | task 26825 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 5 | task 26753 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 5 | task 26753 |
- prompt eval time = 524.55 ms / 199 tokens ( 2.64 ms per token, 379.37 tokens per second)
- eval time = 10184.43 ms / 59 tokens ( 172.62 ms per token, 5.79 tokens per second)
- total time = 10708.98 ms / 258 tokens
- slot release: id 40 | task 26750 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 40 | task 26750 |
- prompt eval time = 534.19 ms / 199 tokens ( 2.68 ms per token, 372.53 tokens per second)
- eval time = 10716.53 ms / 60 tokens ( 178.61 ms per token, 5.60 tokens per second)
- total time = 11250.72 ms / 259 tokens
- slot release: id 45 | task 26751 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 45 | task 26751 |
- prompt eval time = 534.53 ms / 199 tokens ( 2.69 ms per token, 372.29 tokens per second)
- eval time = 10716.60 ms / 60 tokens ( 178.61 ms per token, 5.60 tokens per second)
- total time = 11251.13 ms / 259 tokens
- slot launch_slot_: id 5 | task 26826 | processing task
- slot launch_slot_: id 40 | task 26828 | processing task
- slot launch_slot_: id 45 | task 26830 | processing task
- slot update_slots: id 5 | task 26826 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 5 | task 26826 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 5 | task 26826 | kv cache rm [0, end)
- slot update_slots: id 5 | task 26826 | prompt processing progress, n_past = 199, n_tokens = 260, progress = 1.000000
- slot update_slots: id 5 | task 26826 | prompt done, n_past = 199, n_tokens = 260
- slot update_slots: id 40 | task 26828 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 40 | task 26828 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 40 | task 26828 | kv cache rm [0, end)
- slot update_slots: id 40 | task 26828 | prompt processing progress, n_past = 199, n_tokens = 459, progress = 1.000000
- slot update_slots: id 40 | task 26828 | prompt done, n_past = 199, n_tokens = 459
- slot update_slots: id 45 | task 26830 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 45 | task 26830 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 45 | task 26830 | kv cache rm [0, end)
- slot update_slots: id 45 | task 26830 | prompt processing progress, n_past = 199, n_tokens = 658, progress = 1.000000
- slot update_slots: id 45 | task 26830 | prompt done, n_past = 199, n_tokens = 658
- srv cancel_tasks: cancel task, id_task = 27622
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27629
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- slot release: id 1 | task 26701 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 1 | task 26701 |
- prompt eval time = 726.81 ms / 199 tokens ( 3.65 ms per token, 273.80 tokens per second)
- eval time = 23519.38 ms / 123 tokens ( 191.21 ms per token, 5.23 tokens per second)
- total time = 24246.19 ms / 322 tokens
- slot release: id 58 | task 26757 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 58 | task 26757 |
- prompt eval time = 605.92 ms / 199 tokens ( 3.04 ms per token, 328.43 tokens per second)
- eval time = 9930.10 ms / 59 tokens ( 168.31 ms per token, 5.94 tokens per second)
- total time = 10536.02 ms / 258 tokens
- slot launch_slot_: id 1 | task 27757 | processing task
- slot launch_slot_: id 58 | task 27758 | processing task
- slot update_slots: id 30 | task 26629 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 60 | task 26758 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 27757 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 1 | task 27757 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 1 | task 27757 | kv cache rm [0, end)
- slot update_slots: id 1 | task 27757 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 1 | task 27757 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 58 | task 27758 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 58 | task 27758 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 58 | task 27758 | kv cache rm [0, end)
- slot update_slots: id 58 | task 27758 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 58 | task 27758 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 21 | task 26756 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 21 | task 26756 |
- prompt eval time = 599.74 ms / 199 tokens ( 3.01 ms per token, 331.81 tokens per second)
- eval time = 10383.83 ms / 60 tokens ( 173.06 ms per token, 5.78 tokens per second)
- total time = 10983.57 ms / 259 tokens
- slot launch_slot_: id 21 | task 26832 | processing task
- slot update_slots: id 6 | task 26605 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 21 | task 26832 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 21 | task 26832 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 21 | task 26832 | kv cache rm [0, end)
- slot update_slots: id 21 | task 26832 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 21 | task 26832 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 51 | task 26761 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 60 | task 26758 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 60 | task 26758 |
- prompt eval time = 247.13 ms / 199 tokens ( 1.24 ms per token, 805.23 tokens per second)
- eval time = 10366.95 ms / 60 tokens ( 172.78 ms per token, 5.79 tokens per second)
- total time = 10614.08 ms / 259 tokens
- slot launch_slot_: id 60 | task 26836 | processing task
- slot update_slots: id 43 | task 26653 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 60 | task 26836 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 60 | task 26836 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 60 | task 26836 | kv cache rm [0, end)
- slot update_slots: id 60 | task 26836 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 60 | task 26836 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 51 | task 26761 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 51 | task 26761 |
- prompt eval time = 59.29 ms / 1 tokens ( 59.29 ms per token, 16.87 tokens per second)
- eval time = 10358.93 ms / 60 tokens ( 172.65 ms per token, 5.79 tokens per second)
- total time = 10418.22 ms / 61 tokens
- slot launch_slot_: id 51 | task 26837 | processing task
- slot update_slots: id 51 | task 26837 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 51 | task 26837 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 51 | task 26837 | kv cache rm [0, end)
- slot update_slots: id 51 | task 26837 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 51 | task 26837 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 3 | task 26762 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 20 | task 26763 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 26 | task 26764 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 32 | task 26767 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 47 | task 26768 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 32 | task 26767 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 32 | task 26767 |
- prompt eval time = 541.59 ms / 199 tokens ( 2.72 ms per token, 367.44 tokens per second)
- eval time = 9906.24 ms / 59 tokens ( 167.90 ms per token, 5.96 tokens per second)
- total time = 10447.83 ms / 258 tokens
- slot launch_slot_: id 32 | task 26846 | processing task
- slot update_slots: id 19 | task 26483 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 22 | task 26488 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 23 | task 26535 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 25 | task 26512 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 35 | task 26522 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 37 | task 26484 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 46 | task 26775 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 32 | task 26846 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 32 | task 26846 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 32 | task 26846 | kv cache rm [0, end)
- slot update_slots: id 32 | task 26846 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 32 | task 26846 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 20 | task 26763 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 20 | task 26763 |
- prompt eval time = 539.61 ms / 199 tokens ( 2.71 ms per token, 368.78 tokens per second)
- eval time = 10380.00 ms / 60 tokens ( 173.00 ms per token, 5.78 tokens per second)
- total time = 10919.61 ms / 259 tokens
- slot release: id 26 | task 26764 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 26 | task 26764 |
- prompt eval time = 540.12 ms / 199 tokens ( 2.71 ms per token, 368.43 tokens per second)
- eval time = 10379.93 ms / 60 tokens ( 173.00 ms per token, 5.78 tokens per second)
- total time = 10920.06 ms / 259 tokens
- slot release: id 47 | task 26768 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 47 | task 26768 |
- prompt eval time = 544.09 ms / 199 tokens ( 2.73 ms per token, 365.75 tokens per second)
- eval time = 10377.65 ms / 60 tokens ( 172.96 ms per token, 5.78 tokens per second)
- total time = 10921.74 ms / 259 tokens
- slot launch_slot_: id 20 | task 26848 | processing task
- slot launch_slot_: id 26 | task 26849 | processing task
- slot launch_slot_: id 47 | task 26852 | processing task
- slot update_slots: id 48 | task 26498 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 53 | task 26776 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 54 | task 26525 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 20 | task 26848 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 20 | task 26848 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 20 | task 26848 | kv cache rm [0, end)
- slot update_slots: id 20 | task 26848 | prompt processing progress, n_past = 199, n_tokens = 260, progress = 1.000000
- slot update_slots: id 20 | task 26848 | prompt done, n_past = 199, n_tokens = 260
- slot update_slots: id 26 | task 26849 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 26 | task 26849 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 26 | task 26849 | kv cache rm [0, end)
- slot update_slots: id 26 | task 26849 | prompt processing progress, n_past = 199, n_tokens = 459, progress = 1.000000
- slot update_slots: id 26 | task 26849 | prompt done, n_past = 199, n_tokens = 459
- slot update_slots: id 47 | task 26852 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 47 | task 26852 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 47 | task 26852 | kv cache rm [0, end)
- slot update_slots: id 47 | task 26852 | prompt processing progress, n_past = 199, n_tokens = 658, progress = 1.000000
- slot update_slots: id 47 | task 26852 | prompt done, n_past = 199, n_tokens = 658
- slot update_slots: id 59 | task 26787 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 33 | task 26790 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 42 | task 26789 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 33 | task 26790 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 33 | task 26790 |
- prompt eval time = 161.81 ms / 1 tokens ( 161.81 ms per token, 6.18 tokens per second)
- eval time = 10230.82 ms / 60 tokens ( 170.51 ms per token, 5.86 tokens per second)
- total time = 10392.62 ms / 61 tokens
- slot launch_slot_: id 33 | task 26851 | processing task
- slot update_slots: id 33 | task 26851 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 33 | task 26851 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 33 | task 26851 | kv cache rm [0, end)
- slot update_slots: id 33 | task 26851 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 33 | task 26851 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 27 | task 26791 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 27 | task 26791 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 27 | task 26791 |
- prompt eval time = 56.54 ms / 1 tokens ( 56.54 ms per token, 17.69 tokens per second)
- eval time = 10255.18 ms / 59 tokens ( 173.82 ms per token, 5.75 tokens per second)
- total time = 10311.72 ms / 60 tokens
- slot launch_slot_: id 27 | task 26856 | processing task
- slot update_slots: id 27 | task 26856 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 27 | task 26856 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 27 | task 26856 | kv cache rm [0, end)
- slot update_slots: id 27 | task 26856 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 27 | task 26856 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 4 | task 26792 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 12 | task 26793 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 15 | task 26795 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 7 | task 26802 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 39 | task 26805 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 12 | task 26793 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 12 | task 26793 |
- prompt eval time = 165.31 ms / 1 tokens ( 165.31 ms per token, 6.05 tokens per second)
- eval time = 10135.12 ms / 60 tokens ( 168.92 ms per token, 5.92 tokens per second)
- total time = 10300.43 ms / 61 tokens
- slot launch_slot_: id 12 | task 26858 | processing task
- slot update_slots: id 2 | task 26807 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 12 | task 26858 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 12 | task 26858 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 12 | task 26858 | kv cache rm [0, end)
- slot update_slots: id 12 | task 26858 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 12 | task 26858 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 11 | task 26813 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 7 | task 26802 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 7 | task 26802 |
- prompt eval time = 248.14 ms / 199 tokens ( 1.25 ms per token, 801.97 tokens per second)
- eval time = 10207.34 ms / 60 tokens ( 170.12 ms per token, 5.88 tokens per second)
- total time = 10455.48 ms / 259 tokens
- slot release: id 39 | task 26805 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 39 | task 26805 |
- prompt eval time = 250.74 ms / 199 tokens ( 1.26 ms per token, 793.65 tokens per second)
- eval time = 10207.58 ms / 60 tokens ( 170.13 ms per token, 5.88 tokens per second)
- total time = 10458.32 ms / 259 tokens
- slot launch_slot_: id 7 | task 26860 | processing task
- slot launch_slot_: id 39 | task 26861 | processing task
- slot update_slots: id 34 | task 26816 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 7 | task 26860 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 7 | task 26860 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 7 | task 26860 | kv cache rm [0, end)
- slot update_slots: id 7 | task 26860 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 7 | task 26860 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 39 | task 26861 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 39 | task 26861 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 39 | task 26861 | kv cache rm [0, end)
- slot update_slots: id 39 | task 26861 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 39 | task 26861 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 2 | task 26807 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 2 | task 26807 |
- prompt eval time = 236.85 ms / 1 tokens ( 236.85 ms per token, 4.22 tokens per second)
- eval time = 10202.90 ms / 60 tokens ( 170.05 ms per token, 5.88 tokens per second)
- total time = 10439.76 ms / 61 tokens
- slot launch_slot_: id 2 | task 26864 | processing task
- slot update_slots: id 31 | task 26818 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 36 | task 26819 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 2 | task 26864 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 2 | task 26864 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 2 | task 26864 | kv cache rm [0, end)
- slot update_slots: id 2 | task 26864 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 2 | task 26864 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 29 | task 26746 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 29 | task 26746 |
- prompt eval time = 636.64 ms / 199 tokens ( 3.20 ms per token, 312.58 tokens per second)
- eval time = 18418.51 ms / 101 tokens ( 182.36 ms per token, 5.48 tokens per second)
- total time = 19055.15 ms / 300 tokens
- slot release: id 50 | task 26747 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 50 | task 26747 |
- prompt eval time = 639.59 ms / 199 tokens ( 3.21 ms per token, 311.14 tokens per second)
- eval time = 18417.18 ms / 101 tokens ( 182.35 ms per token, 5.48 tokens per second)
- total time = 19056.77 ms / 300 tokens
- slot launch_slot_: id 29 | task 26865 | processing task
- slot launch_slot_: id 50 | task 26866 | processing task
- slot update_slots: id 29 | task 26865 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 29 | task 26865 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 29 | task 26865 | kv cache rm [0, end)
- slot update_slots: id 29 | task 26865 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 29 | task 26865 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 50 | task 26866 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 50 | task 26866 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 50 | task 26866 | kv cache rm [0, end)
- slot update_slots: id 50 | task 26866 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 50 | task 26866 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 34 | task 26816 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 34 | task 26816 |
- prompt eval time = 159.12 ms / 199 tokens ( 0.80 ms per token, 1250.60 tokens per second)
- eval time = 10167.10 ms / 60 tokens ( 169.45 ms per token, 5.90 tokens per second)
- total time = 10326.22 ms / 259 tokens
- slot launch_slot_: id 34 | task 26867 | processing task
- slot update_slots: id 34 | task 26867 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 34 | task 26867 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 34 | task 26867 | kv cache rm [0, end)
- slot update_slots: id 34 | task 26867 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 34 | task 26867 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 31 | task 26818 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 31 | task 26818 |
- prompt eval time = 249.69 ms / 199 tokens ( 1.25 ms per token, 796.99 tokens per second)
- eval time = 10241.27 ms / 60 tokens ( 170.69 ms per token, 5.86 tokens per second)
- total time = 10490.96 ms / 259 tokens
- slot release: id 36 | task 26819 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 36 | task 26819 |
- prompt eval time = 250.02 ms / 199 tokens ( 1.26 ms per token, 795.93 tokens per second)
- eval time = 10241.29 ms / 60 tokens ( 170.69 ms per token, 5.86 tokens per second)
- total time = 10491.31 ms / 259 tokens
- slot launch_slot_: id 31 | task 26868 | processing task
- slot launch_slot_: id 36 | task 26869 | processing task
- slot update_slots: id 55 | task 26686 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 31 | task 26868 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 31 | task 26868 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 31 | task 26868 | kv cache rm [0, end)
- slot update_slots: id 31 | task 26868 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 31 | task 26868 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 36 | task 26869 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 36 | task 26869 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 36 | task 26869 | kv cache rm [0, end)
- slot update_slots: id 36 | task 26869 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 36 | task 26869 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 63 | task 26754 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 63 | task 26754 |
- prompt eval time = 536.53 ms / 199 tokens ( 2.70 ms per token, 370.90 tokens per second)
- eval time = 17874.61 ms / 101 tokens ( 176.98 ms per token, 5.65 tokens per second)
- total time = 18411.14 ms / 300 tokens
- slot launch_slot_: id 63 | task 26871 | processing task
- slot update_slots: id 63 | task 26871 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 63 | task 26871 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 63 | task 26871 | kv cache rm [0, end)
- slot update_slots: id 63 | task 26871 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 63 | task 26871 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 28 | task 26735 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 28 | task 26735 |
- prompt eval time = 244.64 ms / 199 tokens ( 1.23 ms per token, 813.44 tokens per second)
- eval time = 23122.17 ms / 123 tokens ( 187.99 ms per token, 5.32 tokens per second)
- total time = 23366.81 ms / 322 tokens
- slot launch_slot_: id 28 | task 26872 | processing task
- slot update_slots: id 28 | task 26872 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 28 | task 26872 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 28 | task 26872 | kv cache rm [0, end)
- slot update_slots: id 28 | task 26872 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 28 | task 26872 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 56 | task 26737 | stop processing: n_past = 195, truncated = 1
- slot print_timing: id 56 | task 26737 |
- prompt eval time = 252.44 ms / 199 tokens ( 1.27 ms per token, 788.32 tokens per second)
- eval time = 22977.14 ms / 124 tokens ( 185.30 ms per token, 5.40 tokens per second)
- total time = 23229.58 ms / 323 tokens
- slot launch_slot_: id 56 | task 26873 | processing task
- slot update_slots: id 56 | task 26873 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 56 | task 26873 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 56 | task 26873 | kv cache rm [0, end)
- slot update_slots: id 56 | task 26873 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 56 | task 26873 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 10 | task 26821 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 8 | task 26734 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 8 | task 26734 |
- prompt eval time = 241.38 ms / 199 tokens ( 1.21 ms per token, 824.44 tokens per second)
- eval time = 24013.20 ms / 132 tokens ( 181.92 ms per token, 5.50 tokens per second)
- total time = 24254.58 ms / 331 tokens
- slot launch_slot_: id 8 | task 26875 | processing task
- slot update_slots: id 41 | task 26822 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 8 | task 26875 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 8 | task 26875 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 8 | task 26875 | kv cache rm [0, end)
- slot update_slots: id 8 | task 26875 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 8 | task 26875 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 9 | task 26823 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 41 | task 26822 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 41 | task 26822 |
- prompt eval time = 168.95 ms / 199 tokens ( 0.85 ms per token, 1177.84 tokens per second)
- eval time = 10599.87 ms / 60 tokens ( 176.66 ms per token, 5.66 tokens per second)
- total time = 10768.82 ms / 259 tokens
- slot launch_slot_: id 41 | task 26876 | processing task
- slot update_slots: id 13 | task 26824 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 26876 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 41 | task 26876 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 41 | task 26876 | kv cache rm [0, end)
- slot update_slots: id 41 | task 26876 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 41 | task 26876 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 17 | task 26825 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 5 | task 26826 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 40 | task 26828 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 45 | task 26830 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 13 | task 26824 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 13 | task 26824 |
- prompt eval time = 161.46 ms / 199 tokens ( 0.81 ms per token, 1232.51 tokens per second)
- eval time = 10181.48 ms / 60 tokens ( 169.69 ms per token, 5.89 tokens per second)
- total time = 10342.94 ms / 259 tokens
- slot launch_slot_: id 13 | task 26877 | processing task
- slot update_slots: id 1 | task 27757 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 58 | task 27758 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 13 | task 26877 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 13 | task 26877 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 13 | task 26877 | kv cache rm [0, end)
- slot update_slots: id 13 | task 26877 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 13 | task 26877 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 17 | task 26825 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 17 | task 26825 |
- prompt eval time = 346.39 ms / 199 tokens ( 1.74 ms per token, 574.50 tokens per second)
- eval time = 10169.69 ms / 60 tokens ( 169.49 ms per token, 5.90 tokens per second)
- total time = 10516.08 ms / 259 tokens
- slot release: id 45 | task 26830 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 45 | task 26830 |
- prompt eval time = 357.28 ms / 199 tokens ( 1.80 ms per token, 556.98 tokens per second)
- eval time = 9810.61 ms / 59 tokens ( 166.28 ms per token, 6.01 tokens per second)
- total time = 10167.89 ms / 258 tokens
- slot launch_slot_: id 17 | task 26878 | processing task
- slot launch_slot_: id 45 | task 26879 | processing task
- slot update_slots: id 21 | task 26832 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 17 | task 26878 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 17 | task 26878 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 17 | task 26878 | kv cache rm [0, end)
- slot update_slots: id 17 | task 26878 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 17 | task 26878 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 45 | task 26879 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 45 | task 26879 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 45 | task 26879 | kv cache rm [0, end)
- slot update_slots: id 45 | task 26879 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 45 | task 26879 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 5 | task 26826 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 5 | task 26826 |
- prompt eval time = 354.16 ms / 199 tokens ( 1.78 ms per token, 561.89 tokens per second)
- eval time = 10224.60 ms / 60 tokens ( 170.41 ms per token, 5.87 tokens per second)
- total time = 10578.77 ms / 259 tokens
- slot release: id 40 | task 26828 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 40 | task 26828 |
- prompt eval time = 356.95 ms / 199 tokens ( 1.79 ms per token, 557.50 tokens per second)
- eval time = 10224.69 ms / 60 tokens ( 170.41 ms per token, 5.87 tokens per second)
- total time = 10581.64 ms / 259 tokens
- slot launch_slot_: id 5 | task 26880 | processing task
- slot launch_slot_: id 40 | task 26882 | processing task
- slot update_slots: id 5 | task 26880 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 5 | task 26880 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 5 | task 26880 | kv cache rm [0, end)
- slot update_slots: id 5 | task 26880 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 5 | task 26880 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 40 | task 26882 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 40 | task 26882 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 40 | task 26882 | kv cache rm [0, end)
- slot update_slots: id 40 | task 26882 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 40 | task 26882 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 1 | task 27757 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 1 | task 27757 |
- prompt eval time = 443.84 ms / 199 tokens ( 2.23 ms per token, 448.36 tokens per second)
- eval time = 10318.12 ms / 60 tokens ( 171.97 ms per token, 5.82 tokens per second)
- total time = 10761.96 ms / 259 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 18 | task 26745 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 18 | task 26745 |
- prompt eval time = 634.00 ms / 199 tokens ( 3.19 ms per token, 313.88 tokens per second)
- eval time = 22825.55 ms / 123 tokens ( 185.57 ms per token, 5.39 tokens per second)
- total time = 23459.55 ms / 322 tokens
- srv params_from_: Chat format: Content-only
- slot launch_slot_: id 1 | task 26884 | processing task
- slot launch_slot_: id 18 | task 26885 | processing task
- slot update_slots: id 60 | task 26836 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 26884 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 1 | task 26884 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 1 | task 26884 | kv cache rm [0, end)
- slot update_slots: id 1 | task 26884 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 1 | task 26884 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 18 | task 26885 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 18 | task 26885 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 18 | task 26885 | kv cache rm [0, end)
- slot update_slots: id 18 | task 26885 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 18 | task 26885 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 21 | task 26832 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 21 | task 26832 |
- prompt eval time = 345.33 ms / 199 tokens ( 1.74 ms per token, 576.25 tokens per second)
- eval time = 10431.18 ms / 60 tokens ( 173.85 ms per token, 5.75 tokens per second)
- total time = 10776.51 ms / 259 tokens
- slot launch_slot_: id 21 | task 26886 | processing task
- slot update_slots: id 0 | task 26700 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 21 | task 26886 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 21 | task 26886 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 21 | task 26886 | kv cache rm [0, end)
- slot update_slots: id 21 | task 26886 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 21 | task 26886 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 51 | task 26837 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 42 | task 26789 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 42 | task 26789 |
- prompt eval time = 162.49 ms / 199 tokens ( 0.82 ms per token, 1224.70 tokens per second)
- eval time = 17704.47 ms / 101 tokens ( 175.29 ms per token, 5.70 tokens per second)
- total time = 17866.96 ms / 300 tokens
- slot release: id 60 | task 26836 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 60 | task 26836 |
- prompt eval time = 352.17 ms / 199 tokens ( 1.77 ms per token, 565.06 tokens per second)
- eval time = 10433.90 ms / 60 tokens ( 173.90 ms per token, 5.75 tokens per second)
- total time = 10786.07 ms / 259 tokens
- slot launch_slot_: id 42 | task 26888 | processing task
- slot launch_slot_: id 60 | task 26889 | processing task
- slot update_slots: id 42 | task 26888 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 42 | task 26888 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 42 | task 26888 | kv cache rm [0, end)
- slot update_slots: id 42 | task 26888 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 42 | task 26888 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 60 | task 26889 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 60 | task 26889 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 60 | task 26889 | kv cache rm [0, end)
- slot update_slots: id 60 | task 26889 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 60 | task 26889 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 51 | task 26837 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 51 | task 26837 |
- prompt eval time = 159.91 ms / 199 tokens ( 0.80 ms per token, 1244.42 tokens per second)
- eval time = 10455.91 ms / 59 tokens ( 177.22 ms per token, 5.64 tokens per second)
- total time = 10615.83 ms / 258 tokens
- slot launch_slot_: id 51 | task 26891 | processing task
- slot update_slots: id 49 | task 26713 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 61 | task 26715 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 51 | task 26891 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 51 | task 26891 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 51 | task 26891 | kv cache rm [0, end)
- slot update_slots: id 51 | task 26891 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 51 | task 26891 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 32 | task 26846 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 20 | task 26848 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 26 | task 26849 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 47 | task 26852 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 26 | task 26849 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 26 | task 26849 |
- prompt eval time = 573.56 ms / 199 tokens ( 2.88 ms per token, 346.96 tokens per second)
- eval time = 9925.78 ms / 60 tokens ( 165.43 ms per token, 6.04 tokens per second)
- total time = 10499.34 ms / 259 tokens
- slot launch_slot_: id 26 | task 26892 | processing task
- slot update_slots: id 16 | task 26582 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 26 | task 26892 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 26 | task 26892 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 26 | task 26892 | kv cache rm [0, end)
- slot update_slots: id 26 | task 26892 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 26 | task 26892 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 38 | task 26598 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 44 | task 26608 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 62 | task 26614 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 3 | task 26762 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 3 | task 26762 |
- prompt eval time = 536.72 ms / 199 tokens ( 2.70 ms per token, 370.77 tokens per second)
- eval time = 21336.96 ms / 123 tokens ( 173.47 ms per token, 5.76 tokens per second)
- total time = 21873.69 ms / 322 tokens
- slot launch_slot_: id 3 | task 26893 | processing task
- slot update_slots: id 3 | task 26893 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 3 | task 26893 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 3 | task 26893 | kv cache rm [0, end)
- slot update_slots: id 3 | task 26893 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 3 | task 26893 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 33 | task 26851 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 59 | task 26787 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 59 | task 26787 |
- prompt eval time = 161.62 ms / 199 tokens ( 0.81 ms per token, 1231.28 tokens per second)
- eval time = 20404.29 ms / 123 tokens ( 165.89 ms per token, 6.03 tokens per second)
- total time = 20565.91 ms / 322 tokens
- slot launch_slot_: id 59 | task 26894 | processing task
- slot update_slots: id 59 | task 26894 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 59 | task 26894 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 59 | task 26894 | kv cache rm [0, end)
- slot update_slots: id 59 | task 26894 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 59 | task 26894 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 46 | task 26775 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 46 | task 26775 |
- prompt eval time = 514.00 ms / 199 tokens ( 2.58 ms per token, 387.16 tokens per second)
- eval time = 21401.80 ms / 132 tokens ( 162.13 ms per token, 6.17 tokens per second)
- total time = 21915.80 ms / 331 tokens
- slot launch_slot_: id 46 | task 26895 | processing task
- slot update_slots: id 46 | task 26895 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 46 | task 26895 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 46 | task 26895 | kv cache rm [0, end)
- slot update_slots: id 46 | task 26895 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 46 | task 26895 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 27 | task 26856 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 27 | task 26856 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 27 | task 26856 |
- prompt eval time = 153.74 ms / 199 tokens ( 0.77 ms per token, 1294.42 tokens per second)
- eval time = 9996.21 ms / 60 tokens ( 166.60 ms per token, 6.00 tokens per second)
- total time = 10149.95 ms / 259 tokens
- slot launch_slot_: id 27 | task 26896 | processing task
- slot update_slots: id 27 | task 26896 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 27 | task 26896 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 27 | task 26896 | kv cache rm [0, end)
- slot update_slots: id 27 | task 26896 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 27 | task 26896 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 12 | task 26858 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 7 | task 26860 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 39 | task 26861 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 12 | task 26858 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 12 | task 26858 |
- prompt eval time = 323.25 ms / 199 tokens ( 1.62 ms per token, 615.62 tokens per second)
- eval time = 9654.51 ms / 60 tokens ( 160.91 ms per token, 6.21 tokens per second)
- total time = 9977.77 ms / 259 tokens
- slot launch_slot_: id 12 | task 26897 | processing task
- slot update_slots: id 2 | task 26864 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 12 | task 26897 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 12 | task 26897 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 12 | task 26897 | kv cache rm [0, end)
- slot update_slots: id 12 | task 26897 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 12 | task 26897 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 29 | task 26865 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 50 | task 26866 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 7 | task 26860 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 7 | task 26860 |
- prompt eval time = 231.43 ms / 199 tokens ( 1.16 ms per token, 859.86 tokens per second)
- eval time = 9579.62 ms / 60 tokens ( 159.66 ms per token, 6.26 tokens per second)
- total time = 9811.05 ms / 259 tokens
- slot launch_slot_: id 7 | task 26898 | processing task
- slot update_slots: id 34 | task 26867 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 7 | task 26898 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 7 | task 26898 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 7 | task 26898 | kv cache rm [0, end)
- slot update_slots: id 7 | task 26898 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 7 | task 26898 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 31 | task 26868 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 36 | task 26869 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 63 | task 26871 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 34 | task 26867 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 34 | task 26867 |
- prompt eval time = 323.07 ms / 199 tokens ( 1.62 ms per token, 615.96 tokens per second)
- eval time = 9300.53 ms / 60 tokens ( 155.01 ms per token, 6.45 tokens per second)
- total time = 9623.60 ms / 259 tokens
- slot launch_slot_: id 34 | task 26899 | processing task
- slot update_slots: id 28 | task 26872 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 34 | task 26899 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 34 | task 26899 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 34 | task 26899 | kv cache rm [0, end)
- slot update_slots: id 34 | task 26899 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 34 | task 26899 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 28 | task 26872 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 28 | task 26872 |
- prompt eval time = 155.70 ms / 199 tokens ( 0.78 ms per token, 1278.07 tokens per second)
- eval time = 9048.62 ms / 59 tokens ( 153.37 ms per token, 6.52 tokens per second)
- total time = 9204.32 ms / 258 tokens
- slot release: id 58 | task 27758 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 58 | task 27758 |
- prompt eval time = 449.96 ms / 199 tokens ( 2.26 ms per token, 442.26 tokens per second)
- eval time = 16426.86 ms / 101 tokens ( 162.64 ms per token, 6.15 tokens per second)
- total time = 16876.83 ms / 300 tokens
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 63 | task 26871 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 63 | task 26871 |
- prompt eval time = 331.37 ms / 199 tokens ( 1.67 ms per token, 600.53 tokens per second)
- eval time = 9208.61 ms / 60 tokens ( 153.48 ms per token, 6.52 tokens per second)
- total time = 9539.98 ms / 259 tokens
- slot launch_slot_: id 28 | task 26900 | processing task
- slot launch_slot_: id 58 | task 26901 | processing task
- slot launch_slot_: id 63 | task 26902 | processing task
- slot update_slots: id 28 | task 26900 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 28 | task 26900 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 28 | task 26900 | kv cache rm [0, end)
- slot update_slots: id 28 | task 26900 | prompt processing progress, n_past = 199, n_tokens = 260, progress = 1.000000
- slot update_slots: id 28 | task 26900 | prompt done, n_past = 199, n_tokens = 260
- slot update_slots: id 58 | task 26901 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 58 | task 26901 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 58 | task 26901 | kv cache rm [0, end)
- slot update_slots: id 58 | task 26901 | prompt processing progress, n_past = 199, n_tokens = 459, progress = 1.000000
- slot update_slots: id 58 | task 26901 | prompt done, n_past = 199, n_tokens = 459
- slot update_slots: id 63 | task 26902 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 63 | task 26902 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 63 | task 26902 | kv cache rm [0, end)
- slot update_slots: id 63 | task 26902 | prompt processing progress, n_past = 199, n_tokens = 658, progress = 1.000000
- slot update_slots: id 63 | task 26902 | prompt done, n_past = 199, n_tokens = 658
- srv params_from_: Chat format: Content-only
- slot update_slots: id 56 | task 26873 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 56 | task 26873 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 56 | task 26873 |
- prompt eval time = 157.81 ms / 199 tokens ( 0.79 ms per token, 1260.99 tokens per second)
- eval time = 9319.93 ms / 59 tokens ( 157.96 ms per token, 6.33 tokens per second)
- total time = 9477.75 ms / 258 tokens
- slot launch_slot_: id 56 | task 26907 | processing task
- slot update_slots: id 24 | task 26621 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 56 | task 26907 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 56 | task 26907 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 56 | task 26907 | kv cache rm [0, end)
- slot update_slots: id 56 | task 26907 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 56 | task 26907 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27710
- srv cancel_tasks: cancel task, id_task = 27626
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27704
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27708
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27643
- srv cancel_tasks: cancel task, id_task = 27623
- srv cancel_tasks: cancel task, id_task = 27660
- srv cancel_tasks: cancel task, id_task = 27675
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27687
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27646
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27711
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27702
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27621
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27691
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27624
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27658
- srv cancel_tasks: cancel task, id_task = 27652
- srv cancel_tasks: cancel task, id_task = 27695
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27632
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27620
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27686
- srv cancel_tasks: cancel task, id_task = 27697
- srv cancel_tasks: cancel task, id_task = 27724
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27653
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27667
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27705
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27706
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27641
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27713
- srv cancel_tasks: cancel task, id_task = 27720
- srv cancel_tasks: cancel task, id_task = 27699
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27618
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27680
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27664
- srv cancel_tasks: cancel task, id_task = 27672
- srv cancel_tasks: cancel task, id_task = 27665
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27656
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27619
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27690
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27703
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27709
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27716
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27654
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv cancel_tasks: cancel task, id_task = 27670
- srv cancel_tasks: cancel task, id_task = 27669
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27688
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27678
- srv cancel_tasks: cancel task, id_task = 27657
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- srv params_from_: Chat format: Content-only
- slot update_slots: id 8 | task 26875 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 14 | task 26627 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 47 | task 26852 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 47 | task 26852 |
- prompt eval time = 575.19 ms / 199 tokens ( 2.89 ms per token, 345.97 tokens per second)
- eval time = 15239.12 ms / 101 tokens ( 150.88 ms per token, 6.63 tokens per second)
- total time = 15814.31 ms / 300 tokens
- slot launch_slot_: id 47 | task 26903 | processing task
- slot update_slots: id 47 | task 26903 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 47 | task 26903 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 47 | task 26903 | kv cache rm [0, end)
- slot update_slots: id 47 | task 26903 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 47 | task 26903 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 8 | task 26875 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 8 | task 26875 |
- prompt eval time = 161.12 ms / 199 tokens ( 0.81 ms per token, 1235.10 tokens per second)
- eval time = 9703.33 ms / 60 tokens ( 161.72 ms per token, 6.18 tokens per second)
- total time = 9864.46 ms / 259 tokens
- slot launch_slot_: id 8 | task 26904 | processing task
- slot update_slots: id 41 | task 26876 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 8 | task 26904 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 8 | task 26904 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 8 | task 26904 | kv cache rm [0, end)
- slot update_slots: id 8 | task 26904 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 8 | task 26904 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27712
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27707
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 41 | task 26876 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 41 | task 26876 |
- prompt eval time = 165.28 ms / 199 tokens ( 0.83 ms per token, 1204.00 tokens per second)
- eval time = 9960.92 ms / 60 tokens ( 166.02 ms per token, 6.02 tokens per second)
- total time = 10126.20 ms / 259 tokens
- slot launch_slot_: id 41 | task 26912 | processing task
- slot update_slots: id 13 | task 26877 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 26912 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 41 | task 26912 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 41 | task 26912 | kv cache rm [0, end)
- slot update_slots: id 41 | task 26912 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 41 | task 26912 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 17 | task 26878 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 45 | task 26879 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 5 | task 26880 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 40 | task 26882 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 26884 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 18 | task 26885 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 45 | task 26879 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 45 | task 26879 |
- prompt eval time = 411.32 ms / 199 tokens ( 2.07 ms per token, 483.80 tokens per second)
- eval time = 9663.02 ms / 60 tokens ( 161.05 ms per token, 6.21 tokens per second)
- total time = 10074.34 ms / 259 tokens
- slot launch_slot_: id 45 | task 26908 | processing task
- slot update_slots: id 21 | task 26886 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 45 | task 26908 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 45 | task 26908 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 45 | task 26908 | kv cache rm [0, end)
- slot update_slots: id 45 | task 26908 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 45 | task 26908 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27714
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 5 | task 26880 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 5 | task 26880 |
- prompt eval time = 536.85 ms / 199 tokens ( 2.70 ms per token, 370.68 tokens per second)
- eval time = 9499.17 ms / 60 tokens ( 158.32 ms per token, 6.32 tokens per second)
- total time = 10036.02 ms / 259 tokens
- slot release: id 40 | task 26882 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 40 | task 26882 |
- prompt eval time = 539.68 ms / 199 tokens ( 2.71 ms per token, 368.74 tokens per second)
- eval time = 9499.06 ms / 60 tokens ( 158.32 ms per token, 6.32 tokens per second)
- total time = 10038.74 ms / 259 tokens
- slot launch_slot_: id 5 | task 26905 | processing task
- slot launch_slot_: id 40 | task 26909 | processing task
- slot update_slots: id 52 | task 26742 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 5 | task 26905 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 5 | task 26905 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 5 | task 26905 | kv cache rm [0, end)
- slot update_slots: id 5 | task 26905 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 5 | task 26905 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 40 | task 26909 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 40 | task 26909 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 40 | task 26909 | kv cache rm [0, end)
- slot update_slots: id 40 | task 26909 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 40 | task 26909 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 1 | task 26884 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 1 | task 26884 |
- prompt eval time = 457.78 ms / 199 tokens ( 2.30 ms per token, 434.70 tokens per second)
- eval time = 9279.09 ms / 60 tokens ( 154.65 ms per token, 6.47 tokens per second)
- total time = 9736.87 ms / 259 tokens
- slot launch_slot_: id 1 | task 26911 | processing task
- slot update_slots: id 42 | task 26888 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 60 | task 26889 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 26911 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 1 | task 26911 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 1 | task 26911 | kv cache rm [0, end)
- slot update_slots: id 1 | task 26911 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 1 | task 26911 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 51 | task 26891 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 57 | task 26752 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 51 | task 26891 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 51 | task 26891 |
- prompt eval time = 330.27 ms / 199 tokens ( 1.66 ms per token, 602.54 tokens per second)
- eval time = 8971.92 ms / 60 tokens ( 149.53 ms per token, 6.69 tokens per second)
- total time = 9302.18 ms / 259 tokens
- slot launch_slot_: id 51 | task 26910 | processing task
- slot update_slots: id 51 | task 26910 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 51 | task 26910 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 51 | task 26910 | kv cache rm [0, end)
- slot update_slots: id 51 | task 26910 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 51 | task 26910 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 26 | task 26892 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 30 | task 26629 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 6 | task 26605 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27722
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27715
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 26 | task 26892 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 26 | task 26892 |
- prompt eval time = 157.09 ms / 199 tokens ( 0.79 ms per token, 1266.80 tokens per second)
- eval time = 8812.87 ms / 60 tokens ( 146.88 ms per token, 6.81 tokens per second)
- total time = 8969.96 ms / 259 tokens
- slot launch_slot_: id 26 | task 26906 | processing task
- slot update_slots: id 3 | task 26893 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 26 | task 26906 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 26 | task 26906 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 26 | task 26906 | kv cache rm [0, end)
- slot update_slots: id 26 | task 26906 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 26 | task 26906 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 43 | task 26653 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 3 | task 26893 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 3 | task 26893 |
- prompt eval time = 149.76 ms / 199 tokens ( 0.75 ms per token, 1328.78 tokens per second)
- eval time = 8974.61 ms / 60 tokens ( 149.58 ms per token, 6.69 tokens per second)
- total time = 9124.38 ms / 259 tokens
- slot launch_slot_: id 3 | task 26913 | processing task
- slot update_slots: id 3 | task 26913 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 3 | task 26913 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 3 | task 26913 | kv cache rm [0, end)
- slot update_slots: id 3 | task 26913 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 3 | task 26913 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 20 | task 26848 | stop processing: n_past = 195, truncated = 1
- slot print_timing: id 20 | task 26848 |
- prompt eval time = 573.20 ms / 199 tokens ( 2.88 ms per token, 347.17 tokens per second)
- eval time = 19664.58 ms / 124 tokens ( 158.59 ms per token, 6.31 tokens per second)
- total time = 20237.78 ms / 323 tokens
- slot launch_slot_: id 20 | task 26914 | processing task
- slot update_slots: id 20 | task 26914 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 20 | task 26914 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 20 | task 26914 | kv cache rm [0, end)
- slot update_slots: id 20 | task 26914 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 20 | task 26914 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27721
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 19 | task 26483 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 22 | task 26488 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 23 | task 26535 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 25 | task 26512 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 35 | task 26522 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 37 | task 26484 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 48 | task 26498 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 53 | task 26776 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 54 | task 26525 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 39 | task 26861 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 39 | task 26861 |
- prompt eval time = 233.93 ms / 199 tokens ( 1.18 ms per token, 850.69 tokens per second)
- eval time = 17145.24 ms / 101 tokens ( 169.75 ms per token, 5.89 tokens per second)
- total time = 17379.17 ms / 300 tokens
- slot launch_slot_: id 39 | task 26915 | processing task
- slot update_slots: id 59 | task 26894 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 39 | task 26915 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 39 | task 26915 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 39 | task 26915 | kv cache rm [0, end)
- slot update_slots: id 39 | task 26915 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 39 | task 26915 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 2 | task 26864 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 2 | task 26864 |
- prompt eval time = 154.00 ms / 199 tokens ( 0.77 ms per token, 1292.22 tokens per second)
- eval time = 17307.59 ms / 101 tokens ( 171.36 ms per token, 5.84 tokens per second)
- total time = 17461.59 ms / 300 tokens
- slot launch_slot_: id 2 | task 26916 | processing task
- slot update_slots: id 2 | task 26916 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 2 | task 26916 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 2 | task 26916 | kv cache rm [0, end)
- slot update_slots: id 2 | task 26916 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 2 | task 26916 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27719
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27725
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27717
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27723
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 59 | task 26894 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 59 | task 26894 |
- prompt eval time = 157.73 ms / 199 tokens ( 0.79 ms per token, 1261.65 tokens per second)
- eval time = 9723.99 ms / 59 tokens ( 164.81 ms per token, 6.07 tokens per second)
- total time = 9881.72 ms / 258 tokens
- slot launch_slot_: id 59 | task 26917 | processing task
- slot update_slots: id 59 | task 26917 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 59 | task 26917 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 59 | task 26917 | kv cache rm [0, end)
- slot update_slots: id 59 | task 26917 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 59 | task 26917 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 46 | task 26895 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 31 | task 26868 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 31 | task 26868 |
- prompt eval time = 237.47 ms / 199 tokens ( 1.19 ms per token, 838.00 tokens per second)
- eval time = 17039.19 ms / 101 tokens ( 168.70 ms per token, 5.93 tokens per second)
- total time = 17276.66 ms / 300 tokens
- slot launch_slot_: id 31 | task 26918 | processing task
- slot update_slots: id 31 | task 26918 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 31 | task 26918 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 31 | task 26918 | kv cache rm [0, end)
- slot update_slots: id 31 | task 26918 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 31 | task 26918 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 27 | task 26896 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 27 | task 26896 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 27 | task 26896 |
- prompt eval time = 156.44 ms / 199 tokens ( 0.79 ms per token, 1272.01 tokens per second)
- eval time = 9754.54 ms / 60 tokens ( 162.58 ms per token, 6.15 tokens per second)
- total time = 9910.99 ms / 259 tokens
- slot launch_slot_: id 27 | task 26919 | processing task
- slot update_slots: id 27 | task 26919 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 27 | task 26919 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 27 | task 26919 | kv cache rm [0, end)
- slot update_slots: id 27 | task 26919 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 27 | task 26919 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27727
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27718
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27739
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 12 | task 26897 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 7 | task 26898 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 12 | task 26897 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 12 | task 26897 |
- prompt eval time = 323.11 ms / 199 tokens ( 1.62 ms per token, 615.90 tokens per second)
- eval time = 9915.13 ms / 60 tokens ( 165.25 ms per token, 6.05 tokens per second)
- total time = 10238.23 ms / 259 tokens
- slot launch_slot_: id 12 | task 26920 | processing task
- slot update_slots: id 12 | task 26920 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 12 | task 26920 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 12 | task 26920 | kv cache rm [0, end)
- slot update_slots: id 12 | task 26920 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 12 | task 26920 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 7 | task 26898 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 7 | task 26898 |
- prompt eval time = 316.96 ms / 199 tokens ( 1.59 ms per token, 627.84 tokens per second)
- eval time = 9743.28 ms / 60 tokens ( 162.39 ms per token, 6.16 tokens per second)
- total time = 10060.24 ms / 259 tokens
- slot launch_slot_: id 7 | task 26921 | processing task
- slot update_slots: id 34 | task 26899 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 7 | task 26921 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 7 | task 26921 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 7 | task 26921 | kv cache rm [0, end)
- slot update_slots: id 7 | task 26921 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 7 | task 26921 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 4 | task 26792 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 15 | task 26795 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 28 | task 26900 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 58 | task 26901 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 63 | task 26902 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 34 | task 26899 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 34 | task 26899 |
- prompt eval time = 159.89 ms / 199 tokens ( 0.80 ms per token, 1244.59 tokens per second)
- eval time = 9921.87 ms / 60 tokens ( 165.36 ms per token, 6.05 tokens per second)
- total time = 10081.76 ms / 259 tokens
- slot launch_slot_: id 34 | task 26922 | processing task
- slot update_slots: id 34 | task 26922 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 34 | task 26922 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 34 | task 26922 | kv cache rm [0, end)
- slot update_slots: id 34 | task 26922 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 34 | task 26922 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 58 | task 26901 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 58 | task 26901 |
- prompt eval time = 345.64 ms / 199 tokens ( 1.74 ms per token, 575.74 tokens per second)
- eval time = 9684.24 ms / 59 tokens ( 164.14 ms per token, 6.09 tokens per second)
- total time = 10029.88 ms / 258 tokens
- slot release: id 63 | task 26902 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 63 | task 26902 |
- prompt eval time = 345.95 ms / 199 tokens ( 1.74 ms per token, 575.22 tokens per second)
- eval time = 9684.26 ms / 59 tokens ( 164.14 ms per token, 6.09 tokens per second)
- total time = 10030.22 ms / 258 tokens
- slot launch_slot_: id 58 | task 26923 | processing task
- slot launch_slot_: id 63 | task 26928 | processing task
- slot update_slots: id 58 | task 26923 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 58 | task 26923 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 58 | task 26923 | kv cache rm [0, end)
- slot update_slots: id 58 | task 26923 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 58 | task 26923 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 63 | task 26928 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 63 | task 26928 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 63 | task 26928 | kv cache rm [0, end)
- slot update_slots: id 63 | task 26928 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 63 | task 26928 | prompt done, n_past = 199, n_tokens = 460
- srv cancel_tasks: cancel task, id_task = 27741
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27744
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27750
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 18 | task 26885 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 18 | task 26885 |
- prompt eval time = 459.02 ms / 199 tokens ( 2.31 ms per token, 433.53 tokens per second)
- eval time = 16200.49 ms / 101 tokens ( 160.40 ms per token, 6.23 tokens per second)
- total time = 16659.52 ms / 300 tokens
- slot release: id 28 | task 26900 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 28 | task 26900 |
- prompt eval time = 343.23 ms / 199 tokens ( 1.72 ms per token, 579.79 tokens per second)
- eval time = 10208.20 ms / 60 tokens ( 170.14 ms per token, 5.88 tokens per second)
- total time = 10551.43 ms / 259 tokens
- slot release: id 29 | task 26865 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 29 | task 26865 |
- prompt eval time = 402.41 ms / 199 tokens ( 2.02 ms per token, 494.52 tokens per second)
- eval time = 20663.80 ms / 123 tokens ( 168.00 ms per token, 5.95 tokens per second)
- total time = 21066.21 ms / 322 tokens
- slot launch_slot_: id 18 | task 26929 | processing task
- slot launch_slot_: id 28 | task 26930 | processing task
- slot launch_slot_: id 29 | task 26931 | processing task
- slot update_slots: id 56 | task 26907 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 18 | task 26929 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 18 | task 26929 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 18 | task 26929 | kv cache rm [0, end)
- slot update_slots: id 18 | task 26929 | prompt processing progress, n_past = 199, n_tokens = 260, progress = 1.000000
- slot update_slots: id 18 | task 26929 | prompt done, n_past = 199, n_tokens = 260
- slot update_slots: id 28 | task 26930 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 28 | task 26930 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 28 | task 26930 | kv cache rm [0, end)
- slot update_slots: id 28 | task 26930 | prompt processing progress, n_past = 199, n_tokens = 459, progress = 1.000000
- slot update_slots: id 28 | task 26930 | prompt done, n_past = 199, n_tokens = 459
- slot update_slots: id 29 | task 26931 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 29 | task 26931 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 29 | task 26931 | kv cache rm [0, end)
- slot update_slots: id 29 | task 26931 | prompt processing progress, n_past = 199, n_tokens = 658, progress = 1.000000
- slot update_slots: id 29 | task 26931 | prompt done, n_past = 199, n_tokens = 658
- srv cancel_tasks: cancel task, id_task = 27819
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27877
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27862
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 11 | task 26813 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 42 | task 26888 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 42 | task 26888 |
- prompt eval time = 238.41 ms / 199 tokens ( 1.20 ms per token, 834.70 tokens per second)
- eval time = 16668.91 ms / 101 tokens ( 165.04 ms per token, 6.06 tokens per second)
- total time = 16907.32 ms / 300 tokens
- slot release: id 56 | task 26907 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 56 | task 26907 |
- prompt eval time = 163.09 ms / 199 tokens ( 0.82 ms per token, 1220.19 tokens per second)
- eval time = 10980.62 ms / 60 tokens ( 183.01 ms per token, 5.46 tokens per second)
- total time = 11143.71 ms / 259 tokens
- slot release: id 60 | task 26889 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 60 | task 26889 |
- prompt eval time = 241.09 ms / 199 tokens ( 1.21 ms per token, 825.43 tokens per second)
- eval time = 16667.74 ms / 101 tokens ( 165.03 ms per token, 6.06 tokens per second)
- total time = 16908.82 ms / 300 tokens
- slot launch_slot_: id 42 | task 26932 | processing task
- slot launch_slot_: id 56 | task 26934 | processing task
- slot launch_slot_: id 60 | task 26936 | processing task
- slot update_slots: id 42 | task 26932 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 42 | task 26932 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 42 | task 26932 | kv cache rm [0, end)
- slot update_slots: id 42 | task 26932 | prompt processing progress, n_past = 199, n_tokens = 260, progress = 1.000000
- slot update_slots: id 42 | task 26932 | prompt done, n_past = 199, n_tokens = 260
- slot update_slots: id 56 | task 26934 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 56 | task 26934 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 56 | task 26934 | kv cache rm [0, end)
- slot update_slots: id 56 | task 26934 | prompt processing progress, n_past = 199, n_tokens = 459, progress = 1.000000
- slot update_slots: id 56 | task 26934 | prompt done, n_past = 199, n_tokens = 459
- slot update_slots: id 60 | task 26936 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 60 | task 26936 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 60 | task 26936 | kv cache rm [0, end)
- slot update_slots: id 60 | task 26936 | prompt processing progress, n_past = 199, n_tokens = 658, progress = 1.000000
- slot update_slots: id 60 | task 26936 | prompt done, n_past = 199, n_tokens = 658
- srv cancel_tasks: cancel task, id_task = 27886
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27876
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 47 | task 26903 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 55 | task 26686 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 8 | task 26904 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 26912 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27890
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27880
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 41 | task 26912 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 41 | task 26912 |
- prompt eval time = 163.48 ms / 199 tokens ( 0.82 ms per token, 1217.30 tokens per second)
- eval time = 10786.70 ms / 60 tokens ( 179.78 ms per token, 5.56 tokens per second)
- total time = 10950.18 ms / 259 tokens
- slot launch_slot_: id 41 | task 26937 | processing task
- slot update_slots: id 10 | task 26821 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 26937 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 41 | task 26937 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 41 | task 26937 | kv cache rm [0, end)
- slot update_slots: id 41 | task 26937 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 41 | task 26937 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 45 | task 26908 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 5 | task 26905 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 40 | task 26909 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 26911 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27885
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 40 | task 26909 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 40 | task 26909 |
- prompt eval time = 242.37 ms / 199 tokens ( 1.22 ms per token, 821.06 tokens per second)
- eval time = 10481.69 ms / 60 tokens ( 174.69 ms per token, 5.72 tokens per second)
- total time = 10724.06 ms / 259 tokens
- slot launch_slot_: id 40 | task 26949 | processing task
- slot update_slots: id 9 | task 26823 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 40 | task 26949 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 40 | task 26949 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 40 | task 26949 | kv cache rm [0, end)
- slot update_slots: id 40 | task 26949 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 40 | task 26949 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 51 | task 26910 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 51 | task 26910 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 51 | task 26910 |
- prompt eval time = 156.56 ms / 199 tokens ( 0.79 ms per token, 1271.07 tokens per second)
- eval time = 10008.46 ms / 60 tokens ( 166.81 ms per token, 5.99 tokens per second)
- total time = 10165.02 ms / 259 tokens
- slot launch_slot_: id 51 | task 26948 | processing task
- slot update_slots: id 51 | task 26948 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 51 | task 26948 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 51 | task 26948 | kv cache rm [0, end)
- slot update_slots: id 51 | task 26948 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 51 | task 26948 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 21 | task 26886 | stop processing: n_past = 195, truncated = 1
- slot print_timing: id 21 | task 26886 |
- prompt eval time = 355.54 ms / 199 tokens ( 1.79 ms per token, 559.72 tokens per second)
- eval time = 20099.73 ms / 124 tokens ( 162.09 ms per token, 6.17 tokens per second)
- total time = 20455.27 ms / 323 tokens
- slot launch_slot_: id 21 | task 26951 | processing task
- slot update_slots: id 21 | task 26951 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 21 | task 26951 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 21 | task 26951 | kv cache rm [0, end)
- slot update_slots: id 21 | task 26951 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 21 | task 26951 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 26 | task 26906 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27902
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27889
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27888
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27887
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 0 | task 26700 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 26 | task 26906 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 26 | task 26906 |
- prompt eval time = 321.53 ms / 199 tokens ( 1.62 ms per token, 618.92 tokens per second)
- eval time = 9942.70 ms / 60 tokens ( 165.71 ms per token, 6.03 tokens per second)
- total time = 10264.23 ms / 259 tokens
- slot launch_slot_: id 26 | task 26954 | processing task
- slot update_slots: id 3 | task 26913 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 26 | task 26954 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 26 | task 26954 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 26 | task 26954 | kv cache rm [0, end)
- slot update_slots: id 26 | task 26954 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 26 | task 26954 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 17 | task 26878 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 17 | task 26878 |
- prompt eval time = 409.12 ms / 199 tokens ( 2.06 ms per token, 486.41 tokens per second)
- eval time = 22132.62 ms / 132 tokens ( 167.67 ms per token, 5.96 tokens per second)
- total time = 22541.73 ms / 331 tokens
- slot launch_slot_: id 17 | task 26956 | processing task
- slot update_slots: id 20 | task 26914 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 17 | task 26956 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 17 | task 26956 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 17 | task 26956 | kv cache rm [0, end)
- slot update_slots: id 17 | task 26956 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 17 | task 26956 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 49 | task 26713 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 61 | task 26715 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 3 | task 26913 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 3 | task 26913 |
- prompt eval time = 152.36 ms / 199 tokens ( 0.77 ms per token, 1306.12 tokens per second)
- eval time = 10432.18 ms / 60 tokens ( 173.87 ms per token, 5.75 tokens per second)
- total time = 10584.54 ms / 259 tokens
- slot release: id 20 | task 26914 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 20 | task 26914 |
- prompt eval time = 155.53 ms / 199 tokens ( 0.78 ms per token, 1279.46 tokens per second)
- eval time = 10273.32 ms / 59 tokens ( 174.12 ms per token, 5.74 tokens per second)
- total time = 10428.85 ms / 258 tokens
- slot launch_slot_: id 3 | task 26961 | processing task
- slot launch_slot_: id 20 | task 26965 | processing task
- slot update_slots: id 32 | task 26846 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 3 | task 26961 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 3 | task 26961 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 3 | task 26961 | kv cache rm [0, end)
- slot update_slots: id 3 | task 26961 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 3 | task 26961 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 20 | task 26965 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 20 | task 26965 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 20 | task 26965 | kv cache rm [0, end)
- slot update_slots: id 20 | task 26965 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 20 | task 26965 | prompt done, n_past = 199, n_tokens = 460
- srv cancel_tasks: cancel task, id_task = 27904
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27905
- srv cancel_tasks: cancel task, id_task = 27906
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 39 | task 26915 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 2 | task 26916 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 16 | task 26582 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 59 | task 26917 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 39 | task 26915 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 39 | task 26915 |
- prompt eval time = 320.24 ms / 199 tokens ( 1.61 ms per token, 621.42 tokens per second)
- eval time = 10784.78 ms / 60 tokens ( 179.75 ms per token, 5.56 tokens per second)
- total time = 11105.02 ms / 259 tokens
- slot launch_slot_: id 39 | task 26966 | processing task
- slot update_slots: id 38 | task 26598 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 44 | task 26608 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 39 | task 26966 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 39 | task 26966 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 39 | task 26966 | kv cache rm [0, end)
- slot update_slots: id 39 | task 26966 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 39 | task 26966 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 2 | task 26916 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 2 | task 26916 |
- prompt eval time = 321.62 ms / 199 tokens ( 1.62 ms per token, 618.74 tokens per second)
- eval time = 10863.38 ms / 60 tokens ( 181.06 ms per token, 5.52 tokens per second)
- total time = 11185.01 ms / 259 tokens
- slot launch_slot_: id 2 | task 26967 | processing task
- slot update_slots: id 31 | task 26918 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 62 | task 26614 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 2 | task 26967 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 2 | task 26967 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 2 | task 26967 | kv cache rm [0, end)
- slot update_slots: id 2 | task 26967 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 2 | task 26967 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27907
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27910
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 59 | task 26917 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 59 | task 26917 |
- prompt eval time = 321.68 ms / 199 tokens ( 1.62 ms per token, 618.63 tokens per second)
- eval time = 10949.05 ms / 60 tokens ( 182.48 ms per token, 5.48 tokens per second)
- total time = 11270.73 ms / 259 tokens
- slot launch_slot_: id 59 | task 26968 | processing task
- slot update_slots: id 59 | task 26968 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 59 | task 26968 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 59 | task 26968 | kv cache rm [0, end)
- slot update_slots: id 59 | task 26968 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 59 | task 26968 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 31 | task 26918 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 31 | task 26918 |
- prompt eval time = 149.69 ms / 199 tokens ( 0.75 ms per token, 1329.39 tokens per second)
- eval time = 11218.19 ms / 60 tokens ( 186.97 ms per token, 5.35 tokens per second)
- total time = 11367.88 ms / 259 tokens
- slot launch_slot_: id 31 | task 26969 | processing task
- slot update_slots: id 31 | task 26969 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 31 | task 26969 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 31 | task 26969 | kv cache rm [0, end)
- slot update_slots: id 31 | task 26969 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 31 | task 26969 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27909
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27921
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27911
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27908
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 27 | task 26919 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 33 | task 26851 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 12 | task 26920 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 7 | task 26921 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 12 | task 26920 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 12 | task 26920 |
- prompt eval time = 152.80 ms / 199 tokens ( 0.77 ms per token, 1302.37 tokens per second)
- eval time = 10763.58 ms / 60 tokens ( 179.39 ms per token, 5.57 tokens per second)
- total time = 10916.38 ms / 259 tokens
- slot launch_slot_: id 12 | task 26970 | processing task
- slot update_slots: id 12 | task 26970 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 12 | task 26970 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 12 | task 26970 | kv cache rm [0, end)
- slot update_slots: id 12 | task 26970 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 12 | task 26970 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 7 | task 26921 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 7 | task 26921 |
- prompt eval time = 157.96 ms / 199 tokens ( 0.79 ms per token, 1259.82 tokens per second)
- eval time = 10758.80 ms / 60 tokens ( 179.31 ms per token, 5.58 tokens per second)
- total time = 10916.76 ms / 259 tokens
- slot launch_slot_: id 7 | task 26972 | processing task
- slot update_slots: id 34 | task 26922 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 7 | task 26972 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 7 | task 26972 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 7 | task 26972 | kv cache rm [0, end)
- slot update_slots: id 7 | task 26972 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 7 | task 26972 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 58 | task 26923 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 63 | task 26928 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 18 | task 26929 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 28 | task 26930 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 29 | task 26931 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 63 | task 26928 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 63 | task 26928 |
- prompt eval time = 523.78 ms / 199 tokens ( 2.63 ms per token, 379.93 tokens per second)
- eval time = 10368.32 ms / 60 tokens ( 172.81 ms per token, 5.79 tokens per second)
- total time = 10892.09 ms / 259 tokens
- slot launch_slot_: id 63 | task 26973 | processing task
- slot update_slots: id 63 | task 26973 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 63 | task 26973 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 63 | task 26973 | kv cache rm [0, end)
- slot update_slots: id 63 | task 26973 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 63 | task 26973 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27924
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 1 | task 26911 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 1 | task 26911 |
- prompt eval time = 346.93 ms / 199 tokens ( 1.74 ms per token, 573.60 tokens per second)
- eval time = 17093.13 ms / 101 tokens ( 169.24 ms per token, 5.91 tokens per second)
- total time = 17440.06 ms / 300 tokens
- slot release: id 18 | task 26929 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 18 | task 26929 |
- prompt eval time = 653.87 ms / 199 tokens ( 3.29 ms per token, 304.34 tokens per second)
- eval time = 9866.25 ms / 60 tokens ( 164.44 ms per token, 6.08 tokens per second)
- total time = 10520.12 ms / 259 tokens
- slot release: id 28 | task 26930 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 28 | task 26930 |
- prompt eval time = 654.73 ms / 199 tokens ( 3.29 ms per token, 303.94 tokens per second)
- eval time = 9866.12 ms / 60 tokens ( 164.44 ms per token, 6.08 tokens per second)
- total time = 10520.86 ms / 259 tokens
- slot release: id 29 | task 26931 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 29 | task 26931 |
- prompt eval time = 654.76 ms / 199 tokens ( 3.29 ms per token, 303.93 tokens per second)
- eval time = 9866.15 ms / 60 tokens ( 164.44 ms per token, 6.08 tokens per second)
- total time = 10520.91 ms / 259 tokens
- slot launch_slot_: id 1 | task 26974 | processing task
- slot launch_slot_: id 18 | task 26976 | processing task
- slot launch_slot_: id 28 | task 26977 | processing task
- slot launch_slot_: id 29 | task 26978 | processing task
- slot update_slots: id 42 | task 26932 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 56 | task 26934 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 60 | task 26936 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 26974 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 1 | task 26974 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 1 | task 26974 | kv cache rm [0, end)
- slot update_slots: id 1 | task 26974 | prompt processing progress, n_past = 199, n_tokens = 259, progress = 1.000000
- slot update_slots: id 1 | task 26974 | prompt done, n_past = 199, n_tokens = 259
- slot update_slots: id 18 | task 26976 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 18 | task 26976 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 18 | task 26976 | kv cache rm [0, end)
- slot update_slots: id 18 | task 26976 | prompt processing progress, n_past = 199, n_tokens = 458, progress = 1.000000
- slot update_slots: id 18 | task 26976 | prompt done, n_past = 199, n_tokens = 458
- slot update_slots: id 28 | task 26977 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 28 | task 26977 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 28 | task 26977 | kv cache rm [0, end)
- slot update_slots: id 28 | task 26977 | prompt processing progress, n_past = 199, n_tokens = 657, progress = 1.000000
- slot update_slots: id 28 | task 26977 | prompt done, n_past = 199, n_tokens = 657
- slot update_slots: id 29 | task 26978 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 29 | task 26978 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 29 | task 26978 | kv cache rm [0, end)
- slot update_slots: id 29 | task 26978 | prompt processing progress, n_past = 199, n_tokens = 856, progress = 1.000000
- slot update_slots: id 29 | task 26978 | prompt done, n_past = 199, n_tokens = 856
- srv cancel_tasks: cancel task, id_task = 27919
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27920
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 50 | task 26866 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 42 | task 26932 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 42 | task 26932 |
- prompt eval time = 342.67 ms / 199 tokens ( 1.72 ms per token, 580.73 tokens per second)
- eval time = 9783.18 ms / 60 tokens ( 163.05 ms per token, 6.13 tokens per second)
- total time = 10125.86 ms / 259 tokens
- slot release: id 60 | task 26936 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 60 | task 26936 |
- prompt eval time = 344.04 ms / 199 tokens ( 1.73 ms per token, 578.41 tokens per second)
- eval time = 9783.14 ms / 60 tokens ( 163.05 ms per token, 6.13 tokens per second)
- total time = 10127.18 ms / 259 tokens
- slot launch_slot_: id 42 | task 26981 | processing task
- slot launch_slot_: id 60 | task 26983 | processing task
- slot update_slots: id 36 | task 26869 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 42 | task 26981 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 42 | task 26981 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 42 | task 26981 | kv cache rm [0, end)
- slot update_slots: id 42 | task 26981 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 42 | task 26981 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 60 | task 26983 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 60 | task 26983 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 60 | task 26983 | kv cache rm [0, end)
- slot update_slots: id 60 | task 26983 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 60 | task 26983 | prompt done, n_past = 199, n_tokens = 460
- srv cancel_tasks: cancel task, id_task = 27918
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27923
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 24 | task 26621 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 26937 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 41 | task 26937 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 41 | task 26937 |
- prompt eval time = 161.52 ms / 199 tokens ( 0.81 ms per token, 1232.02 tokens per second)
- eval time = 9672.74 ms / 60 tokens ( 161.21 ms per token, 6.20 tokens per second)
- total time = 9834.27 ms / 259 tokens
- slot launch_slot_: id 41 | task 26984 | processing task
- slot update_slots: id 41 | task 26984 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 41 | task 26984 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 41 | task 26984 | kv cache rm [0, end)
- slot update_slots: id 41 | task 26984 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 41 | task 26984 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27929
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27922
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 14 | task 26627 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 40 | task 26949 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 13 | task 26877 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 51 | task 26948 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 21 | task 26951 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 21 | task 26951 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 21 | task 26951 |
- prompt eval time = 155.66 ms / 199 tokens ( 0.78 ms per token, 1278.45 tokens per second)
- eval time = 9072.38 ms / 59 tokens ( 153.77 ms per token, 6.50 tokens per second)
- total time = 9228.04 ms / 258 tokens
- slot launch_slot_: id 21 | task 26985 | processing task
- slot update_slots: id 21 | task 26985 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 21 | task 26985 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 21 | task 26985 | kv cache rm [0, end)
- slot update_slots: id 21 | task 26985 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 21 | task 26985 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27945
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27934
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27947
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 26 | task 26954 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 52 | task 26742 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 17 | task 26956 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 26 | task 26954 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 26 | task 26954 |
- prompt eval time = 159.90 ms / 199 tokens ( 0.80 ms per token, 1244.56 tokens per second)
- eval time = 9363.16 ms / 60 tokens ( 156.05 ms per token, 6.41 tokens per second)
- total time = 9523.05 ms / 259 tokens
- slot launch_slot_: id 26 | task 26988 | processing task
- slot update_slots: id 3 | task 26961 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 20 | task 26965 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 26 | task 26988 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 26 | task 26988 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 26 | task 26988 | kv cache rm [0, end)
- slot update_slots: id 26 | task 26988 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 26 | task 26988 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 17 | task 26956 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 17 | task 26956 |
- prompt eval time = 418.79 ms / 199 tokens ( 2.10 ms per token, 475.18 tokens per second)
- eval time = 9103.58 ms / 60 tokens ( 151.73 ms per token, 6.59 tokens per second)
- total time = 9522.36 ms / 259 tokens
- slot launch_slot_: id 17 | task 26992 | processing task
- slot update_slots: id 57 | task 26752 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 17 | task 26992 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 17 | task 26992 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 17 | task 26992 | kv cache rm [0, end)
- slot update_slots: id 17 | task 26992 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 17 | task 26992 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27933
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 20 | task 26965 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 20 | task 26965 |
- prompt eval time = 529.52 ms / 199 tokens ( 2.66 ms per token, 375.81 tokens per second)
- eval time = 8782.08 ms / 60 tokens ( 146.37 ms per token, 6.83 tokens per second)
- total time = 9311.60 ms / 259 tokens
- slot launch_slot_: id 20 | task 26989 | processing task
- slot update_slots: id 20 | task 26989 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 20 | task 26989 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 20 | task 26989 | kv cache rm [0, end)
- slot update_slots: id 20 | task 26989 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 20 | task 26989 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 30 | task 26629 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 39 | task 26966 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 2 | task 26967 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 6 | task 26605 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27940
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27937
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 59 | task 26968 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 43 | task 26653 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 2 | task 26967 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 2 | task 26967 |
- prompt eval time = 402.27 ms / 199 tokens ( 2.02 ms per token, 494.70 tokens per second)
- eval time = 8237.66 ms / 60 tokens ( 137.29 ms per token, 7.28 tokens per second)
- total time = 8639.92 ms / 259 tokens
- slot launch_slot_: id 2 | task 26993 | processing task
- slot update_slots: id 31 | task 26969 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 2 | task 26993 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 2 | task 26993 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 2 | task 26993 | kv cache rm [0, end)
- slot update_slots: id 2 | task 26993 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 2 | task 26993 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 59 | task 26968 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 59 | task 26968 |
- prompt eval time = 425.59 ms / 199 tokens ( 2.14 ms per token, 467.58 tokens per second)
- eval time = 7973.92 ms / 60 tokens ( 132.90 ms per token, 7.52 tokens per second)
- total time = 8399.52 ms / 259 tokens
- slot launch_slot_: id 59 | task 26994 | processing task
- slot update_slots: id 59 | task 26994 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 59 | task 26994 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 59 | task 26994 | kv cache rm [0, end)
- slot update_slots: id 59 | task 26994 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 59 | task 26994 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27944
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27935
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 31 | task 26969 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 31 | task 26969 |
- prompt eval time = 147.99 ms / 199 tokens ( 0.74 ms per token, 1344.68 tokens per second)
- eval time = 8277.79 ms / 60 tokens ( 137.96 ms per token, 7.25 tokens per second)
- total time = 8425.78 ms / 259 tokens
- slot release: id 58 | task 26923 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 58 | task 26923 |
- prompt eval time = 523.45 ms / 199 tokens ( 2.63 ms per token, 380.17 tokens per second)
- eval time = 16742.41 ms / 101 tokens ( 165.77 ms per token, 6.03 tokens per second)
- total time = 17265.85 ms / 300 tokens
- slot launch_slot_: id 31 | task 26995 | processing task
- slot launch_slot_: id 58 | task 26996 | processing task
- slot update_slots: id 31 | task 26995 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 31 | task 26995 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 31 | task 26995 | kv cache rm [0, end)
- slot update_slots: id 31 | task 26995 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 31 | task 26995 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 58 | task 26996 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 58 | task 26996 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 58 | task 26996 | kv cache rm [0, end)
- slot update_slots: id 58 | task 26996 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 58 | task 26996 | prompt done, n_past = 199, n_tokens = 460
- slot update_slots: id 19 | task 26483 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 22 | task 26488 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 23 | task 26535 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 25 | task 26512 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 35 | task 26522 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 37 | task 26484 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 48 | task 26498 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 53 | task 26776 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 54 | task 26525 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27941
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 56 | task 26934 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 56 | task 26934 |
- prompt eval time = 343.76 ms / 199 tokens ( 1.73 ms per token, 578.89 tokens per second)
- eval time = 16370.12 ms / 101 tokens ( 162.08 ms per token, 6.17 tokens per second)
- total time = 16713.88 ms / 300 tokens
- slot launch_slot_: id 56 | task 26997 | processing task
- slot update_slots: id 56 | task 26997 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 56 | task 26997 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 56 | task 26997 | kv cache rm [0, end)
- slot update_slots: id 56 | task 26997 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 56 | task 26997 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 46 | task 26895 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27948
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27939
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 12 | task 26970 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 7 | task 26972 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 12 | task 26970 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 12 | task 26970 |
- prompt eval time = 152.67 ms / 199 tokens ( 0.77 ms per token, 1303.46 tokens per second)
- eval time = 9856.74 ms / 60 tokens ( 164.28 ms per token, 6.09 tokens per second)
- total time = 10009.41 ms / 259 tokens
- slot launch_slot_: id 12 | task 26990 | processing task
- slot update_slots: id 12 | task 26990 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 12 | task 26990 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 12 | task 26990 | kv cache rm [0, end)
- slot update_slots: id 12 | task 26990 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 12 | task 26990 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 7 | task 26972 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 7 | task 26972 |
- prompt eval time = 159.27 ms / 199 tokens ( 0.80 ms per token, 1249.42 tokens per second)
- eval time = 9842.47 ms / 60 tokens ( 164.04 ms per token, 6.10 tokens per second)
- total time = 10001.75 ms / 259 tokens
- slot launch_slot_: id 7 | task 27000 | processing task
- slot update_slots: id 7 | task 27000 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 7 | task 27000 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 7 | task 27000 | kv cache rm [0, end)
- slot update_slots: id 7 | task 27000 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 7 | task 27000 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 63 | task 26973 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 26974 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 18 | task 26976 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 28 | task 26977 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 29 | task 26978 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27946
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 29 | task 26978 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 29 | task 26978 |
- prompt eval time = 444.96 ms / 199 tokens ( 2.24 ms per token, 447.23 tokens per second)
- eval time = 9350.06 ms / 59 tokens ( 158.48 ms per token, 6.31 tokens per second)
- total time = 9795.02 ms / 258 tokens
- slot release: id 40 | task 26949 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 40 | task 26949 |
- prompt eval time = 157.37 ms / 199 tokens ( 0.79 ms per token, 1264.54 tokens per second)
- eval time = 16603.12 ms / 101 tokens ( 164.39 ms per token, 6.08 tokens per second)
- total time = 16760.49 ms / 300 tokens
- slot launch_slot_: id 29 | task 27004 | processing task
- slot launch_slot_: id 40 | task 27005 | processing task
- slot update_slots: id 29 | task 27004 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 29 | task 27004 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 29 | task 27004 | kv cache rm [0, end)
- slot update_slots: id 29 | task 27004 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 29 | task 27004 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 40 | task 27005 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 40 | task 27005 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 40 | task 27005 | kv cache rm [0, end)
- slot update_slots: id 40 | task 27005 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 40 | task 27005 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 1 | task 26974 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 1 | task 26974 |
- prompt eval time = 443.07 ms / 199 tokens ( 2.23 ms per token, 449.14 tokens per second)
- eval time = 9584.85 ms / 60 tokens ( 159.75 ms per token, 6.26 tokens per second)
- total time = 10027.91 ms / 259 tokens
- slot release: id 18 | task 26976 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 18 | task 26976 |
- prompt eval time = 444.30 ms / 199 tokens ( 2.23 ms per token, 447.90 tokens per second)
- eval time = 9584.91 ms / 60 tokens ( 159.75 ms per token, 6.26 tokens per second)
- total time = 10029.21 ms / 259 tokens
- slot launch_slot_: id 1 | task 27008 | processing task
- slot launch_slot_: id 18 | task 27009 | processing task
- slot update_slots: id 42 | task 26981 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 60 | task 26983 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 27008 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 1 | task 27008 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 1 | task 27008 | kv cache rm [0, end)
- slot update_slots: id 1 | task 27008 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 1 | task 27008 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 18 | task 27009 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 18 | task 27009 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 18 | task 27009 | kv cache rm [0, end)
- slot update_slots: id 18 | task 27009 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 18 | task 27009 | prompt done, n_past = 199, n_tokens = 460
- srv cancel_tasks: cancel task, id_task = 27942
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 42 | task 26981 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 42 | task 26981 |
- prompt eval time = 408.21 ms / 199 tokens ( 2.05 ms per token, 487.49 tokens per second)
- eval time = 9348.29 ms / 59 tokens ( 158.45 ms per token, 6.31 tokens per second)
- total time = 9756.51 ms / 258 tokens
- slot launch_slot_: id 42 | task 27006 | processing task
- slot update_slots: id 42 | task 27006 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 42 | task 27006 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 42 | task 27006 | kv cache rm [0, end)
- slot update_slots: id 42 | task 27006 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 42 | task 27006 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 4 | task 26792 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 15 | task 26795 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27950
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27949
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27962
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 11 | task 26813 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 26984 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 41 | task 26984 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 41 | task 26984 |
- prompt eval time = 158.99 ms / 199 tokens ( 0.80 ms per token, 1251.62 tokens per second)
- eval time = 9576.53 ms / 60 tokens ( 159.61 ms per token, 6.27 tokens per second)
- total time = 9735.53 ms / 259 tokens
- slot launch_slot_: id 41 | task 27010 | processing task
- slot update_slots: id 47 | task 26903 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 55 | task 26686 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 27010 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 41 | task 27010 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 41 | task 27010 | kv cache rm [0, end)
- slot update_slots: id 41 | task 27010 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 41 | task 27010 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 8 | task 26904 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27951
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 10 | task 26821 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 21 | task 26985 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 45 | task 26908 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 5 | task 26905 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 21 | task 26985 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 21 | task 26985 |
- prompt eval time = 159.84 ms / 199 tokens ( 0.80 ms per token, 1244.99 tokens per second)
- eval time = 9789.43 ms / 60 tokens ( 163.16 ms per token, 6.13 tokens per second)
- total time = 9949.27 ms / 259 tokens
- slot launch_slot_: id 21 | task 27011 | processing task
- slot update_slots: id 21 | task 27011 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 21 | task 27011 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 21 | task 27011 | kv cache rm [0, end)
- slot update_slots: id 21 | task 27011 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 21 | task 27011 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27957
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot update_slots: id 9 | task 26823 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 26 | task 26988 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 51 | task 26948 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 51 | task 26948 |
- prompt eval time = 155.12 ms / 199 tokens ( 0.78 ms per token, 1282.91 tokens per second)
- eval time = 19644.44 ms / 123 tokens ( 159.71 ms per token, 6.26 tokens per second)
- total time = 19799.55 ms / 322 tokens
- slot launch_slot_: id 51 | task 27012 | processing task
- slot update_slots: id 17 | task 26992 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 51 | task 27012 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 51 | task 27012 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 51 | task 27012 | kv cache rm [0, end)
- slot update_slots: id 51 | task 27012 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 51 | task 27012 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 26 | task 26988 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 26 | task 26988 |
- prompt eval time = 161.22 ms / 199 tokens ( 0.81 ms per token, 1234.35 tokens per second)
- eval time = 9914.66 ms / 60 tokens ( 165.24 ms per token, 6.05 tokens per second)
- total time = 10075.88 ms / 259 tokens
- slot launch_slot_: id 26 | task 27013 | processing task
- slot update_slots: id 20 | task 26989 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 26 | task 27013 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 26 | task 27013 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 26 | task 27013 | kv cache rm [0, end)
- slot update_slots: id 26 | task 27013 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 26 | task 27013 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 17 | task 26992 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 17 | task 26992 |
- prompt eval time = 329.88 ms / 199 tokens ( 1.66 ms per token, 603.25 tokens per second)
- eval time = 9741.43 ms / 60 tokens ( 162.36 ms per token, 6.16 tokens per second)
- total time = 10071.31 ms / 259 tokens
- slot launch_slot_: id 17 | task 27014 | processing task
- slot update_slots: id 17 | task 27014 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 17 | task 27014 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 17 | task 27014 | kv cache rm [0, end)
- slot update_slots: id 17 | task 27014 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 17 | task 27014 | prompt done, n_past = 199, n_tokens = 262
- srv cancel_tasks: cancel task, id_task = 27959
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27958
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 3 | task 26961 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 3 | task 26961 |
- prompt eval time = 528.33 ms / 199 tokens ( 2.65 ms per token, 376.66 tokens per second)
- eval time = 19011.69 ms / 123 tokens ( 154.57 ms per token, 6.47 tokens per second)
- total time = 19540.02 ms / 322 tokens
- slot launch_slot_: id 3 | task 27015 | processing task
- slot update_slots: id 0 | task 26700 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 2 | task 26993 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 3 | task 27015 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 3 | task 27015 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 3 | task 27015 | kv cache rm [0, end)
- slot update_slots: id 3 | task 27015 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 3 | task 27015 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 59 | task 26994 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- srv cancel_tasks: cancel task, id_task = 27960
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- srv cancel_tasks: cancel task, id_task = 27961
- srv log_server_r: request: POST /v1/chat/completions 127.0.0.1 200
- slot release: id 2 | task 26993 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 2 | task 26993 |
- prompt eval time = 157.22 ms / 199 tokens ( 0.79 ms per token, 1265.78 tokens per second)
- eval time = 9440.86 ms / 60 tokens ( 157.35 ms per token, 6.36 tokens per second)
- total time = 9598.08 ms / 259 tokens
- slot launch_slot_: id 2 | task 27016 | processing task
- slot update_slots: id 31 | task 26995 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 49 | task 26713 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 58 | task 26996 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 61 | task 26715 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 2 | task 27016 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 2 | task 27016 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 2 | task 27016 | kv cache rm [0, end)
- slot update_slots: id 2 | task 27016 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 2 | task 27016 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 32 | task 26846 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 56 | task 26997 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 16 | task 26582 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 38 | task 26598 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 44 | task 26608 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 62 | task 26614 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 12 | task 26990 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 27 | task 26919 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 7 | task 27000 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 33 | task 26851 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 12 | task 26990 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 12 | task 26990 |
- prompt eval time = 148.87 ms / 199 tokens ( 0.75 ms per token, 1336.72 tokens per second)
- eval time = 8079.57 ms / 60 tokens ( 134.66 ms per token, 7.43 tokens per second)
- total time = 8228.44 ms / 259 tokens
- slot launch_slot_: id 12 | task 27021 | processing task
- slot update_slots: id 12 | task 27021 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 12 | task 27021 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 12 | task 27021 | kv cache rm [0, end)
- slot update_slots: id 12 | task 27021 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 12 | task 27021 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 7 | task 27000 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 7 | task 27000 |
- prompt eval time = 150.96 ms / 199 tokens ( 0.76 ms per token, 1318.20 tokens per second)
- eval time = 8247.51 ms / 60 tokens ( 137.46 ms per token, 7.27 tokens per second)
- total time = 8398.48 ms / 259 tokens
- slot launch_slot_: id 7 | task 27022 | processing task
- slot update_slots: id 7 | task 27022 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 7 | task 27022 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 7 | task 27022 | kv cache rm [0, end)
- slot update_slots: id 7 | task 27022 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 7 | task 27022 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 29 | task 27004 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 40 | task 27005 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 27008 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 18 | task 27009 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 29 | task 27004 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 29 | task 27004 |
- prompt eval time = 231.17 ms / 199 tokens ( 1.16 ms per token, 860.84 tokens per second)
- eval time = 7844.59 ms / 60 tokens ( 130.74 ms per token, 7.65 tokens per second)
- total time = 8075.76 ms / 259 tokens
- slot launch_slot_: id 29 | task 27023 | processing task
- slot update_slots: id 42 | task 27006 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 29 | task 27023 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 29 | task 27023 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 29 | task 27023 | kv cache rm [0, end)
- slot update_slots: id 29 | task 27023 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 29 | task 27023 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 1 | task 27008 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 1 | task 27008 |
- prompt eval time = 400.65 ms / 199 tokens ( 2.01 ms per token, 496.69 tokens per second)
- eval time = 7594.81 ms / 60 tokens ( 126.58 ms per token, 7.90 tokens per second)
- total time = 7995.47 ms / 259 tokens
- slot release: id 18 | task 27009 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 18 | task 27009 |
- prompt eval time = 401.90 ms / 199 tokens ( 2.02 ms per token, 495.14 tokens per second)
- eval time = 7594.97 ms / 60 tokens ( 126.58 ms per token, 7.90 tokens per second)
- total time = 7996.88 ms / 259 tokens
- slot launch_slot_: id 1 | task 27024 | processing task
- slot launch_slot_: id 18 | task 27025 | processing task
- slot update_slots: id 1 | task 27024 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 1 | task 27024 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 1 | task 27024 | kv cache rm [0, end)
- slot update_slots: id 1 | task 27024 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 1 | task 27024 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 18 | task 27025 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 18 | task 27025 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 18 | task 27025 | kv cache rm [0, end)
- slot update_slots: id 18 | task 27025 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 18 | task 27025 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 42 | task 27006 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 42 | task 27006 |
- prompt eval time = 153.56 ms / 199 tokens ( 0.77 ms per token, 1295.94 tokens per second)
- eval time = 7624.58 ms / 59 tokens ( 129.23 ms per token, 7.74 tokens per second)
- total time = 7778.13 ms / 258 tokens
- slot launch_slot_: id 42 | task 27026 | processing task
- slot update_slots: id 42 | task 27026 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 42 | task 27026 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 42 | task 27026 | kv cache rm [0, end)
- slot update_slots: id 42 | task 27026 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 42 | task 27026 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 63 | task 26973 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 63 | task 26973 |
- prompt eval time = 155.86 ms / 199 tokens ( 0.78 ms per token, 1276.78 tokens per second)
- eval time = 18598.17 ms / 123 tokens ( 151.20 ms per token, 6.61 tokens per second)
- total time = 18754.04 ms / 322 tokens
- slot launch_slot_: id 63 | task 27028 | processing task
- slot update_slots: id 34 | task 26922 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 63 | task 27028 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 63 | task 27028 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 63 | task 27028 | kv cache rm [0, end)
- slot update_slots: id 63 | task 27028 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 63 | task 27028 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 60 | task 26983 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 60 | task 26983 |
- prompt eval time = 409.63 ms / 199 tokens ( 2.06 ms per token, 485.80 tokens per second)
- eval time = 17940.37 ms / 123 tokens ( 145.86 ms per token, 6.86 tokens per second)
- total time = 18350.00 ms / 322 tokens
- slot launch_slot_: id 60 | task 27032 | processing task
- slot update_slots: id 60 | task 27032 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 60 | task 27032 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 60 | task 27032 | kv cache rm [0, end)
- slot update_slots: id 60 | task 27032 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 60 | task 27032 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 50 | task 26866 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 36 | task 26869 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 27010 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 28 | task 26977 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 28 | task 26977 |
- prompt eval time = 444.97 ms / 199 tokens ( 2.24 ms per token, 447.22 tokens per second)
- eval time = 19607.03 ms / 132 tokens ( 148.54 ms per token, 6.73 tokens per second)
- total time = 20052.00 ms / 331 tokens
- slot launch_slot_: id 28 | task 27033 | processing task
- slot update_slots: id 28 | task 27033 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 28 | task 27033 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 28 | task 27033 | kv cache rm [0, end)
- slot update_slots: id 28 | task 27033 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 28 | task 27033 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 41 | task 27010 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 41 | task 27010 |
- prompt eval time = 158.54 ms / 199 tokens ( 0.80 ms per token, 1255.17 tokens per second)
- eval time = 8339.88 ms / 60 tokens ( 139.00 ms per token, 7.19 tokens per second)
- total time = 8498.42 ms / 259 tokens
- slot launch_slot_: id 41 | task 27034 | processing task
- slot update_slots: id 41 | task 27034 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 41 | task 27034 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 41 | task 27034 | kv cache rm [0, end)
- slot update_slots: id 41 | task 27034 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 41 | task 27034 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 24 | task 26621 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 21 | task 27011 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 31 | task 26995 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 31 | task 26995 |
- prompt eval time = 240.26 ms / 199 tokens ( 1.21 ms per token, 828.27 tokens per second)
- eval time = 14890.79 ms / 101 tokens ( 147.43 ms per token, 6.78 tokens per second)
- total time = 15131.05 ms / 300 tokens
- slot launch_slot_: id 31 | task 27039 | processing task
- slot update_slots: id 31 | task 27039 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 31 | task 27039 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 31 | task 27039 | kv cache rm [0, end)
- slot update_slots: id 31 | task 27039 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 31 | task 27039 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 14 | task 26627 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 51 | task 27012 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 56 | task 26997 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 56 | task 26997 |
- prompt eval time = 377.81 ms / 199 tokens ( 1.90 ms per token, 526.72 tokens per second)
- eval time = 14171.34 ms / 101 tokens ( 140.31 ms per token, 7.13 tokens per second)
- total time = 14549.16 ms / 300 tokens
- slot launch_slot_: id 56 | task 27046 | processing task
- slot update_slots: id 26 | task 27013 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 56 | task 27046 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 56 | task 27046 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 56 | task 27046 | kv cache rm [0, end)
- slot update_slots: id 56 | task 27046 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 56 | task 27046 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 17 | task 27014 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 13 | task 26877 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 26 | task 27013 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 26 | task 27013 |
- prompt eval time = 157.18 ms / 199 tokens ( 0.79 ms per token, 1266.10 tokens per second)
- eval time = 8849.03 ms / 60 tokens ( 147.48 ms per token, 6.78 tokens per second)
- total time = 9006.20 ms / 259 tokens
- slot launch_slot_: id 26 | task 27047 | processing task
- slot update_slots: id 26 | task 27047 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 26 | task 27047 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 26 | task 27047 | kv cache rm [0, end)
- slot update_slots: id 26 | task 27047 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 26 | task 27047 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 3 | task 27015 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 52 | task 26742 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 3 | task 27015 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 3 | task 27015 |
- prompt eval time = 155.94 ms / 199 tokens ( 0.78 ms per token, 1276.12 tokens per second)
- eval time = 8672.22 ms / 60 tokens ( 144.54 ms per token, 6.92 tokens per second)
- total time = 8828.16 ms / 259 tokens
- slot release: id 20 | task 26989 | stop processing: n_past = 194, truncated = 1
- slot print_timing: id 20 | task 26989 |
- prompt eval time = 155.50 ms / 199 tokens ( 0.78 ms per token, 1279.78 tokens per second)
- eval time = 18903.83 ms / 123 tokens ( 153.69 ms per token, 6.51 tokens per second)
- total time = 19059.32 ms / 322 tokens
- slot launch_slot_: id 3 | task 27048 | processing task
- slot launch_slot_: id 20 | task 27049 | processing task
- slot update_slots: id 2 | task 27016 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 3 | task 27048 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 3 | task 27048 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 3 | task 27048 | kv cache rm [0, end)
- slot update_slots: id 3 | task 27048 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 3 | task 27048 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 20 | task 27049 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 20 | task 27049 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 20 | task 27049 | kv cache rm [0, end)
- slot update_slots: id 20 | task 27049 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 20 | task 27049 | prompt done, n_past = 199, n_tokens = 460
- slot update_slots: id 57 | task 26752 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 2 | task 27016 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 2 | task 27016 |
- prompt eval time = 155.91 ms / 199 tokens ( 0.78 ms per token, 1276.42 tokens per second)
- eval time = 8845.40 ms / 60 tokens ( 147.42 ms per token, 6.78 tokens per second)
- total time = 9001.30 ms / 259 tokens
- slot launch_slot_: id 2 | task 27050 | processing task
- slot update_slots: id 2 | task 27050 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 2 | task 27050 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 2 | task 27050 | kv cache rm [0, end)
- slot update_slots: id 2 | task 27050 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 2 | task 27050 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 30 | task 26629 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 39 | task 26966 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 6 | task 26605 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 43 | task 26653 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 19 | task 26483 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 22 | task 26488 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 23 | task 26535 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 25 | task 26512 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 35 | task 26522 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 37 | task 26484 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 48 | task 26498 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 53 | task 26776 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 54 | task 26525 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 22 | task 26488 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 22 | task 26488 |
- prompt eval time = 1696.41 ms / 1 tokens ( 1696.41 ms per token, 0.59 tokens per second)
- eval time = 120753.73 ms / 694 tokens ( 174.00 ms per token, 5.75 tokens per second)
- total time = 122450.14 ms / 695 tokens
- slot release: id 59 | task 26994 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 59 | task 26994 |
- prompt eval time = 452.45 ms / 199 tokens ( 2.27 ms per token, 439.82 tokens per second)
- eval time = 19659.24 ms / 132 tokens ( 148.93 ms per token, 6.71 tokens per second)
- total time = 20111.70 ms / 331 tokens
- slot launch_slot_: id 22 | task 27051 | processing task
- slot launch_slot_: id 59 | task 27052 | processing task
- slot update_slots: id 12 | task 27021 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 22 | task 27051 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 22 | task 27051 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 22 | task 27051 | kv cache rm [0, end)
- slot update_slots: id 22 | task 27051 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 22 | task 27051 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 59 | task 27052 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 59 | task 27052 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 59 | task 27052 | kv cache rm [0, end)
- slot update_slots: id 59 | task 27052 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 59 | task 27052 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 58 | task 26996 | stop processing: n_past = 203, truncated = 1
- slot print_timing: id 58 | task 26996 |
- prompt eval time = 242.38 ms / 199 tokens ( 1.22 ms per token, 821.01 tokens per second)
- eval time = 20002.60 ms / 132 tokens ( 151.53 ms per token, 6.60 tokens per second)
- total time = 20244.99 ms / 331 tokens
- slot launch_slot_: id 58 | task 27053 | processing task
- slot update_slots: id 7 | task 27022 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 58 | task 27053 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 58 | task 27053 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 58 | task 27053 | kv cache rm [0, end)
- slot update_slots: id 58 | task 27053 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 58 | task 27053 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 46 | task 26895 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 7 | task 27022 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 7 | task 27022 |
- prompt eval time = 149.33 ms / 199 tokens ( 0.75 ms per token, 1332.62 tokens per second)
- eval time = 9764.40 ms / 60 tokens ( 162.74 ms per token, 6.14 tokens per second)
- total time = 9913.73 ms / 259 tokens
- slot launch_slot_: id 7 | task 27058 | processing task
- slot update_slots: id 7 | task 27058 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 7 | task 27058 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 7 | task 27058 | kv cache rm [0, end)
- slot update_slots: id 7 | task 27058 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 7 | task 27058 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 29 | task 27023 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 1 | task 27024 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 18 | task 27025 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 42 | task 27026 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 63 | task 27028 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 1 | task 27024 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 1 | task 27024 |
- prompt eval time = 403.37 ms / 199 tokens ( 2.03 ms per token, 493.34 tokens per second)
- eval time = 9411.40 ms / 60 tokens ( 156.86 ms per token, 6.38 tokens per second)
- total time = 9814.77 ms / 259 tokens
- slot launch_slot_: id 1 | task 27057 | processing task
- slot update_slots: id 1 | task 27057 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 1 | task 27057 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 1 | task 27057 | kv cache rm [0, end)
- slot update_slots: id 1 | task 27057 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 1 | task 27057 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 42 | task 27026 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 42 | task 27026 |
- prompt eval time = 153.01 ms / 199 tokens ( 0.77 ms per token, 1300.54 tokens per second)
- eval time = 9408.41 ms / 60 tokens ( 156.81 ms per token, 6.38 tokens per second)
- total time = 9561.42 ms / 259 tokens
- slot launch_slot_: id 42 | task 27063 | processing task
- slot update_slots: id 42 | task 27063 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 42 | task 27063 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 42 | task 27063 | kv cache rm [0, end)
- slot update_slots: id 42 | task 27063 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 42 | task 27063 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 63 | task 27028 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 63 | task 27028 |
- prompt eval time = 325.14 ms / 199 tokens ( 1.63 ms per token, 612.05 tokens per second)
- eval time = 9236.60 ms / 60 tokens ( 153.94 ms per token, 6.50 tokens per second)
- total time = 9561.74 ms / 259 tokens
- slot launch_slot_: id 63 | task 27062 | processing task
- slot update_slots: id 63 | task 27062 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 63 | task 27062 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 63 | task 27062 | kv cache rm [0, end)
- slot update_slots: id 63 | task 27062 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 63 | task 27062 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 60 | task 27032 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 28 | task 27033 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 41 | task 27034 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 4 | task 26792 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 15 | task 26795 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 28 | task 27033 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 28 | task 27033 |
- prompt eval time = 153.05 ms / 199 tokens ( 0.77 ms per token, 1300.22 tokens per second)
- eval time = 8929.33 ms / 60 tokens ( 148.82 ms per token, 6.72 tokens per second)
- total time = 9082.39 ms / 259 tokens
- slot launch_slot_: id 28 | task 27064 | processing task
- slot update_slots: id 28 | task 27064 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 28 | task 27064 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 28 | task 27064 | kv cache rm [0, end)
- slot update_slots: id 28 | task 27064 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 28 | task 27064 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 11 | task 26813 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 40 | task 27005 | stop processing: n_past = 210, truncated = 1
- slot print_timing: id 40 | task 27005 |
- prompt eval time = 231.86 ms / 199 tokens ( 1.17 ms per token, 858.27 tokens per second)
- eval time = 19926.58 ms / 139 tokens ( 143.36 ms per token, 6.98 tokens per second)
- total time = 20158.45 ms / 338 tokens
- slot launch_slot_: id 40 | task 27065 | processing task
- slot update_slots: id 31 | task 27039 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 40 | task 27065 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 40 | task 27065 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 40 | task 27065 | kv cache rm [0, end)
- slot update_slots: id 40 | task 27065 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 40 | task 27065 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 12 | task 27021 | stop processing: n_past = 157, truncated = 1
- slot print_timing: id 12 | task 27021 |
- prompt eval time = 317.27 ms / 199 tokens ( 1.59 ms per token, 627.23 tokens per second)
- eval time = 12951.89 ms / 86 tokens ( 150.60 ms per token, 6.64 tokens per second)
- total time = 13269.16 ms / 285 tokens
- slot launch_slot_: id 12 | task 27066 | processing task
- slot update_slots: id 12 | task 27066 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 12 | task 27066 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 12 | task 27066 | kv cache rm [0, end)
- slot update_slots: id 12 | task 27066 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 12 | task 27066 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 47 | task 26903 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 55 | task 26686 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 31 | task 27039 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 31 | task 27039 |
- prompt eval time = 156.26 ms / 199 tokens ( 0.79 ms per token, 1273.52 tokens per second)
- eval time = 9155.59 ms / 60 tokens ( 152.59 ms per token, 6.55 tokens per second)
- total time = 9311.85 ms / 259 tokens
- slot launch_slot_: id 31 | task 27067 | processing task
- slot update_slots: id 8 | task 26904 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 31 | task 27067 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 31 | task 27067 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 31 | task 27067 | kv cache rm [0, end)
- slot update_slots: id 31 | task 27067 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 31 | task 27067 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 56 | task 27046 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 56 | task 27046 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 56 | task 27046 |
- prompt eval time = 164.10 ms / 199 tokens ( 0.82 ms per token, 1212.70 tokens per second)
- eval time = 9334.54 ms / 60 tokens ( 155.58 ms per token, 6.43 tokens per second)
- total time = 9498.64 ms / 259 tokens
- slot launch_slot_: id 56 | task 27072 | processing task
- slot update_slots: id 26 | task 27047 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 56 | task 27072 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 56 | task 27072 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 56 | task 27072 | kv cache rm [0, end)
- slot update_slots: id 56 | task 27072 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 56 | task 27072 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 10 | task 26821 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 26 | task 27047 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 26 | task 27047 |
- prompt eval time = 155.48 ms / 199 tokens ( 0.78 ms per token, 1279.88 tokens per second)
- eval time = 9077.03 ms / 60 tokens ( 151.28 ms per token, 6.61 tokens per second)
- total time = 9232.51 ms / 259 tokens
- slot launch_slot_: id 26 | task 27073 | processing task
- slot update_slots: id 45 | task 26908 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 26 | task 27073 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 26 | task 27073 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 26 | task 27073 | kv cache rm [0, end)
- slot update_slots: id 26 | task 27073 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 26 | task 27073 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 5 | task 26905 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 3 | task 27048 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 20 | task 27049 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 29 | task 27023 | stop processing: n_past = 164, truncated = 1
- slot print_timing: id 29 | task 27023 |
- prompt eval time = 153.35 ms / 199 tokens ( 0.77 ms per token, 1297.66 tokens per second)
- eval time = 15054.80 ms / 93 tokens ( 161.88 ms per token, 6.18 tokens per second)
- total time = 15208.15 ms / 292 tokens
- slot launch_slot_: id 29 | task 27074 | processing task
- slot update_slots: id 9 | task 26823 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 29 | task 27074 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 29 | task 27074 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 29 | task 27074 | kv cache rm [0, end)
- slot update_slots: id 29 | task 27074 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 29 | task 27074 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 3 | task 27048 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 3 | task 27048 |
- prompt eval time = 241.02 ms / 199 tokens ( 1.21 ms per token, 825.65 tokens per second)
- eval time = 9764.40 ms / 60 tokens ( 162.74 ms per token, 6.14 tokens per second)
- total time = 10005.42 ms / 259 tokens
- slot release: id 20 | task 27049 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 20 | task 27049 |
- prompt eval time = 242.24 ms / 199 tokens ( 1.22 ms per token, 821.52 tokens per second)
- eval time = 9764.61 ms / 60 tokens ( 162.74 ms per token, 6.14 tokens per second)
- total time = 10006.84 ms / 259 tokens
- slot release: id 21 | task 27011 | stop processing: n_past = 202, truncated = 1
- slot print_timing: id 21 | task 27011 |
- prompt eval time = 152.41 ms / 199 tokens ( 0.77 ms per token, 1305.71 tokens per second)
- eval time = 20480.83 ms / 131 tokens ( 156.34 ms per token, 6.40 tokens per second)
- total time = 20633.24 ms / 330 tokens
- slot launch_slot_: id 3 | task 27075 | processing task
- slot launch_slot_: id 20 | task 27079 | processing task
- slot launch_slot_: id 21 | task 27080 | processing task
- slot update_slots: id 2 | task 27050 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 3 | task 27075 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 3 | task 27075 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 3 | task 27075 | kv cache rm [0, end)
- slot update_slots: id 3 | task 27075 | prompt processing progress, n_past = 199, n_tokens = 260, progress = 1.000000
- slot update_slots: id 3 | task 27075 | prompt done, n_past = 199, n_tokens = 260
- slot update_slots: id 20 | task 27079 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 20 | task 27079 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 20 | task 27079 | kv cache rm [0, end)
- slot update_slots: id 20 | task 27079 | prompt processing progress, n_past = 199, n_tokens = 459, progress = 1.000000
- slot update_slots: id 20 | task 27079 | prompt done, n_past = 199, n_tokens = 459
- slot update_slots: id 21 | task 27080 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 21 | task 27080 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 21 | task 27080 | kv cache rm [0, end)
- slot update_slots: id 21 | task 27080 | prompt processing progress, n_past = 199, n_tokens = 658, progress = 1.000000
- slot update_slots: id 21 | task 27080 | prompt done, n_past = 199, n_tokens = 658
- slot release: id 2 | task 27050 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 2 | task 27050 |
- prompt eval time = 462.52 ms / 199 tokens ( 2.32 ms per token, 430.25 tokens per second)
- eval time = 10002.27 ms / 60 tokens ( 166.70 ms per token, 6.00 tokens per second)
- total time = 10464.79 ms / 259 tokens
- slot launch_slot_: id 2 | task 27082 | processing task
- slot update_slots: id 2 | task 27082 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 2 | task 27082 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 2 | task 27082 | kv cache rm [0, end)
- slot update_slots: id 2 | task 27082 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 2 | task 27082 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 18 | task 27025 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 18 | task 27025 |
- prompt eval time = 404.77 ms / 199 tokens ( 2.03 ms per token, 491.64 tokens per second)
- eval time = 16506.38 ms / 101 tokens ( 163.43 ms per token, 6.12 tokens per second)
- total time = 16911.14 ms / 300 tokens
- slot launch_slot_: id 18 | task 27088 | processing task
- slot update_slots: id 0 | task 26700 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 18 | task 27088 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 18 | task 27088 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 18 | task 27088 | kv cache rm [0, end)
- slot update_slots: id 18 | task 27088 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 18 | task 27088 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 49 | task 26713 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 61 | task 26715 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 32 | task 26846 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 51 | task 27012 | stop processing: n_past = 212, truncated = 1
- slot print_timing: id 51 | task 27012 |
- prompt eval time = 159.87 ms / 199 tokens ( 0.80 ms per token, 1244.73 tokens per second)
- eval time = 22327.81 ms / 141 tokens ( 158.35 ms per token, 6.31 tokens per second)
- total time = 22487.68 ms / 340 tokens
- slot release: id 60 | task 27032 | stop processing: n_past = 172, truncated = 1
- slot print_timing: id 60 | task 27032 |
- prompt eval time = 153.41 ms / 199 tokens ( 0.77 ms per token, 1297.14 tokens per second)
- eval time = 16459.57 ms / 101 tokens ( 162.97 ms per token, 6.14 tokens per second)
- total time = 16612.99 ms / 300 tokens
- slot launch_slot_: id 51 | task 27093 | processing task
- slot launch_slot_: id 60 | task 27094 | processing task
- slot update_slots: id 22 | task 27051 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 59 | task 27052 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 51 | task 27093 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 51 | task 27093 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 51 | task 27093 | kv cache rm [0, end)
- slot update_slots: id 51 | task 27093 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 51 | task 27093 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 60 | task 27094 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 60 | task 27094 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 60 | task 27094 | kv cache rm [0, end)
- slot update_slots: id 60 | task 27094 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 60 | task 27094 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 22 | task 27051 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 22 | task 27051 |
- prompt eval time = 407.07 ms / 199 tokens ( 2.05 ms per token, 488.86 tokens per second)
- eval time = 10094.44 ms / 59 tokens ( 171.09 ms per token, 5.84 tokens per second)
- total time = 10501.51 ms / 258 tokens
- slot launch_slot_: id 22 | task 27097 | processing task
- slot update_slots: id 16 | task 26582 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 58 | task 27053 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 22 | task 27097 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 22 | task 27097 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 22 | task 27097 | kv cache rm [0, end)
- slot update_slots: id 22 | task 27097 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 22 | task 27097 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 17 | task 27014 | stop processing: n_past = 212, truncated = 1
- slot print_timing: id 17 | task 27014 |
- prompt eval time = 328.62 ms / 199 tokens ( 1.65 ms per token, 605.57 tokens per second)
- eval time = 22376.85 ms / 141 tokens ( 158.70 ms per token, 6.30 tokens per second)
- total time = 22705.47 ms / 340 tokens
- slot release: id 59 | task 27052 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 59 | task 27052 |
- prompt eval time = 410.06 ms / 199 tokens ( 2.06 ms per token, 485.29 tokens per second)
- eval time = 10478.62 ms / 60 tokens ( 174.64 ms per token, 5.73 tokens per second)
- total time = 10888.68 ms / 259 tokens
- slot launch_slot_: id 17 | task 27098 | processing task
- slot launch_slot_: id 59 | task 27100 | processing task
- slot update_slots: id 38 | task 26598 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 44 | task 26608 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 17 | task 27098 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 17 | task 27098 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 17 | task 27098 | kv cache rm [0, end)
- slot update_slots: id 17 | task 27098 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 17 | task 27098 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 59 | task 27100 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 59 | task 27100 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 59 | task 27100 | kv cache rm [0, end)
- slot update_slots: id 59 | task 27100 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 59 | task 27100 | prompt done, n_past = 199, n_tokens = 460
- slot update_slots: id 62 | task 26614 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 58 | task 27053 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 58 | task 27053 |
- prompt eval time = 329.27 ms / 199 tokens ( 1.65 ms per token, 604.36 tokens per second)
- eval time = 10602.07 ms / 60 tokens ( 176.70 ms per token, 5.66 tokens per second)
- total time = 10931.35 ms / 259 tokens
- slot launch_slot_: id 58 | task 27102 | processing task
- slot update_slots: id 7 | task 27058 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 58 | task 27102 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 58 | task 27102 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 58 | task 27102 | kv cache rm [0, end)
- slot update_slots: id 58 | task 27102 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 58 | task 27102 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 7 | task 27058 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 7 | task 27058 |
- prompt eval time = 148.56 ms / 199 tokens ( 0.75 ms per token, 1339.57 tokens per second)
- eval time = 10714.82 ms / 60 tokens ( 178.58 ms per token, 5.60 tokens per second)
- total time = 10863.38 ms / 259 tokens
- slot launch_slot_: id 7 | task 27103 | processing task
- slot update_slots: id 7 | task 27103 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 7 | task 27103 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 7 | task 27103 | kv cache rm [0, end)
- slot update_slots: id 7 | task 27103 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 7 | task 27103 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 1 | task 27057 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 27 | task 26919 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 42 | task 27063 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 33 | task 26851 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 63 | task 27062 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 1 | task 27057 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 1 | task 27057 |
- prompt eval time = 146.90 ms / 199 tokens ( 0.74 ms per token, 1354.70 tokens per second)
- eval time = 10602.70 ms / 60 tokens ( 176.71 ms per token, 5.66 tokens per second)
- total time = 10749.60 ms / 259 tokens
- slot launch_slot_: id 1 | task 27105 | processing task
- slot update_slots: id 1 | task 27105 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 1 | task 27105 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 1 | task 27105 | kv cache rm [0, end)
- slot update_slots: id 1 | task 27105 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 1 | task 27105 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 63 | task 27062 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 63 | task 27062 |
- prompt eval time = 322.56 ms / 199 tokens ( 1.62 ms per token, 616.93 tokens per second)
- eval time = 10491.87 ms / 60 tokens ( 174.86 ms per token, 5.72 tokens per second)
- total time = 10814.43 ms / 259 tokens
- slot launch_slot_: id 63 | task 27107 | processing task
- slot update_slots: id 63 | task 27107 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 63 | task 27107 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 63 | task 27107 | kv cache rm [0, end)
- slot update_slots: id 63 | task 27107 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 63 | task 27107 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 28 | task 27064 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 34 | task 26922 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 28 | task 27064 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 28 | task 27064 |
- prompt eval time = 368.78 ms / 199 tokens ( 1.85 ms per token, 539.61 tokens per second)
- eval time = 10276.33 ms / 60 tokens ( 171.27 ms per token, 5.84 tokens per second)
- total time = 10645.11 ms / 259 tokens
- slot launch_slot_: id 28 | task 27110 | processing task
- slot update_slots: id 28 | task 27110 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 28 | task 27110 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 28 | task 27110 | kv cache rm [0, end)
- slot update_slots: id 28 | task 27110 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 28 | task 27110 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 40 | task 27065 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 50 | task 26866 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 12 | task 27066 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 36 | task 26869 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 40 | task 27065 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 40 | task 27065 |
- prompt eval time = 157.14 ms / 199 tokens ( 0.79 ms per token, 1266.39 tokens per second)
- eval time = 10280.34 ms / 60 tokens ( 171.34 ms per token, 5.84 tokens per second)
- total time = 10437.48 ms / 259 tokens
- slot launch_slot_: id 40 | task 27112 | processing task
- slot update_slots: id 31 | task 27067 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 40 | task 27112 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 40 | task 27112 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 40 | task 27112 | kv cache rm [0, end)
- slot update_slots: id 40 | task 27112 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 40 | task 27112 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 31 | task 27067 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 31 | task 27067 |
- prompt eval time = 433.16 ms / 199 tokens ( 2.18 ms per token, 459.41 tokens per second)
- eval time = 9788.50 ms / 60 tokens ( 163.14 ms per token, 6.13 tokens per second)
- total time = 10221.66 ms / 259 tokens
- slot release: id 41 | task 27034 | stop processing: n_past = 199, truncated = 1
- slot print_timing: id 41 | task 27034 |
- prompt eval time = 369.30 ms / 199 tokens ( 1.86 ms per token, 538.86 tokens per second)
- eval time = 20307.83 ms / 128 tokens ( 158.65 ms per token, 6.30 tokens per second)
- total time = 20677.13 ms / 327 tokens
- slot launch_slot_: id 31 | task 27133 | processing task
- slot launch_slot_: id 41 | task 27134 | processing task
- slot update_slots: id 31 | task 27133 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 31 | task 27133 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 31 | task 27133 | kv cache rm [0, end)
- slot update_slots: id 31 | task 27133 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 31 | task 27133 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 41 | task 27134 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 41 | task 27134 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 41 | task 27134 | kv cache rm [0, end)
- slot update_slots: id 41 | task 27134 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 41 | task 27134 | prompt done, n_past = 199, n_tokens = 460
- slot update_slots: id 56 | task 27072 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 24 | task 26621 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 56 | task 27072 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 56 | task 27072 |
- prompt eval time = 159.04 ms / 199 tokens ( 0.80 ms per token, 1251.27 tokens per second)
- eval time = 10330.18 ms / 60 tokens ( 172.17 ms per token, 5.81 tokens per second)
- total time = 10489.22 ms / 259 tokens
- slot launch_slot_: id 56 | task 27139 | processing task
- slot update_slots: id 26 | task 27073 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 56 | task 27139 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 56 | task 27139 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 56 | task 27139 | kv cache rm [0, end)
- slot update_slots: id 56 | task 27139 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 56 | task 27139 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 26 | task 27073 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 26 | task 27073 |
- prompt eval time = 157.12 ms / 199 tokens ( 0.79 ms per token, 1266.55 tokens per second)
- eval time = 10325.87 ms / 60 tokens ( 172.10 ms per token, 5.81 tokens per second)
- total time = 10482.99 ms / 259 tokens
- slot launch_slot_: id 26 | task 27140 | processing task
- slot update_slots: id 26 | task 27140 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 26 | task 27140 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 26 | task 27140 | kv cache rm [0, end)
- slot update_slots: id 26 | task 27140 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 26 | task 27140 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 14 | task 26627 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 29 | task 27074 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 3 | task 27075 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 20 | task 27079 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 21 | task 27080 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 29 | task 27074 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 29 | task 27074 |
- prompt eval time = 155.77 ms / 199 tokens ( 0.78 ms per token, 1277.55 tokens per second)
- eval time = 9941.09 ms / 60 tokens ( 165.68 ms per token, 6.04 tokens per second)
- total time = 10096.86 ms / 259 tokens
- slot launch_slot_: id 29 | task 27141 | processing task
- slot update_slots: id 29 | task 27141 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 29 | task 27141 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 29 | task 27141 | kv cache rm [0, end)
- slot update_slots: id 29 | task 27141 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 29 | task 27141 | prompt done, n_past = 199, n_tokens = 262
- slot release: id 20 | task 27079 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 20 | task 27079 |
- prompt eval time = 343.21 ms / 199 tokens ( 1.72 ms per token, 579.82 tokens per second)
- eval time = 10094.66 ms / 60 tokens ( 168.24 ms per token, 5.94 tokens per second)
- total time = 10437.87 ms / 259 tokens
- slot release: id 21 | task 27080 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 21 | task 27080 |
- prompt eval time = 343.22 ms / 199 tokens ( 1.72 ms per token, 579.80 tokens per second)
- eval time = 10094.71 ms / 60 tokens ( 168.25 ms per token, 5.94 tokens per second)
- total time = 10437.93 ms / 259 tokens
- slot launch_slot_: id 20 | task 27142 | processing task
- slot launch_slot_: id 21 | task 27145 | processing task
- slot update_slots: id 2 | task 27082 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 13 | task 26877 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 20 | task 27142 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 20 | task 27142 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 20 | task 27142 | kv cache rm [0, end)
- slot update_slots: id 20 | task 27142 | prompt processing progress, n_past = 199, n_tokens = 261, progress = 1.000000
- slot update_slots: id 20 | task 27142 | prompt done, n_past = 199, n_tokens = 261
- slot update_slots: id 21 | task 27145 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 21 | task 27145 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 21 | task 27145 | kv cache rm [0, end)
- slot update_slots: id 21 | task 27145 | prompt processing progress, n_past = 199, n_tokens = 460, progress = 1.000000
- slot update_slots: id 21 | task 27145 | prompt done, n_past = 199, n_tokens = 460
- slot release: id 2 | task 27082 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 2 | task 27082 |
- prompt eval time = 150.92 ms / 199 tokens ( 0.76 ms per token, 1318.60 tokens per second)
- eval time = 9825.65 ms / 60 tokens ( 163.76 ms per token, 6.11 tokens per second)
- total time = 9976.57 ms / 259 tokens
- slot launch_slot_: id 2 | task 27164 | processing task
- slot update_slots: id 2 | task 27164 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 2 | task 27164 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 2 | task 27164 | kv cache rm [0, end)
- slot update_slots: id 2 | task 27164 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 2 | task 27164 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 18 | task 27088 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 52 | task 26742 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 52 | task 26742 | stop processing: n_past = 130, truncated = 1
- slot print_timing: id 52 | task 26742 |
- prompt eval time = 546.30 ms / 199 tokens ( 2.75 ms per token, 364.27 tokens per second)
- eval time = 96523.40 ms / 567 tokens ( 170.24 ms per token, 5.87 tokens per second)
- total time = 97069.70 ms / 766 tokens
- slot launch_slot_: id 52 | task 27167 | processing task
- slot update_slots: id 52 | task 27167 | new prompt, n_ctx_slot = 256, n_keep = 0, n_prompt_tokens = 8007
- slot update_slots: id 52 | task 27167 | input truncated, n_ctx = 256, n_keep = 0, n_left = 256, n_prompt_tokens = 199
- slot update_slots: id 52 | task 27167 | kv cache rm [0, end)
- slot update_slots: id 52 | task 27167 | prompt processing progress, n_past = 199, n_tokens = 262, progress = 1.000000
- slot update_slots: id 52 | task 27167 | prompt done, n_past = 199, n_tokens = 262
- slot update_slots: id 57 | task 26752 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 51 | task 27093 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 60 | task 27094 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot update_slots: id 22 | task 27097 | slot context shift, n_keep = 0, n_left = 255, n_discard = 127
- slot release: id 51 | task 27093 | stop processing: n_past = 131, truncated = 1
- slot print_timing: id 51 | task 27093 |
- prompt eval time = 243.72 ms / 199 tokens ( 1.22 ms per token, 816.51 tokens per second)
- eval time = 10145.84 ms / 60 tokens ( 169.10 ms per token, 5.91 tokens per second)
- total time = 10389.56 ms / 259 tokens