Feb 12th, 2025
ubuntuai@ubuntuai ~/ik_llama.cpp (main)> ./build/bin/llama-server -m ~/models/DeepSeek-R1-GGUF/DeepSeek-R1-UD-IQ1_S/DeepSeek-R1-UD-IQ1_S-00001-of-00003.gguf -c 2048 --host 0.0.0.0 --port 5000 -a DeepSeek-R1 -ngl 36 -mla

INFO [ print_timings] prompt eval time = 842.49 ms / 8 tokens ( 105.31 ms per token, 9.50 tokens per second) | tid="127215082766336" timestamp=1739349739 id_slot=0 id_task=0 t_prompt_processing=842.493 n_prompt_tokens_processed=8 t_token=105.311625 n_tokens_second=9.495627856848662
INFO [ print_timings] generation eval time = 43609.12 ms / 309 runs ( 141.13 ms per token, 7.09 tokens per second) | tid="127215082766336" timestamp=1739349739 id_slot=0 id_task=0 t_token_generation=43609.119 n_decoded=309 t_token=141.1298349514563 n_tokens_second=7.0856739848378965

ubuntuai@ubuntuai ~/ik_llama.cpp (main)> ./build/bin/llama-server -m ~/models/DeepSeek-R1-GGUF/DeepSeek-R1-UD-IQ1_S/DeepSeek-R1-UD-IQ1_S-00001-of-00003.gguf -c 2048 --host 0.0.0.0 --port 5000 -a DeepSeek-R1 -ngl 36
INFO [ print_timings] prompt eval time = 809.67 ms / 8 tokens ( 101.21 ms per token, 9.88 tokens per second) | tid="137948583522304" timestamp=1739347778 id_slot=0 id_task=0 t_prompt_processing=809.667 n_prompt_tokens_processed=8 t_token=101.208375 n_tokens_second=9.88060523647376
INFO [ print_timings] generation eval time = 43648.61 ms / 309 runs ( 141.26 ms per token, 7.08 tokens per second) | tid="137948583522304" timestamp=1739347778 id_slot=0 id_task=0 t_token_generation=43648.614 n_decoded=309 t_token=141.2576504854369 n_tokens_second=7.079262585519897
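
Rough comparison of the two generation lines above, as a minimal Python sketch. It recomputes tokens per second from the `n_decoded` and `t_token_generation` fields in the log and prints the relative difference between the two runs. The helper name `generation_tps` and the regex are illustrative assumptions, not part of ik_llama.cpp; the numbers are copied from the log.

```python
import re

# Numeric key=value pairs, as they appear at the end of each print_timings line.
FIELD = re.compile(r"(\w+)=([0-9.]+)")

def generation_tps(line: str) -> float:
    """Recompute tokens per second from n_decoded and t_token_generation (ms)."""
    fields = dict(FIELD.findall(line))
    return float(fields["n_decoded"]) / float(fields["t_token_generation"]) * 1000.0

# Structured fields of the two generation lines (values copied from the log).
with_mla = "t_token_generation=43609.119 n_decoded=309"
without_mla = "t_token_generation=43648.614 n_decoded=309"

tps_mla = generation_tps(with_mla)
tps_plain = generation_tps(without_mla)
diff_pct = (tps_mla / tps_plain - 1.0) * 100.0  # positive means the -mla run was faster

print(f"-mla: {tps_mla:.2f} t/s, no -mla: {tps_plain:.2f} t/s, difference: {diff_pct:+.2f}%")
```

With a single 8-token prompt and one 309-token generation per run, a gap this small is probably within run-to-run noise.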