Guest User

Untitled

a guest
Oct 13th, 2025
10
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.40 KB | None | 0 0
  1. start "" /High /B /Wait llama-server.exe ^
  2. -m "G:\LLM\models\GLM-4.5-Air-Q4_K_S\GLM-4.5-Air-Q4_K_S-00001-of-00002.gguf" ^
  3. -c 32768 ^
  4. -t 5 ^
  5. -fa on --prio-batch 2 -ub 2048 -b 2048 ^
  6. -ctk q8_0 -ctv q8_0 ^
  7. -ngl 999 ^
  8. -ot "blk.1\.ffn_.*_exps\.=CUDA0,blk.2\.ffn_.*_exps\.=CUDA0,blk.3\.ffn_.*_exps\.=CUDA0,blk.4\.ffn_.*_exps\.=CUDA0,blk.5\.ffn_.*_exps\.=CUDA0" ^
  9. --cpu-moe ^
  10. --no-context-shift ^
  11. --no-mmap
Advertisement
Add Comment
Please, Sign In to add comment