llml LLM Launcher

gemma-4-26B-A4B-thinking-vision-Q4_K_XL

16-18 GB total memory (RAM + VRAM) for 4-bit.

Model: gemma-4-26B-A4B-thinking-vision-Q4_K_XL
Backend: llama.cpp
Hardware: Mixed
Use case: Chat
Maintainer: @flyingnobita
Last updated: 24 seconds ago

Why this profile exists

The 4-bit quantization needs 16-18 GB of total memory (RAM + VRAM). Vision input requires the mmproj BF16 file. llama.cpp supports both CPU and GPU inference.

Launch configuration

# args
--temp 1.0
--top-p 0.95
--top-k 64
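
As a rough illustration of how these sampling arguments might end up on a llama.cpp command line, the sketch below assembles an argument list. The profile does not say how llml builds the final command, so the binary name `llama-server`, the helper `build_command`, and both file paths are assumptions made for this example only; `--model`, `--mmproj`, `--temp`, `--top-p`, and `--top-k` are standard llama.cpp options.

```python
import shlex

# Sampling arguments copied from the profile's launch configuration.
PROFILE_ARGS = ["--temp", "1.0", "--top-p", "0.95", "--top-k", "64"]


def build_command(model_path, mmproj_path=None, binary="llama-server"):
    """Assemble an argv list for a llama.cpp binary.

    The binary name and the paths are illustrative assumptions; llml
    supplies the real model path at launch.
    """
    cmd = [binary, "--model", model_path]
    # The mmproj file is only needed when vision input is used.
    if mmproj_path is not None:
        cmd += ["--mmproj", mmproj_path]
    return cmd + PROFILE_ARGS


if __name__ == "__main__":
    # Hypothetical placeholder paths, not the actual file names.
    cmd = build_command("/models/model.gguf", "/models/mmproj-BF16.gguf")
    print(shlex.join(cmd))
```

Keeping the command as a list rather than a single string avoids quoting problems when a path contains spaces; `shlex.join` is used only to print it in a copy-pasteable form.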

Hardware assumptions

  • Mixed — tested envelope
  • Cross-platform — backend installed and on PATH
  • Backend: llama.cpp >= current llml-supported version
  • Profile assumes the model file is already on disk; llml supplies the path at launch
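
Two of the assumptions above (backend on PATH, model file already on disk) can be checked before launching. The following is a minimal sketch of such a check, not part of llml itself; the function name `preflight` and the default binary name `llama-server` are assumptions chosen for illustration.

```python
import os
import shutil


def preflight(model_path, binary="llama-server"):
    """Return a list of human-readable problems; an empty list means OK.

    `binary` is an assumed name for the llama.cpp executable.
    """
    problems = []
    # shutil.which searches PATH the same way the shell would.
    if shutil.which(binary) is None:
        problems.append(f"backend binary not found on PATH: {binary}")
    # The profile expects the model file to exist before launch.
    if not os.path.isfile(model_path):
        problems.append(f"model file not found: {model_path}")
    return problems


if __name__ == "__main__":
    # Hypothetical placeholder path.
    for problem in preflight("/models/model.gguf"):
        print(problem)
```

Returning the problems as a list, rather than raising on the first one, lets a caller report every unmet assumption in one pass.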