omm

Models

Run local LLMs with OMM. A desktop app for installing, configuring, and chatting with AI models — powered by a custom inference engine with GPU acceleration.

Install a model
terminal
$ omm pull llama3.2
Features
Popular Models
ModelParamsQuantSize
Llama 3.370BQ4_K_M40 GB
Qwen 2.532BQ4_K_M19 GB
Gemma 327BQ4_K_M16 GB
Mistral Small24BQ4_K_M14 GB
DeepSeek R114BQ5_K_M10 GB
Phi-414BQ4_K_M8 GB
Llama 3.23BQ8_03.2 GB
Qwen 2.5 Coder1.5BQ8_01.6 GB

50,000+ models available via HuggingFace and Ollama registry

Documentation