What model is everyone using for HA voice when self-hosting Ollama? I’ve tried Llama and Qwen, and they understand my commands with varying degrees of success. I’m currently on Llama, as it seems a little better. I just wanted to see if anyone has found a better model.
Edit: as pointed out, this is more of a speech-to-text issue than an LLM issue. I’m looking into alternatives to Whisper.
The Gemma 27B model has been solid for me. I’m using Chatterbox for TTS as well.
27b - how much VRAM does it use?
Looks to be 20 GB of VRAM.
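For a rough sanity check on a number like that: memory for the weights alone is roughly parameter count times bits per weight, divided by 8. The sketch below is only an estimate under that assumption. The function name and the bit widths are hypothetical choices for illustration, not something reported in this thread, and it ignores the KV cache and runtime overhead, which add to the total.

```python
def estimate_weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GB (10^9 bytes).

    Hypothetical helper: ignores KV cache, activations, and runtime overhead.
    """
    bytes_total = params_billion * 1e9 * bits_per_weight / 8
    return bytes_total / 1e9


if __name__ == "__main__":
    # Bit widths are illustrative assumptions; the quantization actually
    # in use depends on which build of the model was pulled.
    for bits in (16, 8, 4):
        gb = estimate_weight_gb(27, bits)
        print(f"27B at {bits}-bit: ~{gb:.1f} GB for weights")
```

If the real usage is higher than the weight-only figure for your quantization, the difference is most likely context (KV cache) and overhead, so a shorter context window can help on a tight card.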