2025 07 05 mini dwani v2
Mini - dwani.ai / Cost saving
Run only Gemma3-4b-instruct on llama.cpp with the smallest GPU on GCP
Add ASR + TTS on demand : during events
Conserve the credits- become efficient
Use - free huggingface servers for translation and ASR , run them individually
Skip - tts - find small example for kannada
Run - kokoro / speaxhes.ai and whisper - HF
- Start without power / Make it work with H100/ H200
--
whisper / ast Use indicLid - to identify language and switch between models.
Provide control to user
-
remove - unused subdomain from cloudflare