2025 03 22 dhwani roadmap v2 march 22 27
Dhwani- v2 - Roadmap - March 22-27
- Main issue -
Dark theme - old phone is broken Not usable to access settings
- Api server - For routing and loadbalamcing update system From python to go ?
Make it serve with less resources and high throughput
- Canvas/ message bist Message reaponse body should be markdown reader.
To present data in a nice format
- Auto Voice language Sample 2 sec audio on each Language for Transcription
Pass it via asr for the available Language and get text in multiple Language
Use Indic lid for text to match exact language.
Currently ASR is not streaming,
We want to add streaming voice input first and experiment with language identification.
- Live transcription- earphone to App ? Stream AsR / feed to b to translate
Show real time audioc in n text
- University Collab / access
Register with Uni email .
Get access token and build so.
Provide info / about app Chankya uni in app.
Add a separate tab / rag based /
- App Features/ characters
Add - status icon in settings page
Show availability of service
Choose- better models
Add - option for character's / stoeries
Ramayana/ mahabharsya Non-copyrighted books only
- API Server - user management Csv uploader - server - restart
Db backups?
Name , type Type - mobile Type- web
Username - full-email id
Password: username part before @
Allowed- domains
gmail.com chanakyauniversity.edu.in
Add - gpu check ? Torch compile Use bfloat16 for l4 and above
- Parler-tts- distillation Make smaller generator/ Distill the project for individual language
Improve speed and accuracy? Can we do it ?
- Dhwani Marketing-integration Create integration with 3rd Party clirnts
Live kit Fast rtx Plivo Twilio Whatsapp Api
- Dhwani - web ux - user management
Create a simple screen on dhwani - website
Login with admin details.
Get list of useers updatws to systen.
Add new users with simple button.
- Dhwani - model - server Fix - issue with asyn calls.
Make load testing of projrct .
Add load balancing to main api
Based on compute available, auto scale the systen Non GPU T4 - L4
Select betwen Gemma3-4b-instruct Gemma3-4b-instruct quantized
Gemma3-1b-instruct Gemma3-1b-instruct quantized
Translation models
Voice model /
Always lazy load
13, Transcription
Translate in real time without llm in betwen.
Suitable for handsfree on mobile app
Make it work for german, Kannada language first.
More users require it immediate.
Set source and target language.
Choose - main screen in setting.