2025 02 10 todo dhwani

Todo - Dhwani - Feb 10, 2025

Move code to FastAPI

Create basic UX / single page for demo

Deploy with HF Spaces?

Learn about Docker container orchestration.


Use batch streaming for speaker audio, to get the same speaker voice.

Combine all generation for each voice at once.

https://github.com/huggingface/parler-tts/blob/main/INFERENCE.md#batch-generation
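
Rough sketch of the "combine all generation for each voice" idea: group script lines by speaker, run one batched call per voice, then put the clips back in script order. This is only an outline under assumptions. `synthesize_batch` is a hypothetical placeholder for whatever batched TTS call ends up being used (for example the batch generation described in the guide linked above), and the `(speaker, text)` script format is made up for illustration.

```python
from collections import defaultdict
from typing import Any, Callable, Dict, List, Sequence, Tuple

# Assumed script format: one (speaker, text) pair per line of dialogue.
Line = Tuple[str, str]


def group_by_speaker(script: Sequence[Line]) -> Dict[str, List[Tuple[int, str]]]:
    """Map each speaker to their (original_index, text) pairs, in script order."""
    groups: Dict[str, List[Tuple[int, str]]] = defaultdict(list)
    for index, (speaker, text) in enumerate(script):
        groups[speaker].append((index, text))
    return dict(groups)


def generate_in_voice_batches(
    script: Sequence[Line],
    synthesize_batch: Callable[[str, List[str]], List[Any]],
) -> List[Any]:
    """Call synthesize_batch once per speaker and restore the script order.

    synthesize_batch(speaker, texts) is a hypothetical hook: it should return
    one audio clip per text, all generated in a single batch for that voice.
    """
    audio: List[Any] = [None] * len(script)
    for speaker, items in group_by_speaker(script).items():
        indices = [index for index, _ in items]
        texts = [text for _, text in items]
        clips = synthesize_batch(speaker, texts)
        if len(clips) != len(texts):
            raise ValueError(f"expected {len(texts)} clips for {speaker}, got {len(clips)}")
        # Write each clip back to the position of its line in the script.
        for index, clip in zip(indices, clips):
            audio[index] = clip
    return audio
```

With this shape the number of TTS calls equals the number of voices, not the number of lines, and each voice's lines are all generated in the same batch.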


Balancing the demo video against the demo itself has become nerve-wracking. Last week it was just an idea; in 7 days the PoC has turned into a production-grade system. UX is the last stretch. Building an immersive audiobook generator with text/speech prompts. slabstech.com/dhwani

--

Use case

Blind and illiterate users can talk to books and learn from a lot of material that is not otherwise available to them.

Braille books can be replaced by speech-to-speech. Users can interact by voice and learn new things.

Cost of access will be reduced significantly


Meta AI - hyperscaler

https://www.cs.cmu.edu/~dskarlat/publications/xfaas_sosp23.pdf

https://cacm.acm.org/research/metas-hyperscale-infrastructure-overview-and-insights/


Hackathon - print QR code. Scan for project idea

Pitch presentation