🕉
Maharshi — Voice Chat
Status: Beta — Maharshi is a work-in-progress. Voice features may evolve.
Speak with Maharshi, a wise Vedic sage who listens and responds with a spoken voice. The experience includes an ambient Omkaaram drone, cosmic-themed visuals, and curated side imagery.
Try it: veda-guru-ai-ui.vercel.app/maharshi
How it works
- Tap the microphone and speak your question
- Browser speech recognition converts your voice to text
- Text is sent to the Maharshi persona LLM (Sarvam AI)
- The sage’s response is spoken aloud via Sarvam AI Text-to-Speech (
bulbul:v3) - Side image panels display relevant Vedic imagery from Wikipedia
- A subtle Omkaaram drone plays in the background
Features
| Feature | Detail |
|---|---|
| Omkaaram | Real CC0 Om chant (Freesound, 13.6s looped) with mute toggle |
| Voice meter | Real-time volume bar shows mic sensitivity |
| Speech-to-text | Browser Web Speech API with continuous listening |
| Text-to-speech | Sarvam AI Bulbul v3 (shubh voice, en-IN) with browser fallback |
| LLM | Sarvam AI sarvam-105b with Maharshi persona |
| Type input | Text input also available below the mic |
| Side imagery | Curated Wikipedia thumbnails matching Vedic concepts (50+ mappings) |
| Markdown | Response rendered with marked.js for rich formatting |
Design
Full-screen immersive experience:
- Background: Cosmic image (
latest_bkg.png) covering the viewport - Glassmorphism: Frosted glass overlays for controls and response text
- Side panels: Ethereal image panels with
mix-blend-mode: screenblending - Responsive: Breakpoints at 1400px, 900px, 600px
- Sound: Om chant loops at low volume; mute toggle in header
Tech
- Page:
maharshi.html(routed via vercel.json →/maharshi) - Logic:
maharshi.js— speech recognition, TTS, image panel management - Styling:
maharshi.css— cosmic background, glassmorphism, side panels, responsive - Endpoint:
POST /api/sage(sage persona, spoken-word friendly, 2-4 sentence replies) - TTS endpoint:
POST /api/tts(proxies to Sarvam AIbulbul:v3) - Images: Curated 50+ Vedic concept → Wikipedia page title mappings →
prop=pageimagesthumbnails - Om audio:
om.mp3— CC0 licensed, 13.6 seconds, auto-looped - Background:
latest_bkg.png— cosmic full-page background