Mantri AI is a real-time, speech-to-speech conversational AI assistant that listens, understands, and responds like a human.
It supports English, Hindi, and Hinglish conversations with natural flow, interruption handling, and continuous interaction.
- 🎙️ Real-time Speech Recognition (STT)
- 🧠 AI-powered conversational responses
- 🔊 Text-to-Speech (TTS) replies
- 🌐 Multilingual support (English, Hindi, Hinglish)
- 🔁 Continuous conversation loop
- ✋ Interrupt AI while speaking
- 💬 Chat UI with message history
- 🧠 Short-term conversation memory
- React.js
- Web Speech API (Speech Recognition + Speech Synthesis)
- Axios
- FastAPI (Python)
- Groq API (LLaMA 3.3)
- Uvicorn
- User speaks → Speech Recognition converts voice to text
- Text sent to backend (FastAPI)
- Groq LLM generates response
- Response sent back to frontend
- Speech Synthesis speaks the reply
- Loop continues for real-time conversation
- 🗣️ Natural conversation tone (human-like responses)
- 🎯 Language detection (auto Hindi / English / Hinglish)
- 🔁 Smart listening flow (AI speaks → mic restarts)
- ✋ Interrupt system (stop AI mid-speech)
.envfile is excluded for security- Never expose API keys in frontend
- Works best on Chrome (Web Speech API support)
- Better natural voice (advanced TTS integration)
- Long-term memory system
- User authentication
- Deployment optimization (Render + Netlify)
- Mobile support
Sukhad Sharma
BTech CSE (Full Stack Software Development)
If you like this project, give it a ⭐ on GitHub!