🚀 Introducing Flame Audio AI – Real-Time Voice Platform

Hello Google Cloud Build with AI community! :waving_hand: I’m excited to share Flame Audio AI, an open-source, full-stack voice platform powered by Google Generative AI. It is designed to add speech-to-text, text-to-speech, and speaker diarization to your applications.


:hammer_and_wrench: Quick Install & Setup

Follow these steps to get up and running locally.

git clone https://github.com/Bag-zy/flame-audio

cd flame-audio

npm install

npm run dev

  1. Create a .env.local file in the project root containing your MONGODB_URI, NEXTAUTH_SECRET, and GOOGLE_API_KEY.

  2. Open localhost:3000 in your browser to see the live demo.
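If the dev server starts but sign-in or transcription fails, a missing variable in .env.local is a likely cause. Here is a minimal sketch of a startup check for the three variables listed above. The helper names (findMissingEnvVars, assertEnvConfigured) are hypothetical and not taken from the repo; only the variable names come from the setup steps.

```typescript
// Hypothetical helper, not part of the Flame Audio repo.
// Only the three variable names come from the setup steps above.
const REQUIRED_ENV_VARS = ["MONGODB_URI", "NEXTAUTH_SECRET", "GOOGLE_API_KEY"] as const;

type EnvSource = Record<string, string | undefined>;

// Returns the names of required variables that are unset or blank.
function findMissingEnvVars(env: EnvSource): string[] {
  return REQUIRED_ENV_VARS.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}

// Throws with a readable message if any required variable is missing.
function assertEnvConfigured(env: EnvSource): void {
  const missing = findMissingEnvVars(env);
  if (missing.length > 0) {
    throw new Error(`Missing environment variables in .env.local: ${missing.join(", ")}`);
  }
}
```

In a Next.js project you could call assertEnvConfigured(process.env) early in server-side code, so a misconfigured setup fails with a clear message instead of an opaque runtime error.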


:bullseye: Key Features

Speech-to-Text: Real-time transcription with multi-speaker support

Text-to-Speech: Natural, human-like voice synthesis

Speaker Diarization: Automatically label who’s speaking when

Multi-Format & Multi-Language: Supports MP3, WAV, M4A, 50+ languages

Responsive UI: Light/dark mode toggle, mobile-friendly design
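For anyone testing uploads, here is a rough sketch of how a client could check a file against the three listed formats before sending it. The audioMimeType helper and the extension-based lookup are assumptions for illustration; the repo may detect formats differently (for example, by inspecting file contents).

```typescript
// Hypothetical lookup, not taken from the repo: maps the three listed
// extensions to their commonly used MIME types.
const SUPPORTED_AUDIO_TYPES = new Map<string, string>([
  ["mp3", "audio/mpeg"],
  ["wav", "audio/wav"],
  ["m4a", "audio/mp4"],
]);

// Returns the MIME type for a supported file name, or null otherwise.
function audioMimeType(fileName: string): string | null {
  const dot = fileName.lastIndexOf(".");
  // No extension, or a trailing dot with nothing after it.
  if (dot < 0 || dot === fileName.length - 1) return null;
  const ext = fileName.slice(dot + 1).toLowerCase();
  return SUPPORTED_AUDIO_TYPES.get(ext) ?? null;
}
```

The lookup is case-insensitive on the extension, so a file named talk.MP3 is treated the same as talk.mp3.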


:wrench: Under the Hood (Tech Stack)

Next.js 15 for frontend & API routes

React + TypeScript for UI components

Tailwind CSS, Radix UI, Lucide React Icons for styling

NextAuth.js for authentication, MongoDB + Mongoose for persistence

Google Generative AI powering all speech features


:red_question_mark: How You Can Help

  1. Test & Report Issues: Run the demo and let me know about any bugs or performance quirks.

  2. Feature Requests: What additional formats, languages, or AI capabilities would you like?

  3. Performance Tips: Do you have suggestions for scaling speech workloads on Vertex AI or Cloud Functions?

Looking forward to your feedback and collaboration. Thanks in advance! :folded_hands:

:link: Repo & Demo: GitHub - Bag-zy/flame-audio


I love this project! Thanks :grin:
