Project Overview

Real-Time AI-Powered Transcription and Translation Tool

This project is a state-of-the-art real-time tool designed for transcription and translation. By leveraging cutting-edge AI technologies, it delivers fast and accurate speech-to-text transcription and advanced multi-language translation powered by NLLB-200.

With a Rust-based backend for high performance and a lightweight Next.js frontend for seamless cross-platform usability, this tool ensures top-tier results while running efficiently on local machines.

Key Features

Real-Time Transcription
Powered by Whisper Turbo (whisper-rs) for fast and precise speech-to-text processing.
Efficient Audio Handling
Processes audio in real-time using WebGPU and FFmpeg for smooth, optimized performance.
Accurate Translation
Utilizes the NLLB-200 model from Hugging Face for multilingual translation across 200+ languages with unparalleled accuracy.
Cross-Platform Frontend
Built with Next.js to deliver a responsive and user-friendly interface.
State Management
Managed via Zustand for a streamlined and reactive application state.

Tech Stack

Layer	Technology	Purpose
Transcription	Whisper Turbo	Fast, accurate speech-to-text transcription
Audio Handling	WebGPU + FFmpeg	Real-time audio processing
Translation	NLLB-200	Advanced multilingual translation
Frontend	Next.js	Lightweight, cross-platform UI
State Management	Zustand	Simplified and reactive state management

Resources

NLLB-200: Hugging Face Documentation
Whisper Turbo: GitHub | Discussion
Audio Processing with FFmpeg: Docs
Realtime Whisper with WebGPU: Example
Client-Side Translator Tutorial: Codemotion

Installation and Usage

Clone the repository:

git clone https://github.com/itsyuimorii/chirimiri_lingo_AI_lively_tranlate_tool.git
cd chirimiri_lingo_AI_lively_tranlate_tool

Install dependencies:

npm install

Start the development server:

npm run dev

For production, build and run:

npm run build
npm start

Future Plans

Expand support for additional input formats and languages.
Integrate with streaming platforms for live transcription and translation.
Optimize further for on-device performance to enable offline functionality.

About

A real-time AI-powered transcription and translation tool. This tool processes live audio and translates it into multiple languages—all while ensuring privacy with local, client-side processing. Ideal for meetings, webinars, and live events.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Project Overview

Key Features

Tech Stack

Resources

Installation and Usage

Clone the repository:

Install dependencies:

Start the development server:

For production, build and run:

Future Plans

About

Files

README.md

Latest commit

History

README.md

File metadata and controls

Project Overview

Key Features

Tech Stack

Resources

Installation and Usage

Clone the repository:

Install dependencies:

Start the development server:

For production, build and run:

Future Plans

About