Project Overview

Real-Time AI-Powered Transcription and Translation Tool

This project is a real-time transcription and translation tool. It combines fast, accurate speech-to-text transcription with multi-language translation powered by NLLB-200.

A Rust-based backend handles the performance-critical work, and a lightweight Next.js frontend provides a cross-platform interface. The tool is designed to run efficiently on local machines.


Key Features

  • Real-Time Transcription
    Powered by Whisper Turbo (whisper-rs) for fast and precise speech-to-text processing.
  • Efficient Audio Handling
    Processes audio in real time using WebGPU and FFmpeg.
  • Accurate Translation
    Uses the NLLB-200 model, available on Hugging Face, for multilingual translation across roughly 200 languages.
  • Cross-Platform Frontend
    Built with Next.js to deliver a responsive and user-friendly interface.
  • State Management
    Managed via Zustand for a streamlined and reactive application state.
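
Real-time transcription generally means cutting the incoming audio into short segments before handing them to the speech-to-text model. The following is a minimal sketch of that step, not code from this repository. It assumes 16 kHz mono PCM samples (the input rate Whisper models expect); the helper name `chunkSamples` and the chunk and overlap durations are hypothetical choices for illustration.

```typescript
// Hypothetical helper, not part of this repository.
// Whisper models expect 16 kHz mono input.
const SAMPLE_RATE = 16000;

// Split mono PCM samples into fixed-length chunks that overlap slightly,
// so words falling on a chunk boundary appear whole in at least one chunk.
function chunkSamples(
  samples: Float32Array,
  chunkSeconds: number,
  overlapSeconds: number,
  sampleRate: number = SAMPLE_RATE,
): Float32Array[] {
  const chunkLen = Math.round(chunkSeconds * sampleRate);
  const step = chunkLen - Math.round(overlapSeconds * sampleRate);
  if (chunkLen <= 0 || step <= 0) {
    throw new RangeError("chunkSeconds must be positive and larger than overlapSeconds");
  }

  const chunks: Float32Array[] = [];
  for (let start = 0; start < samples.length; start += step) {
    // subarray clamps the end index, so the final chunk may be shorter.
    chunks.push(samples.subarray(start, start + chunkLen));
    // Stop once a chunk has reached the end of the buffer.
    if (start + chunkLen >= samples.length) break;
  }
  return chunks;
}
```

For example, `chunkSamples(buffer, 2, 0.5)` yields 2-second windows that advance by 1.5 seconds. `subarray` returns views on the original buffer rather than copies, which keeps the step cheap; copy the chunk first if the buffer will be reused.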

Tech Stack

| Layer            | Technology      | Purpose                                      |
|------------------|-----------------|----------------------------------------------|
| Transcription    | Whisper Turbo   | Fast, accurate speech-to-text transcription  |
| Audio Handling   | WebGPU + FFmpeg | Real-time audio processing                   |
| Translation      | NLLB-200        | Advanced multilingual translation            |
| Frontend         | Next.js         | Lightweight, cross-platform UI               |
| State Management | Zustand         | Simplified and reactive state management     |
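
NLLB-200 identifies languages with FLORES-200 codes, which pair a language tag with a script tag (for example `eng_Latn`), rather than with two-letter ISO codes. A frontend that lets users pick a language therefore usually needs a lookup step. The sketch below shows one way to do that. The `toNllbCode` helper and the small subset of languages are hypothetical and are not taken from this repository.

```typescript
// Hypothetical lookup, not part of this repository.
// Maps a few two-letter language tags to the FLORES-200 codes used by NLLB-200.
const NLLB_CODES = new Map<string, string>([
  ["en", "eng_Latn"],
  ["fr", "fra_Latn"],
  ["es", "spa_Latn"],
  ["de", "deu_Latn"],
  ["ja", "jpn_Jpan"],
  ["ko", "kor_Hang"],
  ["zh", "zho_Hans"], // assumption: default Chinese to Simplified script
]);

// Accepts tags such as "en", "en-US", or "EN_us" and returns the NLLB code,
// or undefined when the language is not in the table.
function toNllbCode(tag: string): string | undefined {
  const [primary = ""] = tag.trim().toLowerCase().split(/[-_]/);
  return NLLB_CODES.get(primary);
}
```

Returning `undefined` for unknown tags, rather than falling back to a default language, lets the caller decide how to surface an unsupported selection.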

Resources


Installation and Usage

Clone the repository:

git clone https://github.com/itsyuimorii/chirimiri_lingo_AI_lively_tranlate_tool.git
cd chirimiri_lingo_AI_lively_tranlate_tool

Install dependencies:

npm install

Start the development server:

npm run dev

For production, build and run:

npm run build
npm start

Future Plans

  • Expand support for additional input formats and languages.
  • Integrate with streaming platforms for live transcription and translation.
  • Optimize further for on-device performance to enable offline functionality.

About

A real-time AI-powered transcription and translation tool. It processes live audio and translates it into multiple languages while preserving privacy through local, client-side processing. Ideal for meetings, webinars, and live events.