A simple browser extension that transcribes and translates audio from any web page. The extension captures audio on the page, streams it to Deepgram's Speech-to-Text API for transcription, and then sends the transcript to Azure's translation service for translation into a target language. It is my entry to the Deepgram + DEV hackathon.
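To make the capture, transcribe, translate flow more concrete, here is a minimal sketch of the request-building side of such a pipeline. It is not the code in `background.js`. The helper names (`buildDeepgramUrl`, `buildTranslateRequest`, `extractTranscript`) and the query options passed to Deepgram are illustrative assumptions. The endpoints are the publicly documented Deepgram live-streaming endpoint and Azure Translator v3 `translate` operation.

```javascript
// Sketch only: helper names and option values are assumptions,
// not the extension's actual implementation.

// Build the URL for Deepgram's live-transcription WebSocket endpoint.
// `options` are passed through as query parameters (e.g. punctuate, language).
function buildDeepgramUrl(options = {}) {
  const url = new URL('wss://api.deepgram.com/v1/listen');
  for (const [key, value] of Object.entries(options)) {
    url.searchParams.set(key, String(value));
  }
  return url.toString();
}

// Build a fetch()-style request for Azure Translator's v3 "translate" operation,
// authenticated with a short-lived bearer token.
function buildTranslateRequest(text, targetLanguage, accessToken) {
  const url = new URL('https://api.cognitive.microsofttranslator.com/translate');
  url.searchParams.set('api-version', '3.0');
  url.searchParams.set('to', targetLanguage);
  return {
    url: url.toString(),
    init: {
      method: 'POST',
      headers: {
        Authorization: `Bearer ${accessToken}`,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify([{ Text: text }]),
    },
  };
}

// Pull the transcript out of a Deepgram live result message.
// Returns '' for interim (non-final) results or messages without alternatives.
function extractTranscript(message) {
  const data = JSON.parse(message);
  const alternative =
    data.channel && data.channel.alternatives && data.channel.alternatives[0];
  return data.is_final && alternative ? alternative.transcript : '';
}
```

In a browser, the URL from `buildDeepgramUrl` would typically be opened with `new WebSocket(url, ['token', key])`, each incoming message passed through `extractTranscript`, and any non-empty transcript sent with `fetch(request.url, request.init)` using the output of `buildTranslateRequest`.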
The project contains two parts: the main extension source and an API service proxy responsible for requesting short-lived access tokens for the extension to use.
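Because the tokens are short-lived, the extension has to decide when to ask the proxy for a new one. The following is a rough sketch of that check, assuming a hypothetical token shape of `{ value, expiresAt }` with `expiresAt` in milliseconds since the epoch. The actual response format of the service proxy may differ.

```javascript
// Sketch only: the token shape { value, expiresAt } is a hypothetical
// assumption, not the service proxy's documented response format.

// Returns true when there is no token yet, or when the token expires
// within `safetyMarginMs` of `nowMs` (both in milliseconds).
function needsRefresh(token, nowMs = Date.now(), safetyMarginMs = 5000) {
  if (!token) {
    return true;
  }
  return nowMs >= token.expiresAt - safetyMarginMs;
}
```

The safety margin keeps the extension from opening a connection with a token that would expire mid-handshake.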
|____extension
| |____background.js
| |____icons
| | |____speaker-48.png
| |____manifest.json
| |____content.js
|
|____service-proxy
| |____mvnw.cmd
| |____pom.xml
| |____src
| |____....
|
|____LICENSE
|____README.md
The extension uses some Chrome-specific APIs, so it only works on Chromium-based browsers (e.g., Chrome, Edge). To install it locally:
-> Manage extensions -> Load unpacked -> select the directory of the extension
- Create an Azure account and provision an Azure Translator service.
- Create a `.env` file and add the Azure Translator subscription key, and the Deepgram API key and project ID:

      azure-translator-api.subscription-key=<your Azure translator subscription key>
      deepgram-api.authentication-key=<your Deepgram api key>
      deepgram-api.project-id=<your Deepgram project id>

- Start the local service proxy:

      > ./mvnw quarkus:dev