GitHub - rahulkarda/Speech-Recognition: A Speech Recognition web app that converts speech to text in real time.

Speech Recognition (Speech to Text)

A Speech Recognition Web App that converts speech to text in real time built using Web Speech API. SpeechRecognition interface was used which provides the ability to recognize voice context from an audio input (normally via the device's default speech recognition service) and respond appropriately.

The site is live at : https://speech2text.netlify.app/

Tech Stack

Bootstrap is a free and open-source CSS framework directed at responsive, mobile-first front-end web development.

Web Speech API enables you to incorporate voice data into web apps. The Web Speech API has two parts: SpeechSynthesis (Text-to-Speech), and SpeechRecognition (Asynchronous Speech Recognition.). SpeechRecognition interface, which provides the ability to recognize voice context from an audio input (normally via the device's default speech recognition service) and respond appropriately.

Interface Used

Speech Recognition

SpeechRecognition - The controller interface for the recognition service; this also handles the SpeechRecognitionEvent sent from the recognition service.
SpeechRecognitionEvent - The event object for the result and nomatch events, and contains all the data associated with an interim or final speech recognition result.
SpeechGrammar - The words or patterns of words that we want the recognition service to recognize.
SpeechRecognitionResult - Represents a single recognition match, which may contain multiple SpeechRecognitionAlternative objects.
SpeechRecognitionResultList - Represents a list of SpeechRecognitionResult objects, or a single one if results are being captured in continuous mode.

Optimizations

While improve this project, I would start by implementing the following features -

Adding support for more languages
Solving Browser compatibility issues

Lessons Learned

My learning was focused on making the use of Web Speech API and to create a simple UI where user can select a language and start Speech Recognition to convert the speech to text. Learned how the Web Speech API works and differnece between SpeechSynthesis (Text-to-Speech), and SpeechRecognition (Asynchronous Speech Recognition.).

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
images		images
README.md		README.md
favicon.ico		favicon.ico
index.html		index.html
package.json		package.json
script.js		script.js
style.css		style.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech Recognition (Speech to Text)

Tech Stack

Interface Used

Speech Recognition

Optimizations

Lessons Learned

About

Releases

Packages

Languages

rahulkarda/Speech-Recognition

Folders and files

Latest commit

History

Repository files navigation

Speech Recognition (Speech to Text)

Tech Stack

Interface Used

Speech Recognition

Optimizations

Lessons Learned

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages