A Speech Recognition Web App that converts speech to text in real time built using Web Speech API. SpeechRecognition interface was used which provides the ability to recognize voice context from an audio input (normally via the device's default speech recognition service) and respond appropriately.
The site is live at : https://speech2text.netlify.app/
![Speech To Text Converter](https://github.com/rahulkarda/Speech-Recognition/raw/main/images/speech2text.jpg?raw=true)
Bootstrap is a free and open-source CSS framework directed at responsive, mobile-first front-end web development.
Web Speech API enables you to incorporate voice data into web apps. The Web Speech API has two parts: SpeechSynthesis (Text-to-Speech), and SpeechRecognition (Asynchronous Speech Recognition.). SpeechRecognition interface, which provides the ability to recognize voice context from an audio input (normally via the device's default speech recognition service) and respond appropriately.
- SpeechRecognition - The controller interface for the recognition service; this also handles the SpeechRecognitionEvent sent from the recognition service.
- SpeechRecognitionEvent - The event object for the result and nomatch events, and contains all the data associated with an interim or final speech recognition result.
- SpeechGrammar - The words or patterns of words that we want the recognition service to recognize.
- SpeechRecognitionResult - Represents a single recognition match, which may contain multiple SpeechRecognitionAlternative objects.
- SpeechRecognitionResultList - Represents a list of SpeechRecognitionResult objects, or a single one if results are being captured in continuous mode.
While improve this project, I would start by implementing the following features -
- Adding support for more languages
- Solving Browser compatibility issues
My learning was focused on making the use of Web Speech API and to create a simple UI where user can select a language and start Speech Recognition to convert the speech to text. Learned how the Web Speech API works and differnece between SpeechSynthesis (Text-to-Speech), and SpeechRecognition (Asynchronous Speech Recognition.).