Solvve/ml_speech2text_voice_denoiser

Speech2text & Denoise

License Python 3.7 scikit-learn 0.23.2 torch 0.23.2 Solvve

Description

Speech-to-text and speech denoising built on a pretrained Wav2Vec2 model. Denoising uses a Dual-signal Transformation LSTM Network (DTLN). The repository also covers fine-tuning the Wav2Vec2 model.

The workflow follows these steps:

  1. Data preparation
  2. Data preprocessing
  3. Modeling with Wav2Vec2 model
  4. Modeling after denoising
  5. Fine-tuning Wav2Vec2 for multi-language ASR

From Wec2Vec2_Denoise.ipynb:

| Levenshtein metrics   | Mean | Median |
|-----------------------|------|--------|
| Word Error Rate       | 0.26 | 0.20   |
| Match Error Rate      | 0.25 | 0.20   |
| Word Information Lost | 0.40 | 0.36   |
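
The three metrics above are all derived from a word-level Levenshtein alignment between a reference transcript and a model transcript. The notebook's own metric code is not reproduced here, so the following is only a minimal sketch under the usual definitions: with H hits, S substitutions, D deletions and I insertions, WER = (S+D+I)/(H+S+D), MER = (S+D+I)/(H+S+D+I), and WIL = 1 - H²/((H+S+D)(H+S+I)). The function names `align_counts`, `error_rates` and `summarize` are hypothetical and chosen for illustration. The sketch assumes a non-empty reference and plain whitespace tokenization.

```python
from statistics import mean, median


def align_counts(reference, hypothesis):
    """Return (hits, substitutions, deletions, insertions) for a
    minimum-cost word-level alignment of reference vs. hypothesis."""
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = (cost, hits, subs, dels, ins) for ref[:i] vs hyp[:j]
    dp = [[None] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    dp[0][0] = (0, 0, 0, 0, 0)
    for i in range(1, len(ref) + 1):
        dp[i][0] = (i, 0, 0, i, 0)  # every reference word deleted
    for j in range(1, len(hyp) + 1):
        dp[0][j] = (j, 0, 0, 0, j)  # every hypothesis word inserted
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            c, h, s, d, n = dp[i - 1][j - 1]
            if ref[i - 1] == hyp[j - 1]:
                diag = (c, h + 1, s, d, n)      # hit
            else:
                diag = (c + 1, h, s + 1, d, n)  # substitution
            c, h, s, d, n = dp[i - 1][j]
            dele = (c + 1, h, s, d + 1, n)      # deletion
            c, h, s, d, n = dp[i][j - 1]
            ins = (c + 1, h, s, d, n + 1)       # insertion
            dp[i][j] = min(diag, dele, ins, key=lambda t: t[0])
    return dp[-1][-1][1:]


def error_rates(reference, hypothesis):
    """Compute WER, MER and WIL for one transcript pair.
    Assumes the reference contains at least one word."""
    h, s, d, i = align_counts(reference, hypothesis)
    n_ref = h + s + d  # words in the reference
    n_hyp = h + s + i  # words in the hypothesis
    errors = s + d + i
    wer = errors / n_ref
    mer = errors / (h + errors)
    # An empty hypothesis preserves no information, so WIL is 1.
    wil = 1.0 if n_hyp == 0 else 1 - (h * h) / (n_ref * n_hyp)
    return {"wer": wer, "mer": mer, "wil": wil}


def summarize(pairs):
    """Return {metric: (mean, median)} over (reference, hypothesis) pairs."""
    scores = [error_rates(r, h) for r, h in pairs]
    return {
        name: (mean([x[name] for x in scores]),
               median([x[name] for x in scores]))
        for name in ("wer", "mer", "wil")
    }
```

Calling `summarize` on the list of (reference, transcription) pairs produced before and after denoising would give per-metric mean and median values in the same shape as the table above. Lower is better for all three metrics.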

About

Review of Speech to text voice denoisers
