- HKUST
- Hong Kong
- https://xinshengwang.github.io/
robpitch Public
A pitch detection model trained to be robust in noisy and reverberant environments.
chinese-xinhua Public
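Forked from pwxcoo/chinese-xinhua
📙 Chinese Xinhua Dictionary database, including xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters.
Python MIT License Updated Dec 26, 2023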
Automatic-Prosody-Annotation Public archive
Forked from Daisyqk/Automatic-Prosody-Annotation
Python Updated Apr 6, 2022
wesing Public
Forked from wenet-e2e/opencpop
An open-source high-quality Mandarin singing voice synthesis corpus
Updated Jan 20, 2022
ddsp Public
Forked from magenta/ddsp
DDSP: Differentiable Digital Signal Processing
Python Apache License 2.0 Updated Dec 15, 2021
spectacular-oregano-dc2d0 Public
Jamstack site created with Stackbit
JavaScript Other Updated Nov 12, 2021
PortaSpeech Public
Forked from keonlee9420/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Python MIT License Updated Oct 7, 2021
Tacotron-pytorch Public
Tacotron-series TTS models implemented in PyTorch
vits Public
Forked from jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Python MIT License Updated Jun 14, 2021
denoiser Public
Forked from facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
Python Other Updated May 19, 2021
Papers submitted to Interspeech 2021 on text-to-speech (TTS) and voice conversion (VC)
Updated Apr 15, 2021
ICASSP2021_paper_list-VC Public
ICASSP 2021 accepted papers on voice conversion (VC)
face-landmark-frontalization Public
Rotates 3D face landmarks to a frontal view
first-order-model Public
Forked from AliaksandrSiarohin/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
Jupyter Notebook Other Updated Feb 9, 2021
glow-tts Public
Forked from jaywalnut310/glow-tts
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Python MIT License Updated Dec 7, 2020
No-audio-speech-detection Public
Code for the No-audio Speech Detection task in MediaEval 2020
Word-boundary-discovery Public
Word boundary discovery in continuous speech signals
Tacotron2_batch_inference Public
PyTorch Tacotron 2 implementation that supports batch inference
academic-kickstart Public
Forked from HugoBlox/theme-academic-cv
📝 Easily create a beautiful website using Academic, Hugo, and Netlify
Shell MIT License Updated May 22, 2020