-
-
-
ichigo Public
Forked from menloresearch/ichigoLlama3.1 learns to Listen
Python Apache License 2.0 UpdatedOct 22, 2024 -
-
-
DINet-new Public
Forked from MRzzm/DINetThe source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Python UpdatedSep 24, 2024 -
ML-basics-freecodecamp Public
This is an overview of machine learning basics from the free youtube course by freecodecamp
Jupyter Notebook UpdatedSep 23, 2024 -
gaussian-splatting Public
Forked from graphdeco-inria/gaussian-splatting3D Gaussian Splatting for Real-Time Radiance Field Rendering
Python Other UpdatedSep 17, 2024 -
-
WhisperSpeech Public
Forked from WhisperSpeech/WhisperSpeechAn Open Source text-to-speech system built by inverting Whisper.
Jupyter Notebook MIT License UpdatedJun 18, 2024 -
-
mmpose Public
Forked from open-mmlab/mmposeOpenMMLab Pose Estimation Toolbox and Benchmark.
Python Apache License 2.0 UpdatedOct 24, 2022 -
-
-
audio-super-res Public
Forked from kuleshov/audio-super-resAudio super resolution using neural networks
Python MIT License UpdatedFeb 22, 2021 -
espeak-ng Public
Forked from espeak-ng/espeak-ngeSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
C GNU General Public License v3.0 UpdatedJan 2, 2021 -
espnet Public
Forked from espnet/espnetEnd-to-End Speech Processing Toolkit
Python Apache License 2.0 UpdatedOct 20, 2020 -
ForwardTacotron Public
Forked from spring-media/ForwardTacotron⏩ Generating speech in a single forward pass without any attention!
Python MIT License UpdatedSep 30, 2020 -
melgan Public
Forked from seungwonpark/melganMelGAN vocoder (compatible with NVIDIA/tacotron2)
Python BSD 3-Clause "New" or "Revised" License UpdatedSep 3, 2020 -
-
merlin Public
Forked from CSTR-Edinburgh/merlinThis is now the official location of the Merlin project.
Python Apache License 2.0 UpdatedJul 8, 2020 -
PyKaldi-EndtoEnd-Recognition Public
A starter pack for complete end to end streaming pipeline with VAD, Wakeword detection, and Kaldi Speech Recognition
-
-
VisTraSAS Public
Visual Traffic Surveillance and Analytics System
-
tensorRTWrapper Public
Forked from lewes6369/tensorRTWrapperTensorRT Net Wrapper
C++ MIT License UpdatedJan 28, 2020 -
deep_sort Public
Forked from nwojke/deep_sortSimple Online Realtime Tracking with a Deep Association Metric
Python GNU General Public License v3.0 UpdatedJan 28, 2020 -
pytorch-pose Public
Forked from bearpaw/pytorch-poseA PyTorch toolkit for 2D Human Pose Estimation.
Python GNU General Public License v3.0 UpdatedDec 20, 2019 -
-
conversation_transcriber Public
To trascribe stereo audio samples to Conversation text
Shell UpdatedNov 3, 2019 -
TensorRT-Yolov3 Public
Forked from lewes6369/TensorRT-Yolov3TensorRT for Yolov3
C++ MIT License UpdatedOct 31, 2019