A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 9,125 928 Updated Mar 20, 2025

argosopentech / argos-translate

Open-source offline translation library written in Python

Python 4,290 313 Updated Feb 20, 2025

LibreTranslate / LibreTranslate

Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.

Python 10,960 1,039 Updated Mar 25, 2025

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 78,879 9,460 Updated Jan 4, 2025

mendableai / firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 32,679 2,813 Updated Mar 25, 2025

voxel51 / fiftyone

Refine high-quality datasets and visual AI models

Python 9,305 608 Updated Mar 25, 2025

EchoseChen / SPA-VL-RLHF

The reinforcement learning codes for dataset SPA-VL

Python 31 Updated Jun 24, 2024

imoneoi / openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,318 411 Updated Sep 13, 2024

HumanSignal / label-studio-sdk

Label Studio SDK

Python 122 80 Updated Mar 19, 2025

wentaoL86 / Awesome-Human-Video-Generation

A work list of recent human video generation method. This repository focus on half/full body human video generation method, The Nerf, Gaussian splashing, Motion Pose, and talking head/Portrait is n…

219 14 Updated Oct 16, 2024

opencv / opencv_zoo

Model Zoo For OpenCV DNN and Benchmarks.

Python 734 215 Updated Jan 10, 2025

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 173,805 45,445 Updated Mar 25, 2025

antgroup / echomimic

[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 3,724 413 Updated Dec 10, 2024

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,605 781 Updated Mar 25, 2025

Stability-AI / generative-models

Generative Models by Stability AI

Python 25,576 2,839 Updated Sep 4, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 25,805 2,483 Updated Mar 25, 2025

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,935 1,056 Updated Mar 6, 2025

Tobi-r9 / RaMViD

Python 99 11 Updated Nov 11, 2023

RERV / VDT

[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.

Jupyter Notebook 230 14 Updated May 5, 2024

johannakarras / DreamPose

Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"

Python 997 77 Updated Nov 2, 2023

microsoft / autogen

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 42,143 6,296 Updated Mar 25, 2025

Vchitect / Latte

[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.

Python 1,800 187 Updated Mar 24, 2025

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,015 631 Updated May 31, 2024

ChenHsing / Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

2,029 105 Updated Mar 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cishidai

Block or report cishidai

Starred repositories

deepseek-ai / awesome-deepseek-integration

mermaid-js / mermaid

milvus-io / milvus

HqWu-HITCS / Awesome-Chinese-LLM

VITA-MLLM / VITA

BradyFU / Awesome-Multimodal-Large-Language-Models

modelscope / FunASR