-
University of Chinese Academy of Sciences
- BeiJing
- https://liewfeng.github.io/
- https://scholar.google.com/citations?user=gIfJkkQAAAAJ&hl=en
Stars
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
Wan: Open and Advanced Large-Scale Video Generative Models
Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and shaping the future!
LaTeX Thesis Template for the University of Chinese Academy of Sciences
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
[Arxiv 2024] Edicho: Consistent Image Editing in the Wild
[CVPR 2025🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
HunyuanVideo: A Systematic Framework For Large Video Generation Model
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
Benchmark for generative image models
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
📚 Collection of awesome generation acceleration resources.
[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
VideoSys: An easy and efficient system for video generation
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
A general and accurate MACs / FLOPs profiler for PyTorch models
CVPR2024, Semantic-aware SAM for Point-Prompted Instance Segmentation
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
[ICCV 2021] WaveFill: A Wavelet-based Generation Network for Image Inpainting
Pytorch implementation of 2D Discrete Wavelet (DWT) and Dual Tree Complex Wavelet Transforms (DTCWT) and a DTCWT based ScatterNet
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Code repository for T2V-Turbo and T2V-Turbo-v2