Stars
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
[ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos
Cosmos is a world model development platform that consists of world foundation models, tokenizers, and a video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
[3DV'25] 3D Reconstruction with Spatial Memory
OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation
[CVPR2025] PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/
[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation
[World-Model-Survey-2024] Paper list and projects for World Model
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
The easiest-to-understand tutorial for using LoRA (Low-Rank Adaptation) within the diffusers framework, for AI generation researchers🔥
SEED-Voken: A Series of Powerful Visual Tokenizers
🎮 A curated list of awesome game datasets and tools for artificial intelligence in games
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI
📖 This is a repository for organizing papers, code, and other resources related to unified multimodal models.
[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
A curated list of awesome model-based RL resources (continually updated)
Hybrid Fourier Score Distillation for Efficient One Image to 3D Object Generation
[TCSVT 2024] Progressive Content-aware Coded Hyperspectral Compressive Imaging
[NeurIPS 2024] GS-Hider: Hiding Messages into 3D Gaussian Splatting
Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.