Skip to content
View vvictoryuki's full-sized avatar

Block or report vvictoryuki

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 407 11 Updated Mar 3, 2025

[ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos

Python 266 9 Updated Jan 15, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Jupyter Notebook 7,670 493 Updated Mar 7, 2025

[3DV'25] 3D Reconstruction with Spatial Memory

Python 951 46 Updated Feb 25, 2025

OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation

14 1 Updated Dec 14, 2024

[CVPR2025] PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/

Python 125 2 Updated Jan 1, 2025

[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Jupyter Notebook 310 16 Updated Feb 25, 2025

A Study Path for Game Programmer

Python 17,950 2,063 Updated Mar 28, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 5,969 642 Updated Sep 20, 2024

Official repository for LTX-Video

Python 3,126 273 Updated Mar 5, 2025

[World-Model-Survey-2024] Paper list and projects for World Model

9 1 Updated Oct 31, 2024

Inference script for Oasis 500M

Python 1,757 146 Updated Nov 8, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,741 1,777 Updated Mar 11, 2025

The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥

Python 796 52 Updated Apr 10, 2024

Next-Token Prediction is All You Need

Python 2,029 78 Updated Oct 24, 2024

Code release of Video2Game

JavaScript 316 23 Updated Apr 25, 2024

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 843 32 Updated Feb 19, 2025

🎮 A curated list of awesome game datasets, and tools to artificial intelligence in games

815 53 Updated Feb 25, 2025

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Python 1,207 43 Updated Nov 6, 2024

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 968 42 Updated Feb 1, 2025

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

398 16 Updated Jan 18, 2025

[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,252 55 Updated Mar 12, 2025

A curated list of awesome model based RL resources (continually updated)

1,029 58 Updated Feb 17, 2025
Python 93 5 Updated Aug 16, 2024

Hybrid Fourier Score Distillation for Efficient One Image to 3D Object Generation

Python 82 Updated Oct 11, 2024

[TCSVT 2024] Progressive Content-aware Coded Hyperspectral Compressive Imaging

Python 42 1 Updated Nov 12, 2024

[NeurIPS 2024] GS-Hider: Hiding Messages into 3D Gaussian Splatting

Python 45 1 Updated Mar 1, 2025

Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223

Python 121 4 Updated Mar 3, 2025
Jupyter Notebook 1,023 125 Updated Sep 18, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,922 1,057 Updated Mar 6, 2025
Next