Lists (6)
Sort Name ascending (A-Z)
Stars
📚Modern CUDA Learn Notes with PyTorch: 200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe API (Achieve 98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
Wan: Open and Advanced Large-Scale Video Generative Models
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
FastVideo is a lightweight framework for accelerating large video diffusion models.
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)
VideoSys: An easy and efficient system for video generation
Rich is a Python library for rich text and beautiful formatting in the terminal.
High-quality PNGs for logos I made for fun
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
Generative Models by Stability AI
Easily create large video dataset from video urls
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
🌐 Jekyll is a blog-aware static site generator in Ruby
a fork of https://jonbarron.info/ for use in jekyll builds with markdown page updates
real Transformer TeraFLOPS on various GPUs
Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation map.