Skip to content
View ZitongYu's full-sized avatar

Block or report ZitongYu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[IEEE SPL] Official Implementation for Pose-promote: Progressive Visual Perception for Indoor Action Recognition

Python 3 Updated Apr 22, 2024

PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba

Python 1 Updated Sep 21, 2024

[ECCV 2024🔥] The official code for the paper DiffFAS: Face Anti-Spoofing via Generative Diffusion Models.

Python 35 2 Updated Sep 23, 2024

Bag of Augmentations for Generalized Face Anti-Spoofing

Python 6 Updated Sep 18, 2024

PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba

Python 48 5 Updated Nov 14, 2024

The source code of "SFDA-rPPG: Source-free Domain Adaptive rPPG Measurement with Spatial-Temporal Consistency"

5 Updated Apr 3, 2024

Offical code repository of ”DAAD: Dynamic Analysis and Adaptive Discriminator for Fake News Detection“

16 Updated Aug 22, 2024

real time face swap and one-click video deepfake with only a single image

Python 49,512 7,265 Updated Apr 4, 2025
Python 12 1 Updated Aug 8, 2024

Official Implementation for "Cue-N: Cue-Aware Network for Audio-Visual Question Answering"

2 Updated Jul 2, 2024

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Python 426 37 Updated Mar 31, 2025

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,250 624 Updated Sep 26, 2024

The offical code implementation of paper "Interpretable Multimodal Misinformation Detection with Logic Reasoning", accepted by Finding of ACL 23.

Python 31 3 Updated Dec 8, 2023

MMPD: Multi-Domain Mobile Video Physiology Dataset(EMBC2023 Oral)

Python 125 15 Updated Sep 30, 2024

FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba

Python 152 10 Updated Feb 21, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,388 47 Updated Mar 31, 2025

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

627 24 Updated Apr 3, 2025

Accepted by IJCAI-24 Survey Track

Python 199 5 Updated Aug 25, 2024

[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces

Python 674 47 Updated Dec 3, 2024

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

Python 774 77 Updated Apr 4, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,340 228 Updated Feb 13, 2025

GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing

24 Updated Mar 21, 2024

Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios

1 Updated Mar 7, 2024

[ECCV 2024🔥] The official code for the paper AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors.

50 1 Updated Jul 15, 2024

[ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios

Python 52 1 Updated Sep 4, 2024

GM-DF:Generalized Multi-Scenario Deepfake Detection

6 Updated Mar 4, 2024

Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing

Python 32 2 Updated Nov 17, 2024

Gemma open-weight LLM library, from Google DeepMind

Jupyter Notebook 3,109 420 Updated Apr 4, 2025
Next