Skip to content

Latest commit

 

History

History
61 lines (56 loc) · 16.2 KB

action-and-event-understanding.md

File metadata and controls

61 lines (56 loc) · 16.2 KB

ICCV-2023-Papers

Application App

Action and Event Understanding

Section Papers Preprint Papers Papers with Open Code Papers with Video

Title Repo Paper Video
Weakly-Supervised Action Segmentation and Unseen Error Detection in Anomalous Instructional Videos thecvf
Diffusion Action Segmentation GitHub thecvf
arXiv
Audio-Visual Glance Network for Efficient Video Recognition thecvf
arXiv
Learning from Noisy Pseudo Labels for Semi-Supervised Temporal Action Localization GitHub thecvf
Video Action Recognition with Attentive Semantic Units thecvf
arXiv
Masked Motion Predictors are Strong 3D Action Representation Learners GitHub thecvf
arXiv
Boosting Positive Segments for Weakly-Supervised Audio-Visual Video Parsing thecvf
Weakly-Supervised Action Localization by Hierarchically-Structured Latent Attention Modeling thecvf
arXiv
Few-Shot Common Action Localization via Cross-Attentional Fusion of Context and Temporal Dynamics thecvf
Interaction-Aware Joint Attention Estimation using People Attributes WEB Page
GitHub
thecvf
arXiv
FineDance: A Fine-Grained Choreography Dataset for 3D Full Body Dance Generation GitHub Page
GitHub
thecvf
arXiv
SOAR: Scene-Debiasing Open-Set Action Recognition GitHub thecvf
arXiv
Leveraging Spatio-Temporal Dependency for Skeleton-based Action Recognition GitHub thecvf
arXiv
Cross-Modal Learning with 3D Deformable Attention for Action Recognition thecvf
arXiv
Generative Action Description Prompts for Skeleton-based Action Recognition GitHub thecvf
arXiv
Self-Feedback DETR for Temporal Action Detection thecvf
arXiv
Skip-Plan: Procedure Planning in Instructional Videos via Condensed Action Space Learning thecvf
The Unreasonable Effectiveness of Large Language-Vision Models for Source-Free Video Domain Adaptation GitHub thecvf
arXiv
Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection WEB Page
GitHub
thecvf
arXiv
YouTube
Video Anomaly Detection via Sequentially Learning Multiple Pretext Tasks thecvf
MiniROAD: Minimal RNN Framework for Online Action Detection GitHub thecvf
How much Temporal Long-Term Context is Needed for Action Segmentation? GitHub thecvf
arXiv
DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion GitHub thecvf
arXiv
STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos GitHub thecvf
arXiv
Efficient Video Action Detection with Token Dropout and Context Refinement GitHub thecvf
arXiv
FSAR: Federated Skeleton-based Action Recognition with Adaptive Topology Structure and Knowledge Distillation thecvf
arXiv
Exploring Predicate Visual Context in Detecting of Human-Object Interactions GitHub thecvf
arXiv
E2E-LOAD: End-to-End Long-Form Online Action Detection GitHub thecvf
arXiv
Revisiting Foreground and Background Separation in Weakly-Supervised Temporal Action Localization: A Clustering-based Approach GitHub thecvf
Hierarchically Decomposed Graph Convolutional Networks for Skeleton-based Action Recognition GitHub thecvf
arXiv