ICCV-2023-Papers

Application

Action and Event Understanding

Title	Repo	Video
Weakly-Supervised Action Segmentation and Unseen Error Detection in Anomalous Instructional Videos	➖	➖
Diffusion Action Segmentation		➖
Audio-Visual Glance Network for Efficient Video Recognition	➖	➖
Learning from Noisy Pseudo Labels for Semi-Supervised Temporal Action Localization		➖
Video Action Recognition with Attentive Semantic Units	➖	➖
Masked Motion Predictors are Strong 3D Action Representation Learners		➖
Boosting Positive Segments for Weakly-Supervised Audio-Visual Video Parsing	➖	➖
Weakly-Supervised Action Localization by Hierarchically-Structured Latent Attention Modeling	➖	➖
Few-Shot Common Action Localization via Cross-Attentional Fusion of Context and Temporal Dynamics	➖	➖
Interaction-Aware Joint Attention Estimation using People Attributes		➖
FineDance: A Fine-Grained Choreography Dataset for 3D Full Body Dance Generation		➖
SOAR: Scene-Debiasing Open-Set Action Recognition		➖
Leveraging Spatio-Temporal Dependency for Skeleton-based Action Recognition		➖
Cross-Modal Learning with 3D Deformable Attention for Action Recognition	➖	➖
Generative Action Description Prompts for Skeleton-based Action Recognition		➖
Self-Feedback DETR for Temporal Action Detection	➖	➖
Skip-Plan: Procedure Planning in Instructional Videos via Condensed Action Space Learning	➖	➖
The Unreasonable Effectiveness of Large Language-Vision Models for Source-Free Video Domain Adaptation		➖
Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection
Video Anomaly Detection via Sequentially Learning Multiple Pretext Tasks	➖	➖
MiniROAD: Minimal RNN Framework for Online Action Detection		➖
How much Temporal Long-Term Context is Needed for Action Segmentation?		➖
DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion		➖
STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos		➖
Efficient Video Action Detection with Token Dropout and Context Refinement		➖
FSAR: Federated Skeleton-based Action Recognition with Adaptive Topology Structure and Knowledge Distillation	➖	➖
Exploring Predicate Visual Context in Detecting of Human-Object Interactions		➖
E2E-LOAD: End-to-End Long-Form Online Action Detection		➖
Revisiting Foreground and Background Separation in Weakly-Supervised Temporal Action Localization: A Clustering-based Approach		➖
Hierarchically Decomposed Graph Convolutional Networks for Skeleton-based Action Recognition		➖

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

action-and-event-understanding.md

action-and-event-understanding.md

ICCV-2023-Papers

Action and Event Understanding

Files

action-and-event-understanding.md

Latest commit

History

action-and-event-understanding.md

File metadata and controls

ICCV-2023-Papers

Action and Event Understanding