GitHub - showlab/GUI-Narrator: Repository of GUI Action Narrator

GUI Action Narrator: Where and When Did That Action Take Place?

Qinchen Wu, Difei Gao, Kevin Qinghong Lin, Zhuoyu Wu, Xiangwu Guo, Peiran Li, Weichen Zhang, Hengxu Wang, Mike Zheng Shou

🤖: Introduction

We introduce Act2Cap, a GUI action dataset, together with GUI Narrator, an effective framework for GUI video captioning that uses cursor detection to improve the interpretation of high-resolution screenshots and keyframe extraction in GUI actions.

📋 ToDo List

Model for Cursor detector and Narrator
Code of conduct

-- Our model and test benchmark are available on .
