A second AI cursor 🖱️ for your desktop that can see your screen, hear you speak, and talk to you.
Powered by Google's Gemini 2.0 Flash (Experimental) model via the Multimodal Live API, using its pointing and function-calling capabilities.
Created by @13point5.
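Function calling works by registering tool declarations that the model can invoke. As a minimal sketch, a cursor-moving tool declaration in the Gemini function-calling format might look like the following (the `move_cursor` name, description, and parameters are illustrative assumptions, not the app's actual schema):

```typescript
// Hypothetical tool declaration in the Gemini function-calling format.
// The name and parameter shape are assumptions for illustration;
// gemini-cursor's real schema may differ.
const moveCursorTool = {
  name: "move_cursor",
  description: "Move the on-screen AI cursor to a point on the shared screen",
  parameters: {
    type: "object",
    properties: {
      x: { type: "number", description: "Horizontal position on the screen" },
      y: { type: "number", description: "Vertical position on the screen" },
    },
    required: ["x", "y"],
  },
};
```

When the model decides to point at something, it responds with a call to this tool, and the app moves the second cursor accordingly.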
- 🖱️ Second AI cursor on your desktop
- 🚀 Multimodality: The model can see 📸, hear 🎤, and speak 🔊
- ⚡️ Real-time with low latency
- 📚 Understanding complex diagrams in research papers, architecture diagrams, etc.
- 🌐 Navigating complex websites to perform tasks like adding a payment method on Amazon
- 📝 Real-time AI tutoring with whiteboards
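Driving a desktop cursor from pointing output comes down to a coordinate conversion: Gemini's pointing responses typically use coordinates normalized to a 0–1000 grid, which must be mapped onto the actual display. A minimal sketch, assuming that normalization convention (the app itself may handle this differently):

```typescript
// Convert a point normalized to a 0-1000 grid (the convention Gemini
// pointing/bounding-box responses use) into screen-pixel coordinates.
// The 0-1000 assumption is about the model output, not this app's code.
function toScreenPixels(
  normX: number,
  normY: number,
  screenWidth: number,
  screenHeight: number
): { x: number; y: number } {
  return {
    x: Math.round((normX / 1000) * screenWidth),
    y: Math.round((normY / 1000) * screenHeight),
  };
}

// e.g. the center of a 1920x1080 display:
const center = toScreenPixels(500, 500, 1920, 1080);
// center is { x: 960, y: 540 }
```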
- Frontend: Electron, React, TypeScript, Vite
- AI: Google Gemini API
- Built on Google's Multimodal Live API, with much of the code adapted from the Gemini Multimodal Live API Web Console
- Node.js (v16 or higher)
- npm
- Gemini API key
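To confirm the Node.js requirement before installing, a quick check in a POSIX shell (the v16 floor comes from the prerequisites above; `node_major` is a small helper defined here, not part of the repo):

```shell
# Extract the major version from a `node --version` string like "v18.19.0"
# and compare it against the required minimum (v16).
node_major() {
  printf '%s\n' "$1" | sed 's/^v\([0-9]*\).*/\1/'
}

required=16
installed=$(node --version 2>/dev/null || echo "v0")
if [ "$(node_major "$installed")" -ge "$required" ]; then
  echo "Node.js $installed OK"
else
  echo "Node.js $installed is below the required v$required"
fi
```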
- Clone the repository

  ```shell
  git clone https://github.com/13point5/gemini-cursor.git
  cd gemini-cursor
  ```

- Install dependencies

  ```shell
  npm install
  ```

- Run the app

  ```shell
  npm run start
  ```
- Enter the Gemini API key in the app
- Click the Play button and the Share Screen button
- Minimize the app and enjoy!