Gemini Cursor ✨

A second AI cursor 🖱️ for your desktop that can see your screen, hear you speak, and talk to you.

Demo

Powered by Google's Gemini 2.0 Flash (Experimental) model, using the Multimodal Live API together with the model's pointing and function-calling capabilities.
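To show how pointing output could drive an on-screen cursor, here is a minimal sketch in TypeScript. It assumes the model reports a point as `y` and `x` values normalized to a 0 to 1000 range, which is how Gemini's spatial outputs are commonly described. The names `ModelPoint`, `ScreenSize`, and `toScreenPixels` are hypothetical and are not taken from this repository.

```typescript
// Hypothetical types for illustration; not part of this project's code.
interface ModelPoint {
  y: number; // vertical position, assumed normalized to 0..1000
  x: number; // horizontal position, assumed normalized to 0..1000
}

interface ScreenSize {
  width: number; // screen width in pixels
  height: number; // screen height in pixels
}

interface PixelPoint {
  x: number;
  y: number;
}

// Assumption: the model's coordinate scale tops out at 1000.
const NORMALIZED_MAX = 1000;

// Convert a normalized model point into pixel coordinates on the shared screen.
// Out-of-range values are clamped so the cursor never leaves the screen.
function toScreenPixels(point: ModelPoint, screen: ScreenSize): PixelPoint {
  const clamp = (v: number): number => Math.min(Math.max(v, 0), NORMALIZED_MAX);
  return {
    x: Math.round((clamp(point.x) / NORMALIZED_MAX) * screen.width),
    y: Math.round((clamp(point.y) / NORMALIZED_MAX) * screen.height),
  };
}
```

For example, on a 1920×1080 screen, `toScreenPixels({ y: 500, x: 250 }, { width: 1920, height: 1080 })` yields `{ x: 480, y: 540 }`. Clamping is a design choice for this sketch; the app itself may handle out-of-range points differently.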

Created by @13point5.

Features

  • 🖱️ Second AI cursor on your desktop
  • 🚀 Multimodality: The model can see 📸, hear 🎤, and speak 🔊
  • ⚡️ Real-time with low latency

Use Cases

  • 📚 Understanding complex diagrams, such as figures in research papers or architecture diagrams
  • 🌐 Navigating complex websites to complete a task, such as adding a payment method on Amazon
  • 📝 Real-time AI tutor with whiteboards

Tech Stack

  • Frontend: Electron, React, TypeScript, Vite
  • AI: Google Gemini API

Acknowledgements

Prerequisites

Installation

  1. Clone the repository
     git clone https://github.com/13point5/gemini-cursor.git
     cd gemini-cursor
  2. Install dependencies
     npm install
  3. Run the app
     npm run start
  4. Enter your Gemini API key in the app
  5. Click the Play button and the Share Screen button
  6. Minimize the app and enjoy!