A hand-gesture control system that lets you navigate slides, web pages, and applications using your webcam, without touching the keyboard or mouse.
This project uses Python (OpenCV + MediaPipe) for gesture detection and sends relevant keyboard events (next/previous/zoom) to control active applications.
- Double-tap on right side → Next page / slide
- Double-tap on left side → Previous page / slide
- Pinch close (thumb + index) → Zoom in
- Pinch open (thumb + index) → Zoom out
- Works with PowerPoint, Google Slides, Chrome, PDF viewers, and other apps
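
The gesture list above amounts to a small lookup table from gesture to key combination. The sketch below shows one way such a table could be dispatched. The gesture names, the `GESTURE_KEYS` table, the `dispatch` helper, and the specific key choices (arrow keys, Ctrl with `+` / `-`) are all assumptions made for illustration; the project's actual bindings may differ.

```python
from typing import Callable, Dict, Tuple

# Hypothetical gesture names and key bindings (assumed, not taken from the project).
GESTURE_KEYS: Dict[str, Tuple[str, ...]] = {
    "double_tap_right": ("right",),   # next page / slide
    "double_tap_left": ("left",),     # previous page / slide
    "pinch_close": ("ctrl", "+"),     # zoom in
    "pinch_open": ("ctrl", "-"),      # zoom out
}


def dispatch(gesture: str, send_keys: Callable[..., None]) -> bool:
    """Send the key combination bound to `gesture`.

    `send_keys` is any callable that accepts key names as positional
    arguments. Returns False when the gesture has no binding.
    """
    keys = GESTURE_KEYS.get(gesture)
    if keys is None:
        return False
    send_keys(*keys)
    return True
```

Taking the sender as a parameter keeps the table testable without a display. In a live session, `pyautogui.hotkey` should fit the `send_keys` slot, since it accepts one or more key names.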
The Python client uses your webcam to detect hand landmarks in real time using the MediaPipe library. It tracks the position of your index finger and detects gestures like double taps and pinches. These gestures are then mapped to keyboard shortcuts using PyAutoGUI.
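
A minimal sketch of the two detection ideas described above, written in plain Python so it runs without a webcam. It assumes MediaPipe's normalized landmark coordinates (x and y in the range 0 to 1) and the MediaPipe Hands indices 4 (thumb tip) and 8 (index fingertip). The class names, thresholds, and timing window are illustrative guesses, not values from this project. How a single "tap" is recognized from landmarks is not described here, so the sketch takes tap events as input.

```python
import math
from typing import Optional, Tuple

Point = Tuple[float, float]  # normalized (x, y), each in [0, 1]

THUMB_TIP, INDEX_TIP = 4, 8  # MediaPipe Hands landmark indices


def pinch_distance(thumb: Point, index: Point) -> float:
    """Euclidean distance between the thumb tip and the index fingertip."""
    return math.hypot(thumb[0] - index[0], thumb[1] - index[1])


class PinchDetector:
    """Emit an event when the fingertip gap crosses a threshold.

    Two thresholds (hysteresis) stop jitter around a single cutoff from
    firing repeated events. Both default values are assumptions.
    """

    def __init__(self, close_below: float = 0.05, open_above: float = 0.12):
        self.close_below = close_below
        self.open_above = open_above
        self.closed = False  # assume the hand starts open

    def update(self, distance: float) -> Optional[str]:
        if not self.closed and distance < self.close_below:
            self.closed = True
            return "pinch_close"
        if self.closed and distance > self.open_above:
            self.closed = False
            return "pinch_open"
        return None


class DoubleTapDetector:
    """Pair up taps that land on the same side within a short window."""

    def __init__(self, max_interval: float = 0.4):
        self.max_interval = max_interval  # seconds between taps (assumed)
        self.last_tap: Optional[Tuple[str, float]] = None  # (side, time)

    def tap(self, x: float, t: float) -> Optional[str]:
        # Which half counts as "right" depends on whether the frame is mirrored.
        side = "right" if x >= 0.5 else "left"
        prev, self.last_tap = self.last_tap, (side, t)
        if prev and prev[0] == side and t - prev[1] <= self.max_interval:
            self.last_tap = None  # consume both taps so a third does not re-fire
            return f"double_tap_{side}"
        return None
```

In a capture loop, the fingertip points would come from the detected hand's landmark list at `THUMB_TIP` and `INDEX_TIP`, and the timestamp from something like `time.monotonic()`.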
Gesture detection keeps pace with the webcam's frame rate, so interaction feels smooth and natural.

    GestureAnaltucsPlatform/
    │
    ├── backend/                  # Node.js backend for analytics
    │   ├── server.js
    │   ├── package.json
    │   └── ...other backend files
    │
    ├── python-client/            # Python gesture client
    │   ├── hand_double_click.py
    │   ├── hand_navigation_zoom.py
    │   ├── gesture_detector.py
    │   ├── hand_test.py
    │   ├── mouse_gesture_control.py
    │   └── requirements.txt
    │
    ├── .venv/                    # Virtual environment (ignored)
    │
    └── README.md                 # This file