AirControl 🖐️
A gesture-based computer control system using hand tracking. Control your Windows PC with hand gestures captured through your webcam.
📖 About
AirControl is a C++ application developed for computer control using hand gestures. It uses MediaPipe for real-time hand landmark detection and OpenCV for video capture, allowing users to control system functions through intuitive hand gestures.
✋ Supported Gestures
| Gesture | Action |
|---------|--------|
| Pinkie out only | Volume Up |
| Thumb out only | Volume Down |
| Thumb + Index finger | Play/Pause Media |
| Index + Middle finger | Scroll Down |
| Index + Middle + Ring finger | Scroll Up |
| Both hands - Index fingers only | Switch Windows (Alt+Tab) |
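The table above can be read as a lookup from "which fingers are extended" to an action. The following is a minimal sketch of that idea, not the project's actual code: the `Finger` bit flags, the `Action` enum, and the `classify` overloads are hypothetical names introduced for illustration. It also assumes that any two-hand pose other than "index fingers only" maps to no action, which the table does not state.

```cpp
#include <cstdint>

// Hypothetical bit flags, one per finger (not taken from the AirControl source).
enum Finger : std::uint8_t {
    Thumb  = 1 << 0,
    Index  = 1 << 1,
    Middle = 1 << 2,
    Ring   = 1 << 3,
    Pinkie = 1 << 4
};

// Hypothetical action names mirroring the rows of the gesture table.
enum class Action {
    None, VolumeUp, VolumeDown, PlayPause, ScrollDown, ScrollUp, SwitchWindows
};

// One hand visible: `extended` is a bitmask of the fingers that are out.
inline Action classify(std::uint8_t extended) {
    switch (extended) {
        case Pinkie:                 return Action::VolumeUp;
        case Thumb:                  return Action::VolumeDown;
        case Thumb | Index:          return Action::PlayPause;
        case Index | Middle:         return Action::ScrollDown;
        case Index | Middle | Ring:  return Action::ScrollUp;
        default:                     return Action::None;
    }
}

// Two hands visible: only "index finger out on both hands" is listed in the table.
// Returning None for every other combination is an assumption of this sketch.
inline Action classify(std::uint8_t firstHand, std::uint8_t secondHand) {
    if (firstHand == Index && secondHand == Index) {
        return Action::SwitchWindows;
    }
    return Action::None;
}
```

Because the match is on the exact bitmask, "Pinkie out only" does not fire when another finger is also extended, which is how the "only" wording in the table is interpreted here.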
🛠️ Prerequisites
- Windows OS
- Webcam
- MediaPipe C Library (libmediapipe) with C++ bindings
- OpenCV (4.7.5 was used for this project)
- Visual Studio 2022 with C++14 support
⚙️ Setup
- Clone the repository.
- Install MediaPipe and OpenCV dependencies.
- Update the resource path in `AirControl.cpp`.
- Open `AirControl.vcxproj` in Visual Studio 2022.
- Build and run the project.
🚀 Usage
- Run the application.
- Position your hand(s) in front of the webcam.
- Perform gestures to control your system.
- Press `q` to quit the application.
🔧 How It Works
- Uses MediaPipe's hand landmark tracking to detect 21 key points on each hand
- Calculates finger extension by comparing the direct distance along a finger with its bone segment lengths
- Implements gesture stabilization using consecutive frame validation to reduce false positives
- Sends Windows keyboard/mouse events based on detected gestures
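The extension check and the frame-based stabilization described above can be sketched as follows. This is an illustration under stated assumptions, not the project's implementation: `Point3`, `isExtended`, and `GestureStabilizer` are hypothetical names, the 0.9 ratio threshold is an arbitrary starting value, and the number of required frames is left to the caller. The sketch treats a finger as extended when the straight-line distance from its base joint to its tip is close to the summed length of its three bone segments, and reports a gesture only after it has been seen for a fixed number of consecutive frames.

```cpp
#include <array>
#include <cmath>

// Hypothetical landmark type; MediaPipe reports x, y, z per hand landmark.
struct Point3 { float x, y, z; };

inline float dist(const Point3& a, const Point3& b) {
    const float dx = a.x - b.x, dy = a.y - b.y, dz = a.z - b.z;
    return std::sqrt(dx * dx + dy * dy + dz * dz);
}

// joints: four landmarks of one finger, base to tip
// (for the index finger these would be MediaPipe hand landmarks 5, 6, 7, 8).
// threshold: assumed ratio above which the finger counts as straight.
inline bool isExtended(const std::array<Point3, 4>& joints, float threshold = 0.9f) {
    const float direct = dist(joints[0], joints[3]);
    const float chain  = dist(joints[0], joints[1])
                       + dist(joints[1], joints[2])
                       + dist(joints[2], joints[3]);
    if (chain <= 0.0f) return false;      // degenerate input, treat as not extended
    return direct / chain >= threshold;   // a curled finger gives a ratio well below 1
}

// Reports a gesture only after it was observed on `requiredFrames` frames in a row.
template <typename Gesture>
class GestureStabilizer {
public:
    explicit GestureStabilizer(int requiredFrames) : required_(requiredFrames) {}

    // Returns true exactly once per streak: on the frame where the streak
    // length reaches `requiredFrames`. Any different gesture restarts the count.
    bool update(const Gesture& candidate) {
        if (hasLast_ && candidate == last_) {
            ++count_;
        } else {
            last_ = candidate;
            hasLast_ = true;
            count_ = 1;
        }
        return count_ == required_;
    }

private:
    int required_;
    int count_ = 0;
    bool hasLast_ = false;
    Gesture last_{};
};
```

Using a ratio rather than an absolute distance keeps the check independent of how far the hand is from the camera. Firing only once per streak avoids repeating an action such as Play/Pause on every frame while the pose is held; a repeating action such as scrolling might instead want `count_ >= required_`.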
Default resource locations
OpenCV: C:
MediaPipe lib, bin and include: C:\mediapipe
MediaPipe data: C:\mediapipe\