Introduction about Technologies
Computer vision is a field of artificial intelligence that trains computers to interpret and understand the visual world. Using digital images from cameras and videos and deep learning models, machines can accurately identify and classify objects — and then react to what they “see.”
OpenCV (Open Source Computer Vision Library) is a library of programming functions mainly aimed at real-time computer vision. Originally developed by Intel, it was later supported by Willow Garage then Itseez. The library is cross-platform and free for use under the open-source BSD license.
MediaPipe offers open source cross-platform, customizable ML solutions for live and streaming media.
How it works
Here we can control moves car game when yellow color come to;
- right box (press key 'd')
- left box (press key 'a')
- left hand when thumb finger open (press key 'w')
- right hand when thumb finger open (press key 's')
Technologies & Libraries
- opencv-python
- mediapipe
- pydirectinput
- numpy
Download and Setup
Cloning the repo.
https://github.com/0xSE/steering-wheel-opencv-mediapipe.git
Setup
Running script-Setup.py file For windows users use:
python script-Setup.py
For linux or macOS use:
python3 script-Setup.py
Note
When install pydirectinput
Go to libaray file and delete all lines have "sleep"
Script still have some bugs
