# demo.md

## Dependencies Overview

* OpenCV
* ROS Kinetic (Python 2.7 is required)
* A Camera connected to your device

## How to Run

1. Copy this package into your workspace and run `catkin_make`.
2. Simply Run `roslaunch gesture_teleop teleop.launch`. A window showing real time video from your laptop webcam will be activated. Place your hand into the region of interest (the green box) and your robot will take actions based on the number of fingers you show.
   1. Two fingers: Drive forward
   2. Three fingers: Turn left
   3. Four fingers: Turn right
   4. Other: Stop

![f5](/files/bUSkgS8JlREdVjO3r3SZ)

![f4](/files/STvJmElbnrSHT75n3ZQv)

![f3](/files/pEu39x7X4q3Rqwr7aSP2)

![f2](/files/x0ocnZsQsuF3iqQGmVpW)

## Brief Explaination towards the package

This package contains two nodes.

1. `detect.py`: Recognize the number of fingers from webcam and publish a topic of type `String` stating the number of fingers. I won't get into details of the hand-gesture recognition algorithm. Basically, it extracts the hand in the region of insteret by background substraction and compute features to recognize the number of fingers.
2. `teleop.py`: Subscribe to `detect.py` and take actions based on the number of fingers seen.

```python
while not rospy.is_shutdown():
   twist = Twist()
   if fingers == '2':
      twist.linear.x = 0.6
      rospy.loginfo("driving forward")
   elif fingers == '3':
      twist.angular.z = 1
      rospy.loginfo("turning left")
   elif fingers =='4':
      twist.angular.z = 1
      rospy.loginfo("turning right")
   else:
      rospy.loginfo("stoped")
      cmd_vel_pub.publish(twist)
   rate.sleep()
```

## Later Plan

1. Using Kinect on mutant instead of local webcam.
2. Furthermore, use depth camera to extract hand to get better quality images
3. Incorporate **Skeleton tracking** into this package to better localize hands (I am using **region of insterests** to localize hands, which is a bit dumb).


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://campus-rover.gitbook.io/lab-notebook/fiiva/reports/2020/gesture-recognition/demo.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
