Hand Gesture Recognition Using Neural Network Hand Gesture

View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Directory of Open Access Journals International Journal of Computer Science and Network (IJCSN) Volume 1, Issue 6, December 2012 www.ijcsn.org ISSN 2277-5420 Hand Gesture Recognition using Neural Network 1Rajesh Mapari, 2Dr. Govind Kharat 1 Dept of Electronics and Telecommunication Engineering, Anuradha Engineering College, Chikhli, Maharashtra-443201, India 2 Principal, Sharadchandra Pawar College of Engineering, Otur, Maharashtra-443201, India Abstract This paper presents a simple method to recognize sign gestures language recognition system [4]. Ryszard S . Choras of American Sign Language using features like number of Peaks proposed a method identification of persons based on the and Valleys in an image with its position in an image. Sign shape of the hand and the second recognizing gestures language is mainly employed by deaf-mutes to communicate and signs executed by hands using geometrical and Radon with each other through gestures and visions. We extract the transform (RT) features [5]. Salma Begum, Md. skin part which represents the hand from an image using L*a*b* Color space. Every hand gesture is cropped from an Hasanuzzaman proposed system which uses PCA image such that hand is placed in the center of image for ease of (Principal Component Analysis) based pattern matching finding features. The system does require hand to be properly method for recognition of sign [6]. Yang quan, Peng aligned to the camera and does not need any special color Jinye, Li Yulong proposed a novel vision-based SVMs [8] markers, glove or wearable sensors. The experimental results classifier for sign language recognition [7]. A vision show that 100% recognition rate for testing and training data based Sign Language recognition system uses many set. features of image like area, DCT and uses Neural Keywords: Gesture recognition, boundary tracing, Network [9] or HMM [14], [16]. segmentation, peaks & valleys. 2. Proposed Methodology 1. Introduction In this paper we present an efficient and accurate The ultimate aim of our research is to enable technique for sign detection. Our method has five phases communication between speech impaired (i.e. deaf-dumb) of processing viz., image cropping, resizing, peaks and people and common people who don’t understand sign valleys detection, dividing image in sixteen parts, finding language. This may work as translator [10] to convert location of peaks and valleys as shown in Figure 1. sign language into text or spoken words. Our work has explored modified way of recognition of sign using peaks InputInput Image Image ad valleys with added feature of positioning of finger in image. There were many approaches to recognize sign Image ImageCropping cropping and Resizing using data gloves [11], [12] or colored gloves [15] worn by signer to derive features from gesture or posture. MarkingMarking and and counting counting peaks peaks and valleys and valleys Ravikiran J. et al. proposed a method of recognizing sign using number of fingers opened in a gesture representing Dividing image in to sixteen parts and an alphabet of the American Sign Language [1]. Iwan finding positions of peaks and valleys Njoto Sandjaja et al . proposed a modification in color- coded gloves which uses less color compared with other Training neural network with parameters color-coded gloves in previous research to recognizes the and recognizing sign Filipino Sign Language [2]. Jianjie Zhang et al. proposed a new complexion model has been proposed to extract Fig. 1 Block Diagram of Sign Detection hand regions under a variety of lighting conditions [3]. Authors have collected data of 20 persons (students of V.Radha et al. developed a threshold based segmentation engineering college) who have been given little training process which helps to promote a better vision based sign about how to perform signs. For acquiring image we have 56 International Journal of Computer Science and Network (IJCSN) Volume 1, Issue 6, December 2012 www.ijcsn.org ISSN 2277-5420 used camera of 1.3M pixels (Interpolated 12M pixels still 2.2 Resizing image image resolution). After getting RGB image in size of either W* W or H*H , In first phase we have read image and cropped it by image is converted to gray scale image. maintaining height width ratio of hand portion only. Later Image is then filtered using Gaussian filter with size [8 8] on hand portion is resized to 256*256 size to extract and sigma value 2 which found suitable for this features. experimentation. 2.1 Cropping input image Filtered image is then resized to 256*256 sized image. Hand portion image is then converted to 256*256 size First converts the RGB image to L*a*b* Color space to RGB image, this way hand portion comes at the center of separate intensity information into a single plane of the image. This way cropping operation is performed. image, and then calculates the local range in each layer. Second and third layer is intensity images are converted to black and white image according to threshold value of each layer. Two images are then multiplied to get one result image. From the result image 4-connected components are labeled. Properties of each labeled region are measured using Bonding box to make structures. Convert structure to cell array. Convert cell array of matrices to single matrix. From this matrix hand portion is marked by marking square box on original RGB image. Fig. 4 Grayscale Image 2.3 Boundary Tracing for Peaks and Valleys Resized image is smoothed by moving average filter to remove unnecessary discontinuities. Fig. 2 Image of hand in with red box marked If the width (W) hand portion is more than height (H) then cropping is W*W size else it is H*H size. Fig. 5 Hand image before and after smoothing operation Using morphological operations this smoothed image is converted to boundary image. Fig. 3 Resized Image International Journal of Computer Science and Network (IJCSN) Volume 1, Issue 6, December 2012 www.ijcsn.org ISSN 2277-5420 0 1 0 0 0 0 0 0 0 0 0 0 1 0 0 0 1 0 0 0 0 0 0 0 0 0 1 0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 Fig. 8 Condition I Condition II: If we don’t get any pixel it means we have Fig. 6 Boundary Image to search on existing pixels right side, if pixel exist we 2.4 Peaks and valleys detection follows the same way until we get no pixel on right side. We again follows as per condition I. if condition I and II After getting boundary image we first find the boundary not satisfied it means we have to search down, here we tracing point from where to start and where to stop mark as peak as shown in figure 9. finding peaks and valleys. For this we find maximum value of x where white pixel exists. 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 0 0 We call this point as opti_x and then find corresponding 0 0 0 1 0 0 0 0 0 value of y. The starting point on x direction as 0.80*x. 0 0 1 0 0 0 0 0 0 From this x value we find starting y co-ordinate of 0 0 1 0 0 0 0 0 0 starting point. 0 0 1 0 0 0 0 0 0 y 0 1 0 0 0 0 0 0 0 Fig. 9 Condition II If condition I and II not satisfied then we search on down side by making DN=1 Condition III: we start with DN=1, We first travel to Start(x,y) 0.80*opti_x Stop(x,y) down and check whether white pixel exist or not. If exist then continue in same way if not we check it on down left opti_x or down right. Again we search on down side and continue until we don’t get any pixel on down or down - x left or down -right. Fig. 7 Tracing Starting & Ending Point of Hand Image 0 0 0 0 0 0 0 0 0 0 0 1 0 1 This is our starting point to trace boundary and ending 0 0 0 0 0 0 0 0 0 0 point is starting point y position plus one i.e. next row of 1 1 1 1 0 0 0 0 0 0 1 0 starting point where white pixel exist. 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 1 0 0 Condition I: we start with UP=1, we first travel to top and 0 0 0 0 0 0 0 0 0 1 0 0 check whether white pixel exist or not. If exist then continue in same way if not we check it on top left or top Fig. 10 Condition III right. Again we search on top side and continue until we don’t get any pixel on top or top-left or top-right. Condition IV: If we don’t get any pixel it means we have Condition I is demonstrated using Figure 8.

Load more