• No se han encontrado resultados

ESTUDIO DEL CASO DE LA EMPRESA INDUSTRIALES DE TINTAS LTDA.

Fase III. En esta fase se procedió a evaluar los factores competitivos de la empresa que le han permitido alcanzar un buen desempeño en los últimos años.

3. ESTUDIO DEL CASO DE LA EMPRESA INDUSTRIALES DE TINTAS LTDA.

One of the limitations of the Mean-shift tracking algorithm together with the skin detection is occlusion handling. Because there is no significant information about the

boundaries of the hand and face i n the skin blobs, using an extra cue to distinguish the boundaries of the hand and face may be useful. In this section we describe the technique of using the depth information together with the Mean-shift algorithm. The depth information is extractable using a stereo-vision system.

A stereo-vision system can provide the depth information of the scene in terms of

distance to the camera. This information can be used for enhancing the algorithms

which we introduced in Chapters 3 and 4.

The simplest configuration of a stereo-vision system is hi-camera. However, three or more cameras also may be used in a stereo-vision system. Stereo-vision can produce a dense disparity map which can be translated to the depth information map. The resultant disparity map should be smooth, detailed and continuous. Moreover, surfaces should produce a region of smooth disparity values with their boundaries precisely delineated while small surface elements should be detected as separately distinguished regions. Unfortunately, satisfying all of these requirements simultaneously is not achievable. Algorithms that can produce a smooth disparity map tend to miss the details and those that can produce a detailed map tend to be noisy. The depth maps obtained by hi-camera stereo systems are not very accurate and reliable. A higher number of cameras may gain better quality depth information

[ 1 50] .

While the luxury of stereo-machines with more than two cameras is not yet commonly available, the normal hi-camera stereo-vision systems are the most available choice. However, the depth estimation ability is somewhat limited. We used

Chapter 5. A novel approach for robust tracking .. .

a Bumblebee19 bi-camera stereo-vision system for applying the techniques proposed

in this section. Connected to the fire-wire port, this camera is able to record two 1 024x768 calor or gray scale images in 25 fps. The test platform was Windows XP and the programming platform was C++ in Visual Studio.Net 2003. Although the

depth map obtained using this camera is not highly accurate (Figure 5 . 15), it can

provide an estimation of the distance of different objects to the camera which was sufficient for evaluation of our ideas.

Figure 5.15. a) A sample image, b) the depth information of the sample image. The light area is the object closer to the camera, the black patches are of unknown depth.

5.5.3.1 The depth information and the adaptive skin detection algorithm

Let' s take a step back and review the concept of motion detection for our adaptive

skin detection algorithm. In fact, one of the reasons for the motion detection is

separating the user and the background. Considering the fact that in an HCI environment the user is closer to the camera than the background, the depth information can significantly improve the accuracy of background elimination from

the image. This idea can be implemented using depth thresholding. Figure 5 . 1 6b represents the eliminated background and the remaining foreground after applying the depth thresholding technique.

Figure 5.16. Background elimination using the depth thresholding technique.

The detected skin pixels within the separated foreground are more likely to be the actual skin color and therefore the adaptive skin detection algorithm will provide more accurate detection. Some of the unwanted areas like the surface of the table behind the user will also automatically be eliminated.

5.5.3.2 The depth information and the Fuzzy Mean-shift blob tracker

The depth information can be used for occlusion resolution in some scenarios. Figure

5 . 1 7 presents two cases in which the distance of the hand and face to the camera are

different. Obviously, without the depth information the hand and face blobs in these

images are being considered as one single blob. In this section, we describe how the

depth information can be applied for enhancing the Fuzzy Mean-shift blob tracker algorithm for occlusion prevention.

Chapter 5. A novel approach for robust tracking . . .

Figure 5.17. Some of the scenarios in which depth information can be useful for occlusion resolution.

Based on the idea of applying the depth information as an extra cue for blob tracking

within the Mean-shift algorithm, we implemented another variant of the Fuzzy Mean- shift algorithm . The basics of this variant in Kernel shifting is similar to its ancestor. However the kernel management is slightly different.

The kernel in this version of the algorithm in addition to the vertical and horizontal boundaries of the tracking blob also carries the minimum and the maximum depth of the points within. The shrinking and enlarging procedure on the boundaries of the

kernel are based on the pixels with depths within ± 1 0% of the kernel ' s min-max

depth. The pixels which do not satisfy this constraint or their depth is unknown will be ignored. Therefore, blob trackers which are tracking blobs in different depth levels

will not interfere with each other. Figure 5 . 1 8 represents the result of this algorithm

on an image. We should note that without the depth information, the hand and the face blobs are considered as one connected blob. Using the depth information, the

Mean-shift algorithm is able to stay around the blob which has the homogeneous

Figure 5.18. Occlusion resolution using the depth information and the Mean-shift algorithm. The tracked blobs are displayed on the disparity image (right).

5.6 Chapter summary

In this chapter, we presented a new approach for boundary detection in blob tracking based on the Mean-shift algorithm. Our approach was based on continuous sampling of the boundaries of the kernel and changing the size of the kernel using our novel

algorithm. We also showed that the proposed method is superior in terms of

robustness and stability compared to the density-based tracking method known as the

CAM-Shift algorithm.

The robustness of our method against noise makes it a good candidate for use with cheap cameras and real-world vision-based HCI applications. This method is to be

applied in conjunction with a fast pixel-based skin color segmentation algorithm as