Pupil and blink detection algorithms for wearable eye tracking system

doi:10.21203/rs.3.rs-1592196/v1

Download PDF

Research Article

Pupil and blink detection algorithms for wearable eye tracking system

https://doi.org/10.21203/rs.3.rs-1592196/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

To obtain the pupil center of the human eye in infrared images taken by a near-eye device for gaze tracking, pupil and blink detection algorithms are proposed. The eye-detection model and the eye feature point model are trained through the dlib library machine learning, the eye area is segmented, rough positioning of the eye area is realized, the redundant image information is removed, the number of image processing calculations are removed for the subsequent pupil positioning, and the processing time is shortened. In pupil detection, the candidate pupil contours are screened based on the gray information, shape characteristics and other pupil image information to obtain the correct pupil contour information and realize precise pupil positioning. The eye feature model is used to obtain the coordinate of the feature point of the eye, the aspect ratio of the eye is obtained by conversion, and blink statistics are performed. Experiments show that the correct rate of the pupil detection method reaches 97.24%, and the correct rate of blink detection reaches 91.59%.

Near-eye image

Pupil detection

Blink detection

Region cropping

Contour screening

The human eye is the main channel through which people obtain external information. Research on the visual information obtained by the human eye adopts eye-tracking technology, which has good application prospects in the fields of psychology and human-computer interaction[1–2]. At present, eye-tracking systems are roughly divided into near-eye eye-tracking systems and desktop eye-tracking systems based on hardware types [3–6].

The principle of the desktop eye-tracking system is to first detect the face of the tester, then locate and segment the eye area in the face, and then locate the pupil center and gaze point calibration, recording long-distance eye movements. The near-eye eye-tracking system generally has two types: helmet-mounted and glasses-type. Compared with desktop eye-tracking systems, near-eye eye-tracking systems are mostly used in wearable devices. Only the eyelid area was recorded, which is convenient for pupil extraction. However, limited by the narrow near-eye shooting space and wearing weight, the near-eye eye-tracking system has poor processing capability and slower speed. Helmet-mounted eye-tracking system [5], the hardware platform is concentrated on the helmet, and the volume and weight are relatively heavy. It is suitable for research experiments and cannot be applied to daily life; glasses-type eye-tracking system [7], as the name suggests, is to install the camera acquisition module on the spectacle frame, generally under the spectacle frame, which does not affect the wearer's field of view, conforms to the wearer's wearing habits, and ensures wearing comfort and freedom of behavior. The near-eye hardware device used in this article is a spectacle-type eye-tracking system. It can effectively record the eye movements of the tester. The device fully meets the design requirements of the spectacle frame, ensuring the rationality and wearing comfort.

In the eye-tracking system, the pupil positioning algorithm is the key link. Commonly used methods are convolutional neural networks [8–10] and AdaBoost [11]. The AdaBoost eye-detection algorithm based on Haar features is widely used[12–13]. The algorithm uses integral graphs and cascading structures to perform statistical learning on the Haar features of eye samples, locate the human eye area, and improve eye-detection and positioning accuracy. The data-based method does not require high image quality but requires a large number of training samples, the training process is complicated, the positioning accuracy is poor, and it is generally suitable for rough positioning. The knowledge-based method is the traditional image processing method; the result obtained by this method is more accurate, and it has a wide range of applications in precise pupil positioning. Selection and judgment are made through prior knowledge of pupil gray information, edge information, and shape characteristics.

The current common methods of pupil detection include image binarization combined with Hough transform [14], Daugman's circle detection operator[15], pupil detection using circular Hough transform mentioned by R. H. Nugroho [16], and the SET algorithm, which extracts pupil pixels based on the brightness threshold, draws the shape of the threshold area, and compares it with a sine curve[17]. The above Hough transform methods are all applied to the entire eye image, with a large number of calculations and high time complexity, and it is difficult to achieve real-time requirements. Since the camera is located under the eyes, when the position of the human eye's gaze changes, the recorded pupil image presents an elliptical shape similar to a circle, so the positioning accuracy of this type of method is low. Additionally, the eye image contains interference information such as eyebrows, eyelashes, and eyelid edges, which affects the accuracy of pupil detection and pupil information acquisition.

In response to the above problems, this paper relies on the eye tracking system of the glasses-type hardware platform, and adopts the combination of machine learning and traditional image processing methods. It is proposed to identify the eye region by training the model, crop the eye image to remove the redundant region to obtain the eye region, filter out the correct pupil fitting by the traditional image processing algorithm, complete the pupil positioning, and realize the blink detection function combined with the concept of eye aspect ratio. This method shortens the pupil detection time and improves the pupil detection accuracy.

To obtain a relatively complete eye image, the eye image area captured by the eye camera of the eye capture device will be larger than the eye area, as shown in Fig. 1, which includes non-eye areas such as eyebrows.

Image cropping obtains the eye area, which has simple content and obvious features. For this reason, dlib[18] is used to train the human eye detector to locate the eye area in the image, and the image is cropped to obtain the eye area, which reduces the number of calculations for subsequent image processing.

Dlib is an open source library for image processing[18]. In addition to high-quality face recognition, it can also train target detectors. An eye video was collected using an eye capture device, and each frame was extracted as an image of 400 x 400 size as the original dataset. The original dataset was tagged with the ImgLab tagging tool in dlib. The tag is the eye area framed on the dataset image.

The core principle of dlib is to use the image HOG feature [19–20] to represent the object to be measured. Compared with other feature operators, the HOG feature operator has a very good invariance to the geometric features and optical deformation of the image. This feature extraction operator is used in conjunction with a support vector machine (SVM) for object detection[21].

1.1 Training data parameter setting

The hyperparameters for dlib target detector training are set by the dlib.simple_object_detector_training_option class, which contains the parameters C, add_left_right_image_flips, detection_window_size, and epsilon.

The C parameter is the regular term coefficient of the loss function in the SVM classifier, and the loss function is as follows:

$$\begin{gathered} \hbox{min} w,b,\xi \frac{1}{2}\parallel W{\parallel ^2}+C\sum\limits_{{i=1}}^{n} {\xi i} \hfill \\ s.t.yi({W^T}Xi+b) \geqslant 1 - \xi i \hfill \\ i=1,2,...,N \hfill \\ \end{gathered}$$

where ${\xi }_{i}$ is the classification loss of the i-th sample point, the classification becomes 0 if the classification is correct, and the classification error corresponds to a linear value. $\sum _{i=1}^{n}{\xi }_{i}$ is the total error, and the smaller the value corresponding to the optimization target, the more accurate the classification of the training set.

For parameter C, select all numbers greater than 0 according to the requirements. The larger the value of C is, the higher the degree of attention to the total error in the whole process. The larger the C value is, the better the fit of the training set, the worse the antinoise ability, and the easier it is to overfit. The smaller the C value is, the worse the fitting effect. This article sets the value of C to 5.

1.2 Training the detector

The main training function of the detector is “train_simple_object_detector”. The training data of this function are a tagged eye video image file. The label data are imported to start training and obtain the eye-detection model file at the end of the training.

The training model is used on the eye image to detect the eye area and obtain the coordinate information of the corresponding area. The eye area is obtained by cropping the eye image through the coordinate information to verify the accuracy of the training model and facilitate subsequent processing.

Before blink detection, the human eye area is determined, and then the eye feature point model file is trained to obtain the eye feature point coordinate information. Based on the concept of the eye aspect ratio proposed by Soukupova T [22], the constructor calculates the eye aspect ratio EAR, and sets a threshold to count the blink situation.

After determining the eye area, 6 points were used to represent the eye feature points, starting from the left corner of the eye, and marking 1.2.3.4.5.6 clockwise, as shown in Fig. 2:

The 6 points calibrated in Fig. 2 represent the open and closed eye states. When the eyes are open, the vertical distance increases; when the eyes are closed, the vertical distance decreases. Judging from this state alone is more prone to errors, so the equation deduced by Soukupova T represents the eye aspect ratio (EAR) [22]. As shown below:

$$EAR=\frac{{\parallel P2 - P6\parallel +\parallel P3 - P5\parallel }}{{2\parallel P1 - P4\parallel }}$$

where P_1...6 represents the coordinates of the 6 feature points marked, and EAR represents the aspect ratio.

As shown by the EAR curve in Fig. 2, when the eyes are opened and closed, the aspect ratio EAR will have different data value ranges. Therefore, when the EAR is lower than a certain threshold N, the eyes are in a closed state, and the value of N in this paper is 0.2. The complete blinking process is shown in Fig. 3. It generally takes M (2–3 frames) to complete the blink action. Therefore, when the algorithm detects the number of blinks, it also needs to count the continuous closed eye frames while judging whether the EAR is lower than the closed eye threshold. The N value and M value are all set based on a large quantity of experimental data.

In pupil detection, there is information such as eyelashes, eyelid edges, and environmental lighting. On the one hand, it increases the number of calculations in pupil detection, and on the other hand, it produces some incorrect pupil fittings and causes pupil misidentification.

After image processing, the edge contour information of the image is obtained, and the direct least square fitting algorithm is used to perform ellipse fitting[23]. Ellipse fitting only needs 6 points for fitting, and the actual image edge information contains a large number of edge points, which results in fitting multiple ellipse information, which contains incorrect pupil fitting information.

For this incorrect pupil fitting information, the pupil shape and movement characteristics are screened to eliminate the incorrect pupil fitting information. The grayscale parameters, long and short axis parameters, and area parameters are related to pupil characteristics and machine equipment parameters. The focal length and resolution of the camera affect the long and short axes, the relative position of the camera and the eye affects the area parameter, and the illumination intensity of the infrared fill light affects the gray parameter. The abovementioned parameters are different for each tester, multiple testers are used to conduct experiments, and appropriate empirical values are selected as the parameter thresholds.

3.1 Threshold of grayscale parameters

The composition of the human eye is the pupil, iris, and sclera from the inside to the outside. The corresponding gray values gradually increase from the pupil, iris, and sclera. The gray value of the pupil is the lowest. According to such grayscale distribution characteristics, the pupil area can be segmented by setting an appropriate grayscale threshold.

The gray threshold selection is very important in threshold segmentation[24–25]. A suitable threshold can segment the pupil area very well, and an inappropriate threshold will affect the extraction effect of the pupil area. A threshold that is too low will cause pupil area loss, and a threshold that is too high will cause the pupil area to have interference areas. Figure 4 (a) is the grayscale histogram of the human eye. From the grayscale histogram, it can be seen that the grayscale value of the pupil area is very low and relatively concentrated. From the overall histogram, the gray distribution of pupils and irises satisfies the characteristics of "double peaks and one valley", and the pupil area can be extracted by threshold segmentation. In this paper, the gray value corresponding to the trough appearing after the first peak in the histogram is marked as the threshold T. The threshold T is different for different human eye images. The human eye image is segmented by adaptive threshold T to obtain the pupil area.

As shown in Fig. 4 (b), the pupil area obtained by threshold segmentation, due to the similar gray value of some areas, may contain part of the incorrect pupil area, resulting in more error contour edge points. In the ellipse fitting, the wrong pupil fitting situation will appear.

3.2 Long and short axis parameter and area parameter setting

The pupils collected in the video are close to a perfect circle regardless of whether they are enlarged or reduced. The pupil size of a normal person is 2–5 mm, with an average of approximately 4 mm. It can be considered that the long and short axes of the ellipse obtained by pupil fitting in the video should be within a relatively fixed pixel distance range; since the fitted pupil is close to a perfect circle, the calculated long and short axis ratio should vary within a certain range. In addition, the fitting area is correspondingly within the range of the relative interval. Through the abovementioned three parameters, the length distribution of the major and minor axes, the ratio of the major and minor axes, and the pixel area of the ellipse fitting area, as the screening conditions for the correct pupil contour, the correct pupil fitting is finally obtained. In the experiment, 10,529 eye images of 6 testers in different states (with eyes open) were tested, the pupil area was accurately fitted, the corresponding parameter information was collected for statistics, and three parameter thresholds were obtained by analysis. The pupil images in different states are listed in Fig. 5(a).

Figure 5(b) shows the distribution curve of the length information of the long and short axes of the ellipse. Therefore, the minimum threshold of the long axis is set to 30, and the maximum threshold is 90; the minimum threshold of the short axis is 25, and the maximum threshold is 75. The ellipses whose lengths are less than the minimum threshold and greater than the maximum threshold are discarded.

Figure 5(c) shows the calculation of the long and short axis ratios of the abovementioned long and short axes. The ratio of the long and short axes in the figure is distributed between 0.6-1.0. This situation occurs due to eyeball rotation and the camera angle problem, resulting in the shooting pupil being elliptical. To ensure accurate pupil contour identification, the long-to-short axis ratio threshold is set in the range of 0.5-1.0. Figure 5(d) shows the distribution of the area (pupil area) of the ellipse fitting area, which is concentrated in the 800–1600 interval. After the tester’s pupils were dilated, they were concentrated in the 2000–4000 interval, and a small part of the area was 4000–4800. In the experiment, the area parameter threshold range was set to 600–5000.

In screening candidate pupil contours, only contour information that simultaneously meets the requirements of the above three parameters is judged to be the correct pupil contour, and the pupil contours that do not meet any of the parameter requirements are discarded.

4.1 Eye image capture equipment

The experimental device is shown in Fig. 6. The main body is a glasses try-on frame, an eye camera is installed at the bottom of the frame, and infrared lights are installed around the frame to provide supplementary light.

In Fig. 6, the diameter of the frame ring is 48 mm, the length of the center beam is 19 mm, the overall tilt angle of the frame is 7°, and the back vertex of the lens is 12 mm away from the eyeball. The frame can be installed with myopia inserts to meet the requirements of myopia and can be tested and recorded for different groups of wearers; the camera module adopts a highly integrated, large FOV exposure module to achieve simultaneous binocular image collection.

4.2 Experimental platform

The parameters of the experimental system are shown in Table 1.

Table 1

Parameters of the experimental system
Operating system	Window10
Programming language	Python3.8
Camera resolution	400x400
Camera model	OVM6211
Infrared light source wavelength	850 nm

4.3 Experimental procedure

Several testers were randomly selected, and videos of each tester's eyes looking in different directions were collected to ensure that the different postures of the eyes looking at the screen were captured.

In the video processing process, pupil detection and blink detection were performed in accordance with the experimental procedure shown in Fig. 7, and the correct rate of eye image cropping, pupil detection and blink detection accuracy were counted.

The eye image video was collected by the device, and a model of the detected eye image area was generated by machine learning training. The training model was imported to detect the eye image area, obtain the cropping coordinate information, perform crop preprocessing on the eye image, and crop and remove the redundant part of the eye image that was not related to the eye information. The cropped image was converted to a grayscale image, and Gaussian filtering and image preprocessing operations such as binarization were performed. By setting an adaptive pupil binarization threshold, edge detection was performed on the binarized eye image to obtain the edge area. Through the set pupil constraint conditions, the wrong pupil was excluded, the pupil was fitted, and the center was taken as the pupil center position. Finally, the eye feature point model obtained by the above training was used to calculate the eye aspect ratio and count the number of blinks.

The experiment flowchart Fig. 7 is as follows:

5.1 Image cropping

The eye image collection effect of the six testers is shown in Fig. 8, where the green rectangular frame is the cropping area, the cropping area contains the upper and lower eyelids, and the left and right eyelids indicate that the cropping is correct. The correct rate of eye region cropping of different testers is 100%, which can remove the redundant information in the eye image very well.

Pupil detection with and without image cropping was performed on a piece of eye image of 6 test subjects, and the time comparison is shown in Table 2 below.

Table 2

Time comparison
Tester	Image cropping time/s	No image cropping time/s	Time reduction rate/%
1	0.1892	0.2153	12.12
2	0.1830	0.2103	12.98
3	0.1877	0.2174	13.66
4	0.1899	0.2235	15.03
5	0.1861	0.2154	13.60
6	0.1832	0.2137	14.27
average value	0.1865	0.2159	13.61

It can be seen in the table that the pupil detection time was reduced by 13.61% after image cropping, which effectively improved the detection time.

5.2 Pupil positioning and fitting situation

After the incorrect pupil was screened, the correct pupil fitting situation was obtained, and the ellipse fitting boundary and the pupil edge coincided well with the correct pupil fitting image.

Figure 9(a) is the incorrect pupil fitting diagram, and Fig. 9(b) is the correct pupil fitting diagram. It can be seen in the figure that after the incorrect pupil condition was screened, the incorrect fitting of the corner of the eye was removed, and the correct pupil fitting was obtained. The pupils in different positions are fitted correctly.

The eye diagrams of 6 testers were counted, and 1,800 eye diagrams of each tester were tested. The correct rate of pupil detection was calculated as shown in Table 3.

Table 3

Correct rate of pupil detection
Tester Number	The correct rate of pupil detection in this article
1	98.95
2	93.59
3	97.97
4	96.83
5	96.83
6	99.28
average value	97.24

Table 3 shows that the average correct rate of pupil detection was 97.24%, and there were differences between individuals of different testers. In comparison, tester No. 2’s pupil detection accuracy rate was low because the pupil position in the eye diagram was too deviated from the camera position, which caused the pupil to be deformed and did not conform to the pupil characteristics; there is also a part of the pupil that overlapped with the upper eyelid. In general, the incorrect pupil screening method had a higher correct rate for most testers.

5.3 Blink situation

In the experiment, we counted the blinking conditions of multiple testers’ videos, and compared the number of blinks calculated by the algorithm with the actual blinking conditions. The data are shown in Table 4 below. From the data, for the blinking conditions of different testers under uncertainty, the accuracy of the algorithm statistics was 91.59%. The correct rate of blinking of tester No. 5 was 65%, which was due to the squinting state of the tester during the shooting process, which led to incorrect statistics in judging the blinking action.

Table 4

Blink statistics
Tester	Test duration/s	Blink count/pcs	True count/pcs	Correct rate/%
1	22	13	14	92.86
2	48	19	19	100
3	28	19	19	100
4	29	11	12	91.67
5	30	20	13	65
6	78	16	16	100
average value				91.59

A pupil detection algorithm with image cropping and false pupil screening was proposed, and blink detection was realized. Image cropping was performed by training the model, which reduced the number of calculations for image processing and processing time. The use of gray information, long and short axis lengths and contour area parameter feature screening conditions improved pupil detection accuracy. The algorithm was tested on different subjects, and the average positioning time for each picture was 0.1865 s. The accuracy rate of pupil detection was 97.24%, and the accuracy rate of blink detection was 91.59%. Experiments show that the algorithm can effectively and accurately detect and locate pupils. The algorithm in this paper can be applied to the near-eye device platform, and can also be applied to the fields of psychology, human-computer interaction, etc., to provide a choice of applications.

Data availability

The datasets generated during and/or analysed during the current study are available in the [GitHub] repository, [https://github.com/Lin-546/pupil-detection.git]

Acknowledgments

The research was supported by National Key R&D Program of China (2020YFB2007501) and the Open Foundation of Shanghai Key Laboratory of Online Test and Control Technology (Grants No.ZX2021103)

Martin, C., Cegarra, J., & Averty, P.. (2011). Analysis of Mental Workload during En-route Air Traffic Control Task Execution Based on Eye-Tracking Technique. In Practical Aspects of Declarative Languages (pp. 592–597). Practical Aspects of Declarative Languages. https://doi.org/10.1007/978-3-642-21741-8_63
Debeljak, M., Ocepek, J., & Zupan, A.. (2012). Eye Controlled Human Computer Interaction for Severely Motor Disabled Children. In Practical Aspects of Declarative Languages (pp. 153–156). Practical Aspects of Declarative Languages. https://doi.org/10.1007/978-3-642-31534-3_23
Morgante, J. D., Zolfaghari, R., & Johnson, S. P.. (2012). A Critical Test of Temporal and Spatial Accuracy of the Tobii T60XL Eye Tracker. Infancy, 17(1), 9–32. https://doi.org/10.1111/j.1532-7078.2011.00089.x
Hennessey, C., & Fiset, J.. (2012). Long range eye tracking. https://doi.org/10.1145/2168556.2168608
Cognolato, M., Atzori, M., & Müller, H.. (2018). Head-mounted eye gaze tracking devices: An overview of modern devices and recent advances. Journal of Rehabilitation and Assistive Technologies Engineering, 5, 205566831877399. https://doi.org/10.1177/2055668318773991
Min-Allah, N., Jan, F., & Alrashed, S.. (2021). Pupil detection schemes in human eye: a review. Multimedia Systems, 27(4), 753–777. https://doi.org/10.1007/s00530-021-00806-5
Huang, C.-W., Jiang, Z.-S., Kao, W.-F., & Huang, Y.-L.. (2013). Low-Cost and High-Speed Eye Tracker. In Lecture Notes in Electrical Engineering (pp. 421–427). Lecture Notes in Electrical Engineering. https://doi.org/10.1007/978-1-4614-6747-2_50
Eivazi, S., Santini, T., Keshavarzi, A., Kübler, T., & Mazzei, A.. (2019). Improving real-time CNN-based pupil detection through domain-specific data augmentation. https://doi.org/10.1145/3314111.3319914
Han, S. Y., Kwon, H. J., Kim, Y., & Cho, N. I.. (2020). Noise-Robust Pupil Center Detection Through CNN-Based Segmentation With Shape-Prior Loss. IEEE Access, 8, 64739–64749. https://doi.org/10.1109/access.2020.2985095
Antonioli, L., Pella, A., Ricotti, R., Rossi, M., Fiore, M. R., Belotti, G., Magro, G., Paganelli, C., Orlandi, E., Ciocca, M., & Baroni, G.. (2021). Convolutional Neural Networks Cascade for Automatic Pupil and Iris Detection in Ocular Proton Therapy. Sensors, 21(13), 4400. https://doi.org/10.3390/s21134400
Hu, Z., Zhang, Y., Zhao, Y., Cao, L., Bai, Y., & Huang, M. (2017). Fish eye recognition based on weighted constraint AdaBoost and pupil diameter automatic measurement with improved Hough circle transform. Transactions of the Chinese Society of Agricultural Engineering, 33(23), 226–232.
Lin, Y.-N., Hsieh, T.-Y., Huang, J.-J., Yang, C.-Y., Shen, V. R. L., & Bui, H. H.. (2020). Fast Iris localization using Haar-like features and AdaBoost algorithm. Multimedia Tools and Applications, 79(45–46), 34339–34362. https://doi.org/10.1007/s11042-020-08907-5
Wan, Z.-H., Xiong, C.-H., Chen, W.-B., & Zhang, H.-Y.. (2021). Robust and accurate pupil detection for head-mounted eye tracking. Computers & Electrical Engineering, 93, 107193. https://doi.org/10.1016/j.compeleceng.2021.107193
Wildes, R. P.. (1997). Iris recognition: an emerging biometric technology. Proceedings of the IEEE, 85(9), 1348–1363. https://doi.org/10.1109/5.628669
Daugman, J. G.. (1993). High confidence visual recognition of persons by a test of statistical independence. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(11), 1148–1161. https://doi.org/10.1109/34.244676
Nugroho, R. H., Nasrun, M., & Setianingsih, C.. (2017). Lie detector with pupil dilation and eye blinks using hough transform and frame difference method with fuzzy logic. https://doi.org/10.1109/iccerec.2017.8226697
Javadi, A. H., Hakimi, Z., Barati, M., Walsh, V., & Tcheang, L. (2015). SET: a pupil detection method using sinusoidal approximation. Frontiers in neuroengineering, 8, 4.
King, D. E. (2009). Dlib-ml: A machine learning toolkit. The Journal of Machine Learning Research, 10, 1755–1758.
Dadi, H. S., Pillutla, G. K. M., & Makkena, M. L.. (2018). Face Recognition and Human Tracking Using GMM, HOG and SVM in Surveillance Videos. Annals of Data Science, 5(2), 157–179. https://doi.org/10.1007/s40745-017-0123-2
Flores Calero, M. J., Aldas, M., Lazaro, J., Gardel, A., Onofa, N., & Quinga, B.. (2019). Pedestrian Detection Under Partial Occlusion by using Logic Inference, HOG and SVM. IEEE Latin America Transactions, 17(09), 1552–1559. https://doi.org/10.1109/tla.2019.8931190
Huang, X., Ti, C., Hou, Q.-Z., Tokuta, A., & Yang, R.. (2013). An experimental study of pupil constriction for liveness detection. https://doi.org/10.1109/wacv.2013.6475026
Cech, J., & Soukupova, T. (2016). Real-time eye blink detection using facial landmarks. Cent. Mach. Perception, Dep. Cybern. Fac. Electr. Eng. Czech Tech. Univ. Prague, 1–8.
Fitzgibbon, A., Pilu, M., & Fisher, R. B.. (1999). Direct least square fitting of ellipses. IEEE Transactions on Pattern Analysis and Machine Intelligence, 21(5), 476–480. https://doi.org/10.1109/34.765658
Goni, S., Echeto, J., Villanueva, A., & Cabeza, R.. (2004). Robust algorithm for pupil-glint vector detection in a video-oculography eyetracking system. https://doi.org/10.1109/icpr.2004.1333928
Lin, L., Pan, L., Wei, L., & Yu, L.. (2010). A robust and accurate detection of pupil images. https://doi.org/10.1109/bmei.2010.5639646

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

Pupil and blink detection algorithms for wearable eye tracking system

Status:

Version 1

Abstract

Figures

Introduction

1. Eye Image Cropping

1.1 Training data parameter setting

1.2 Training the detector

2. Blink Detection

3. Pupil Fitting And Screening Analysis

3.1 Threshold of grayscale parameters

3.2 Long and short axis parameter and area parameter setting

4. Experimental Design

4.1 Eye image capture equipment

4.2 Experimental platform

4.3 Experimental procedure

5. Analysis Of The Experimental Results

5.1 Image cropping

5.2 Pupil positioning and fitting situation

5.3 Blink situation

6. Conclusion

Declarations

References

Additional Declarations

Status:

Version 1