Support Vector Machine Based Feature Extraction for Gender Recognition From Objects Using Lasso Classifier

DOI: https://doi.org/10.21203/rs.3.rs-17037/v2

Abstract

The blind people has their difficulty to identify the object moving around them, therefore with a high accuracy score object detection and human face recognition system will helps them in identifying the things around them with ease. In this research article,a minimum distance trainer for feature selection by accessing SVM feature optimization process, then LASSO classifier applied to perform object recognition and gender classification. Database of 100 images (50 male and 50 female face images considered from 5 different databases) and 10 categories of vehicle types are used for gender and vehicle recognition and classification. Original face image database used for the gender classification. This approach was implemented with dual classification model [(1) Recognizing or classifying human faces from various objects and (2) Classifying gender through face recognition] is made possible with the help of combining modified SIFT feature in combination with ridge regression (RR), elastic net (EN), lasso regression(LR) and lasso regression with Gaussian Support Vector Machines (LRGS) based classificatioins. The final classification results accurate are as follows RR- 89.6%, EN- 93.5%, LR-93.2% and the proposed approach is LRGS with 98.4% accurate detection rate with rediction names.

Introduction

Sex or gender is a feature that helps to identify a person initially. Gender based separation among humans is classified into two: male and female. Distinctive gender groups might also show off diverse propensities. For instance, younger guys and young girls may like toys; human beings may have an inclination towards a variety of clothes varying in colours anf styles.In enterprise understanding, to know the gender (and age) of the customers may also assist business chiefs to publicize their objects as indicated by using numerous purchaser groups and accumulate sizeable statistic facts about the clients, for instance, the number of women enteriup during a retail region or a shopping centre inside a given period.In picture healing, gender (or age) can be a treasured semantic concept for photo association and search, as coming across men or women in specific scenes. In social collaborations or exercises, guys and ladies can also perform contrastingly and have numerous jobs. In this manner perceiving the gender of a person has extensive variations. People can see the gender of each other. It has been a dynamic research in brain science to reflect on consideration on the effect of gender on people [1–3]. In brain technology about, the development is usually the face photographs for gender popularity. In the visible computational investigation for gender characterization or acknowledgement, a large portion of the current works applied face pictures. For example, see [4–8]. As of past due, some methodologies, for example, [9, 10], established that human bodies must used be for gender grouping. Victories display on unmarried dataset images wherein a bodily drawn box containing someone walking utilized all round for frame records extraction. The upsides of using body over face pictures for gender acknowledgement include the accompanying list.

 (1) Image Resolution. When the face locale has a low dream or is motion obscured, the face highlights in all likelihood won't be usable for gender order. Be that as it can, the self-belief may also at gift help isolate guys from ladies.

(2) Viewpoint Change. At the factor when the head present is altogether one of a kind from frontal perspectives, the face-primarily based on gender acknowledgement may additionally have low execution or even can not be appropriate. Notwithstanding, self-perception can at gift use. Indeed, even the returned angle on the frame can be used for gender acknowledgement [9, 10].

(3) Acquisition Distance. At the factor when the camera is some distance away from the person, the face photograph won't supply good enough records to gender segregation. Be that as it could, self-belief may additionally even now be usable.

(4) Occlusion. At the point, when the facial part is blocked, the face image probable might not be applied to do away with gender statistics. Be that as it can, the self-notion is as but useful, in any event, while a person features are impeded. Be that as it could, in the ones spearheading ways to deal with sex acknowledgement from the body, as an instance, [9, 10], merely the upstanding self-perceptions with the complete frame look explored. The information normally applied are individual strolling images, as an instance, the MIT passer by database [11],can be named which is an interactive database for walker identity. One pressing necessity is that the whole body in upstanding shows up in each picture, as regarded in Figure 1, and in this way, there is no compelling motive to pressure over to adjust the self-perceptions in coordination. Highlights can be extricated from the entire self-perceptions and used to put together a classifier.

The organisation of this paper is as follows, in section various machine learning schemes applied for object detection and face based gender recogntion were identified. In sectioin 3, SVM based feature extraction in combinatioin with MSIFT (Multi Scale Invariant Feature Transform) demonstrated along with LASSO based regession classification analysis, in section 4 various dataset based results were plotted alongwith simulation setup.Section 5, concludes the paper with breife note on how our research paper finds a solution with performance metrics.

Literature Survey

One of the simple issues with any photo making prepared or PC vision undertaking is the clear up of a becoming language for addressing the realities. Face association, carries the extraction of a collection of parameters or estimations that permit portraying of the variations between human beings, instead of the sorts amongst snapshots of individuals.

This method is authentic notably after the photographs have been coded with the component of the dismissal of non-facial variations as the internal man or woman, regardless need to be banned. A regularly carried out method for characterization is Support Vector Machine (SVM) [16,18]; those tune in to the ideal hyper-flying system for keeping apart several getting ready by way of the method for confining the peril of mis-association. Recognization method is regularization or subject based strategy. SVMs have in recent times been used for face grouping [10, 14], displaying dependably better.These compositions grew to become out to be maximum of the way maintained through Ministry of Education, Culture, Sports, Science and Technology Grant-in-Aid for Scientific Research(B) 15300076. The makers are thankful to the Softopia Japan Foundation for the HOIP dataset. Prevalence execution than famous nearest neighbour strategies. Facial sex plan has, also, been taken into consideration [3, 6, 9, 11]. Likewise, a non-instantly SVM based honestly classifier has appeared to make through and via higher outcomes than particular frameworks [12].

The sizeable separation between sexual orientations changed into approval to be strikingly twisted, and this comes to be the pinnacle gone to utilize the help of vector attitude. Regardless, none of those structures chooses to become, robust estimations at which point to organize the countenances. In this paper, a machine for showing up highlight the determination-making use of SVMs delineated wherein the improvement occurs in the parameter location.

Using the board model throughout the eccentrics work, a pitiful dating of features made with a considerable scope of the classifier's parameter is assessed to be 0. The capabilities are greater vigorous than SVM estimations and are efficaciously among the considered. This tool has been named Lasso regression[15], or dynamic [1] or purpose intrigue [4] characterization.

When acting characteristic devotion, it's miles simple to realize the trustworthiness of the element choice method. This approach can do with the aid of discovering the most elements with bunches of complicated insufficient classifiers. In this paper, a different approach for finding the entire relationship of small SVMs is portrayed, which produces internationally perfect classifiers for the development of lin-ear programming sub-problems.

It is then suggested to ap-contract Automatic Relevance Detection (ARD) [13] and choose a lone rendition it is enhancing the probability of the practice statistics explained over the various stages of scantiness, while it is the far some distance normal that numerous portrayals and an ever-growing number of versatile amounts supply the advanced arrangement exactness.

The simple thought system in this paper is to depict how the individual which wants, and between congruity of confirmation to bring about talented highlights works of art and smash down their duties in a prime dimension between contain a choice problem.

Most of the approaches mentioned in this section were focused on gender recognition with the help of various classifiers and most of the approaches were applied on nominal databases. Therefore in this scenario, in this research article a new dataset was implemented using the college student database (SRK college, Vijayawada) to proceed for the real time extinction of the research work.

The major contributions of the all other researchers were stopped with just detecting the gender with 2 class identification approach but here in this research article a combination of 13 classes which supports in identifying the objects and human faces will separated in them and the gender will be identified. This made other researchers to think on blind vision people problem solution tracing.

Proposed Methods

Due to technical limitations, the Proposed Methods section is only available as a download in the supplementary files.

Results and Discussion

4.1 Datasets Performance metrics with Description:

FG-NET Database: The face and gesture recognition network (FG-NET) database discharged in 2004. The FG-NET database is a freely accessible picture database containing face pictures of several subjects at various ages not just for age estimation for gender based extraction at various ages. The database contains 1002 pictures from 82 distinct subjects with ages extending between infants to 69 years of ages. Be that as it may, ages between zero to 40 years are the most populated in the database.

ORL Database: ORL(Olivetti Research Laboratory) database[19] contains many face images taken between April 1992 and April 1994 at the lab. The database utilised with regards to a face based gender extraction task did in a joint effort with the Speech, Vision and Robotics Group of the Cambridge University Engineering Department. There are ten unique images of every one of 40 particular subjects. For certain subjects, the images were taken on various occasions, differing the lighting, outward appearances and facial subtleties. Every one of the images taken against a dim homogeneous foundation with the subjects in an upright, frontal position. Ten various images of every one of 40 particular subjects. For certain subjects, the images were taken on various occasions, fluctuating the lighting, outward appearances.

FERET Database: The FERET database gathered in 15 sessions between August 1993 and July 1996. The database contains 1564 arrangements of pictures for a sum of 14,126 pictures that incorporates 1199 people and 365 copy sets of pictures. A copy set is a moment set of pictures of an individual as of now in the database and typically taken on an alternate day. The Facial Recognition Technology (FERET) database is a dataset utilised for facial based gender extraction framework assessment as a significant aspect of the Face Recognition Technology (FERET) program. The FERET database fills in as a standard database of facial pictures for specialists to use to create different calculations and report results.

LFW database: (LFW) Labeled Faces in the Wild, a database of face photos intended for examining the issue of the unconstrained face-based gender extraction. The informational collection contains more than 13,000 pictures of appearances gathered from the web. Each face has marked with the name of the individual imagined. One thousand six hundred eighty of the general population imagined having at least two unmistakable photographs in the informational collection.

Private Database: private database contains various gender orientations and also had various ages of coloured facial images with proper resolution. Human images are accumulated through the camera with quality facial images with 275*314 dimensional measurements and also 254 dpi resolutions. These images are in JPEG format with reasonable contrast and white balance. The database provided 259 images gathered for face recognition. Simulation SetupIn this section, the paper describes the experimental setup for different objects based on feature extractions concerning facial and other related images. For simulation MATLAB latest version with minimum 4GB RAM and 250 HD for processing multi-label images. Implementation of design for uploading data sets shown in figure 7.

Matched and mis-matched class detection is carried out by various algorithms and compared with performance metrics like MSE, RMSE, SSE, PRECISION, RECALL, FMEASURE, ACCURACY and TIME for various datasets. In figrue 4, to show case the performance of different classification algorithms naive bayes (NBC), Random Forest (RFC), Gaussian SVM (GSVM), Fuzzy C Means Gaussian SVM (FGSVM) [20],

Multivariant ridge logistic regression (RIDGE), Multivariant elastic net logistic regression (ELASTIC), Multivariant Lasso logistic regression(LASSO)Here data set size varied based results are tabulated for accuracy and Time for various datasets mentioned in the intial stages of results section.

Performance Metrics:

The Metrics used to compare the performace of various algorithms for feature extraction, selection and classification are Mean Square Error, Accuracy, Precision, Recall, Sum of Squared Error, Root Mean Square Error, F measure.

Their formulation is dependent on true positive, true negative, false positive and false negative conditions. These were selected during the classification of the images and were mathematically represented using the below equations.

See equations 8 - 14 in the supplementary files.

From Tabe 4 to Table 8 Accuracies were compared with different databases under different classifiers was presented. Outratedly our proposing approach delivers the best accuracy out of all algorithms with a minimum of 3% best rate

 

Conclusion

In this paper, 5 different databases are tested with different algorithms and some thechniques are compared with with repect to references. In results section of this paper, 3 different approaches are compared with multivariant mode. At most of the references are authors implemented FERET databases, to show case the best outcome cited the comparison this dataset results are compared with other results are compared at dataset level. The proposed approach results in best accuracy with less time of execution. In future the enhanced process can be applied on video processing units for real time operations of detection in objects.

Declarations

Availability of data and materials

Not applicable

Competing interests

Not applicable
Funding

Not applicable
• Authors' contributions

Not applicable
• Acknowledgements

Not applicable

Abbreviations

SIFT –Scale Invariant Feature Transform,RR-ridge regression ,EN-elastic net ,LR-lasso regression,LRGS-lasso regression with Gaussian Support Vector Machines ,SVM-Support Vector Machine,MSIFT -Multi Scale Invariant Feature Transform,ARD-Automatic Relevance Detection ,GSVM-Gaussian Support Vector Machine,MSVM-Multi Support Vector Machine,LASSO- Least Absolute Shrinkage and Selection Operator,

PCA-Principle Component Analysis,HOG- histogram of oriented gradients,GLM-generalised linear model ,MSE-Mean Square Error,SSE-Sum of Squared Error,RMSE-Root Mean Square Error,ORL-Olivetti Research Laboratory,FERET-Facial Recognition Technology ,LFW- Labeled Faces in the Wild

FG-NET-The face and gesture recognition network ,NBC-Naive bayes Classifier,RFC-Random Forest Classifier ,FGSVM- Fuzzy C Means Gaussian SVM

References

[1] Nguyen, D. T., & Park, K. R. Enhanced Gender Recognition System Using an ImprovedHistogram of Oriented Gradient (HOG) Feature from Quality Assessment of Visible Light andThermal Images of the Human Body. Sensors, 16(7), 1134. (2016).

[2] Liu. Tianyu, Li. Fei ., & Wang. Rui, "Human face gender identification system based on MB-LBP," the 30th Chinese Control And Decision Conference (2018 CCDC), DOI:10.1109/CCDC.2018.8407405, IEEE Publisher, Shenyang, china, pp. 1721-1725, (2018).

[3] Llukman Cerkezi., & Cihan Topa, "Gender recognition with uniform local binary patterns," 26th Signal Processing and Communications Applications Conference (SIU), doi:10.1109/siu.2018.8404587,IEEE publisher, Izmir, pp. 1-4, 2018.

[4] Francesco Bianconi , Fabrizio Smeraldi, Maryam Abdollahyan ., & Perry Xiao, On the use of skin texture features for gender recognition: An experimental evaluation. 2016 Sixth International Conference on Image Processing Theory,ToolsandApplications (IPTA). doi: 10.1109 /ipta .2016.7821018, IEEE publisher,Oulu, pp. 1-6, 2016 

[5] Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva., & Antonio Torralba"Places: A 10 million image database for scene recognition" IEEE Transactions on Pattern Analysis and Machine Intelligence,40(6),1452-1464,pp.1-14, 2017.

[6] Luis Felipe Zeni., & Claudio Jung "Real-Time Gender Detection in the Wild Using Deep Neural Networks," 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), DOI: 10.1109/SIBGRAPI.2018.00022, IEEE publisher, Parana, Brazil, pp. 118-125, 2018.

[7] Joseph Redmon., & Ali Farhadi "Yolo9000: Better faster stronger" The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), DOI 10.1109/CVPR.2017.690, IEEE computer society,pp.6517-6525,July 2017.

[8] Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy Scott Reed, Cheng-Yang Fu., & Alexander C. Berg "SSD: Single shot multibox detector" Lecture notes in computer science, DOI: 10.1007/978-3-319-46448-0_2, Springer publisher, ECCV 2016, Part I, LNCS 9905, pp. 21-37 2016.

[9] Gil Levi ., & Tal Hassner "Age and Gender Classification Using Convolutional Neural Networks ", IEEE Workshop on Analysis and Modeling of Faces and Gestures (AMFG), at the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Boston ,pp. 34-42 2015.

[10] Rajeev Ranjan, Vishal M. Pate., & Rama Chellappa, Hyperface: A deep multitask learning framework for face detection landmark localization pose estimation and gender recognition, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE , VOL. XX, NO. XX,pp.1-16, 2016.

[11] Mahendran ., & A. Vedaldi "Visualizing deep convolutional neural networks using natural pre-images" International Journal of Computer Vision vol. 120 Issu no. 3 pp. 233-255, 2016.

[12] Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva., & Antonio Torralba, "Learning deep features for discriminative localization,2016 IEEE Conference on computer vision and Pattern Recognition, DOI 10.1109/CVPR.2016.319, IEEE computer society, pp. 2921-2929, 2016.

[13] Sergio Montazzolli, Claudio Jung," Real-Time Brazilian License Plate Detection and Recognition Using Deep Convolutional Neural Networks “30th SIBGRAPI Conference on Graphics, Patterns and Images, doi:10.1109/sibgrapi, IEEE publisher, pp. 55-62, 2017.14.

[14] Cunjian Chen and Arun Ross " A Multi-Task Convolutional Neural Network for Joint Iris Detection and Presentation Attack Detection " 2018 IEEE Winter Applications of Computer Vision Workshops (WACVW), DOI 10.1109/WACVW.2018.00011, IEEE publisher, pp. 44-51 2018.

[15] J. Wang, C. Xu, X. Yang ., & J. M. Zurada, "A Novel Pruning Algorithm for Smoothing Feedforward Neural Networks Based on Group Lasso Method," in IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 5, pp. 2012-2024, May 2018.

[16] Dat Tien Nguyen ., & Kang Ryoung Park. “Enhanced Gender Recognition System Using an Improved Histogram of Oriented Gradient (HOG) Feature from Quality Assessment of Visible Light and Thermal Images of the Human Body”, Sensors (Basel) Article. 2016; v.16 (7):1134, doi:10.3390/s16071134,pp.1-25, Published Jul 21, 2016.

[17] G. D. K. Kishore., & M.Babu Reddy, “Analysis and Prototype Sequences of Face Recognition Techniques in Real-Time Picture Processing”, 6th International Conference on FICTA, Intelligent Engineering Informatics, Advances in Intelligent Systems and Computing, volume 695, doi.org/10.1007/978-981-10-7566-7,Springer publisher,pp.323-335,2018.

[18] D.K. Kishore Galla., & Babu Reddy Mukamalla,” Gender classification based on similarity features through SURF and SVM”, International Journal of Knowledge Engineering and Data Mining, DOI: 10.1504/IJKEDM.2019.097353, INDERSCIENCE Publishers, Vol.6 No.1, pp.89 – 104, 2019.

[19] G. D. K. Kishore., & M.Babu Reddy, “ Detecting Human and classification of Gender using Facial Images MSIFT Features Based GSVM” ,International Journal of Recent Technology and Engineering(IJRTE),pg No:1466-1471,Vol.8,Issue3,Sep2019.

[20]Damodara Krishna Kishore Galla., & BabuReddy Mukkamalla,” CLASSIFICATION OF MULTI-LABEL OBJECT BASED ON MSIFT FEATURE PROBABILISTIC FUZZY C-MEANS CLUSTERING CLASSIFIED BY GSVM” JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES (JMCMS),Pg No:247-263, Vol.-15, No.-4, April -2020.

Tables

Tables 1 and 2 can be found in the Proposed Methods file in the supplementary files section.


Table 3 Accuracy and Time comparison of 3classification algorithms with MSIFT features training based on Multivariant principle for FERET database comparing with Dictioinary feature based approach


 

Dataset size

Training images

Testing images

RIDGE

LASSO

Proposed

Accuracy

Time

Accuracy

Time

Accuracy

Time

170

130

40

92.4

0.59s

94.6

0.6s

99.75

0.48s

280

130

190

91.7

0.59s

94.6

0.6s

98.6

0.48s

350

130

220

91.7

0.59s

92.7

0.6s

98.7

0.48s

500

130

370

90.6

0.59s

91.5

0.6s

97.6

0.48s

750

130

620

90.3

0.59s

89.4

0.6s

97.6

0.48s

Table 4 Accuracy for different classifiers worked on FERET database


 

Technique

Dictionary

SVM-MSIFT

GSVM-MSIFT

LASSO

RIDGE

Proposed

Accuracy

93.86

92.45

94.86

94.23

92.46

98.3


 

Table 5 Accuracy for different classifiers worked on FGNET database


 

Technique

Dictionary

SVM-MSIFT

GSVM-MSIFT

LASSO

RIDGE

Proposed

Accuracy

-

86.45

91.69

93.4

92.46

98.3


 

Table 6 Accuracy for different classifiers worked on ORL database


 

Technique

Dictionary

SVM-MSIFT

GSVM-MSIFT

LASSO

RIDGE

Proposed

Accuracy

-

92.45

94.86

94.23

92.46

98.3


 

Table 7 Accuracy for different classifiers worked on PRIVATE database


 

Technique

Dictionary

SVM-MSIFT

GSVM-MSIFT

LASSO

RIDGE

Proposed

Accuracy

-

92.45

94.86

94.23

92.46

98.3


 

Table 8 Accuracy for different classifiers worked on FERET database


 

Technique

Dictionary

SVM-MSIFT

GSVM-MSIFT

LASSO

RIDGE

Proposed

Accuracy

-

92.45

94.86

94.23

92.46

98.4