It is the commonly used neural network based on supervised learning in which information flows in one direction, and it has no loops. The main objective is to find the optimized function f() that maps input to the desired output and learning the optimized value of bias(θ) for it. Learning occurs in the MLP Using a back propagation algorithm by adjusting the connection weights when there is a deviation from the expected output and the actual output. Their main applications are to solve optimization problems in the area of finance, transportation, fitness and energy.
2.1Architecture, Algorithm and Characteristics of MLP:
It has three layers, namely the input, output and one or more hidden layers. The input layer collects the input features to be processed. An arbitrary number of hidden layers lies between both the input and output layers. They work as the computational unit of the MLP. The output unit performs tasks such as prediction and classification.
Property 1
Universality
MLP is capable of learning both linear as well as non-linear functions. MLPs are designed to approximate any continuous function and can solve problems that are not linearly separable.
Property 2
Adaptive learning and Optimal
MLP can learn how to do tasks from the data given for training and initial experience. MLP minimizes the loss function. Hence it is optimal. Learning the function that maps the inputs to the outputs reduces the loss to an acceptable level.
Property 3
Stochastic
MLP is a stochastic program. In a stochastic program, some or all problem parameters are uncertain and use probability distributions to solve highly complex optimization problems.
Property 4
The power of depth
Compared to shallow ones, deep nets can represent some functions more compactly, such as parity function and a deep network, whose size is linear in the number of inputs computes it.
2.2 Application of MLP:
There are various convolution neural network-based models for remote sensing image classification and better performance. VHR remote sensing image scene classification plays a vital role in remote sensing research; hence they help manage land resources, urban planning, tracking of disasters, and traffic monitoring. Osama A. Shawky et al. (2020) proposed a VHR(Very High Resolution) image scene classification model comprising three phases: Data augmentation to learn robust features, a pre-trained CNN model to extract features from the original image, and an adaptive gradient algorithm multi-layer perceptron to improve the accuracy of the classifier.
With the advent of modern remote sensing technologies, various very fine spatial resolutions (VFSR) dataset is now commercially available. These VFSR images have opened up many opportunities such as urban land use rescue, agriculture, and tree crown description. Zhang, C. et al.(2018) proposed a hybrid classification system that combines the contextually based classifier CNN and pixel-based classifier MLP with a rule-based decision fusion strategy. The decision fusion rules formed based on the confidence distribution of the contextual-based CNN classifier. If the input image patch is at the homogeneous region, the confidence is high.
On the other hand, if the image pixels contains other land cover classes as related information, the confidence is low. As a result, the MLP can rectify the classified pixels with low confidence at the pixel level. This paper also compares the proposed method's performance with benchmark standards such as pixel-based MLP, spectral texture-based MLP, and contextual-based CNN classifiers.
Md Manjurul Ahsan et al. (2020) proposed a hybrid model with a combination of a Convolutional Neural Network (CNN) and Multilayer Perceptron (MLP) in which MLP handles the numerical/categorical data, and CNN extracts features from the X-ray images. Parameter tuning using the grid search method to decide the number of hidden layers, number of neurons, epochs, and batch size. Meha Desai et al.(2020), in their study, compare and analyze the function and designing of MLP and CNN for the application of breast cancer detection and conclude CNN give slightly higher accuracy than MLP.
Vinod Kumar et al.(2020) suggested a hybrid CNN-MLP model that analyzes novel and diversified attacks. The problem of intrusion detection is a classification task using machine learning and deep learning techniques. The model used feature selection and reduction techniques, random forest regressor, along the correlation parameter. The CICIDS2017 dataset used the performance of the proposed model outperforms that of the performance of the individual CNN and MLP models. Hanwen Feng et al.(2020) suggested a CNN model for Classification of Points of Interest in Side-channel attacks and compared it with MLP and concludes MLP is more suitable for PCA traces and CNN is for POI traces; shorter traces improves the classification results. Bikku(2020) proposed a model using MLP to predict future health risk with a certain probability, compare it with LSTM and RNN, and suggest MLP outperforms the other two. Salah, L. B(2019) presented a model to control a bioreactor using deep learning feed-forward neural networks with different MLP structures. The trained model emulates the inverse dynamics of the bioreactor and then uses neural controllers for neural control strategies of the chosen bioreactor. [8]
S..Bairavel et al. (2020) suggested a model for multimodal sentiment analysis using feature-level fusion technique and novel oppositional grass bee optimization (OGBEE) algorithm for fusing the extracted features from different modalities and MLP for classification.[9] Foody. G et al. compared three different neural network approaches, MLP, RBF and PNN, for Thematic mapping from remotely sensed data. For the proposed model, PNN outperforms. [10] Singh, N.H. et al.(2018) designed a model to find the optimal collision-free path and control the robot's speed in a dynamic environment for the mobile robots to reach the destination using MLP. The ultrasonic sensors in the robot sense the obstacle in its path and calculate the distance between them. [11] Meng Wang et al. (2020) devised a model to detect the Distributed Denial of Service (DDoS) attack using MLP with feature selection for optimal feature selection and the Back Propagation algorithm to reconstruct the detector when errors are perceived. The model comprises three modules knowledge base, detection model, and feedback mechanism and MLP act as binary classifier during attack detection.[12]
The model proposed by Morteza Taki et al.(2018) predicts the irrigated and rainfed wheat output energy using artificial network models MLP, RBF and Gaussian Process Regression (GPR). The RBF model performs better than the other two models in predicting wheat output energy under various irrigated and rainfed farms. [13]
Author & Year | The objective of the articles | Experimental design Techniques | Datasets used | Response and Performance measure |
A. Shawky et al. (2020) | Remote sensing image scene classification using CNN-MLP with data augmentation | Augmentation to expand the image dataset., pre-trained CNN(The xception pre-trained on the ImageNet dataset) model for feature extraction, Adaptive gradient with MLP for classification Activation function used RELU and softmax, learning rate 0.01 | VHR image datasets used UC-Merced, Aerial Image (AID), and NWPU-RESISC45 | Accuracy UC-Merced dataset:99.86 AID Dataset: 98.10 NWPU-RESISC45 dataset: 97.40 |
Zhang, C.et al. (2018) | A hybrid MLP-CNN classifier for very fine resolution remotely sensed image classification | contextual-based classifier CNN, pixel-based classifier MLP with shallow structures | Custom dataset with aerial imagery of Southampton with 50 cm spatial resolution and four multispectral bands (Red, Green, Blue and Near Infrared). Two study sites S1 (3087 X 2750 pixels) and S2 (2022 X 1672 pixels) | Accuracy:89.64 |
Md Manjurul Ahsan et al.,(2020) | Deep MLP-CNN Model Using Mixed-Data to Distinguish between COVID-19 and Non-COVID-19 Patients | MLP to analyze and classify numerical or categorical data and cnn to analyze and classify the X ray image Optimization algorithms used:adaptive learning rate optimization algorithm (Adam), stochastic gradient descent (Sgd), and root mean square propagation (Rmsprop). | COVID-19 data set collected from the open-source GitHub repository | For balanced dataset, Adam algorithm achieved highest accuracy of 96.3 For imbalanced dataset Rmsprop algorithm achieved highest accuracy of 95.38 |
Meha Desai et al.,(2020) | An anatomization on breast cancer detection and diagnosis employing multi-layer perceptron neural network (MLP) and Convolutional neural network (CNN) | CNN CNN MLP MLP MLP | BreakHis dataset Mammographic Image Ananlysis Society database Digital database for Screening Mammograms (DDSM) are used Wisconsin breast cancer dataset WDBC dataset | 98.86 82.71 65.21 95.74 97.51 |
Vinod Kumar et al. (2020) | Evaluating Hybrid Cnn-Mlp Architecture For Analyzing Novel Network Traffic Attacks | feature selection and reduction technique - random forest regressor with the correlation parameter, hybrid CNN-MLP architecture for classification | CICIDS2017 dataset | Analyzing the novel and diversified attacks. Accuracy 0.972 Precision 0.9827 Recall 0.9813 |
Hanwen Feng et al.(2020) | MLP and CNN-based Classification of Points of Interest in Side-channel Attacks | CNN and MLP | ANSSI SCA Database (ASCAD) ,SM4 traces | MLP is more suitable for PCA traces and CNN is more suitable for POI traces |
Tulasi Bikku,(2020) | Multi-layered deep learning perceptron approach for health risk prediction | MLP | real historical medical data from the University of California at Irvine (UCI) ML Repository | Better performance than LSTM and RNN |
Salah, L. B(2019) | An anatomization on breast cancer detection and diagnosis employing multi-layer perceptron neural network (MLP) and Convolutional neural network (CNN) | MLP | | neural network structure with two hidden layers gives better performances |
S.Bairavel et al. (2020) | Novel OGBEE-based feature selection and feature-level fusion with MLP neural network for social media multimodal sentiment analysis | Feature-level fusion technique- to fuse the extracted features from different modalities. Novel oppositional grass bee optimization (OGBEE) algorithm -feature selection MLP- for classification | web recordings that utilize audio, video, and textual modalities | Accuracy in sentimental analysis classification 95.2% |
M.Foody et al., (2001) | Thematic mapping from remotely sensed data with neural networks: MLP, RBF and PNN based approaches | MLP, RBF and PNN for classification | Remotely sensed data from ATM dataset | The accuracy of the MLP, RBF and PNN models are 86.56,82.5 and 87.18, respectively. |
Singh, N.H. et al.(2018) | Mobile Robot Navigation Using MLP-BP Approaches in Dynamic Environments | MLP | Real-time system | Identifies the Optimal collision-free path for mobile robots to reach the destination |
Meng Wang et al. (2020) | A dynamic MLP-based DDoS attack detection method using feature selection and feedback, Computers & Security | MLP Feedback mechanism used to detect the error and reconstruct the detector to improve the performance. | ISOT dataset comprises of attack and regular packets and ISCX dataset comprises of normal packets | Classifies the samples as normal or attack |
Morteza Taki et al. (2018) | Assessment of energy consumption and modelling of output energy for wheat production by neural network (MLP and RBF) and Gaussian process regression (GPR) models | MLP, RBF and GPR | | Predicts output energy consumptions in wheat farms, and the RBF model outperforms the other two. |