Multilayer Perceptron (MLP)
- Supervised learning algorithm
- An MLP learns a non-linear function approximator for either classification or regression, depending on the given dataset.
- In sklearn, we implement MLP using:
- MLPClassifier for classification
- MLPRegressor for regression
- MLPClassifier supports multi-class classification by applying softmax as the output function.
- It also supports multi-label classification.
- MLPRegressor also supports multi-output regression, in which a sample can have more than one target.
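As a small sketch of the multi-label case mentioned above (synthetic data, names are illustrative): MLPClassifier accepts a binary indicator matrix as the target, with one column per label.

```python
# Multi-label sketch: y is a binary indicator matrix, one column per label
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.RandomState(0)
X = rng.rand(100, 6)                    # 100 samples, 6 features
Y = rng.randint(0, 2, size=(100, 3))    # 3 independent binary labels per sample

clf = MLPClassifier(max_iter=500, random_state=0).fit(X, Y)
print(clf.predict(X[:2]).shape)         # (2, 3): one 0/1 prediction per label
```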
MLPClassifier - Implementation
from sklearn.neural_network import MLPClassifier
MLP_clf = MLPClassifier()
MLP_clf.fit(X_train, y_train)
MLP_clf.predict(X_test)
# predict_proba gives a vector of probability estimates per sample
MLP_clf.predict_proba(X_test)
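The calls above can be put together into a minimal end-to-end sketch (synthetic data via `make_classification`; the dataset and settings are illustrative, not from the original notes):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

# Synthetic binary classification problem
X, y = make_classification(n_samples=200, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = MLPClassifier(max_iter=1000, random_state=0)
clf.fit(X_train, y_train)

pred = clf.predict(X_test)           # class labels, shape (n_test,)
proba = clf.predict_proba(X_test)    # probabilities, shape (n_test, n_classes)
```

Each row of `proba` sums to 1, since softmax is applied at the output layer.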
Parameters
- hidden_layer_sizes:
- This parameter sets the number of hidden layers and the number of neurons in each layer.
- It is a tuple whose $i$th element gives the number of neurons in the $i$th hidden layer.
- The length of the tuple is the total number of hidden layers in the network.
MLPClassifier(hidden_layer_sizes=(15,10,5))
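A quick sketch to check the tuple-to-layers mapping (random data, purely illustrative): after fitting, the `n_layers_` attribute counts the input layer, the hidden layers, and the output layer.

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

# 3 hidden layers with 15, 10, and 5 neurons respectively
clf = MLPClassifier(hidden_layer_sizes=(15, 10, 5), max_iter=500, random_state=0)

rng = np.random.RandomState(0)
X = rng.rand(50, 4)                 # 4 input features
y = rng.randint(0, 2, 50)           # binary target
clf.fit(X, y)

print(clf.n_layers_)                # 5 = 1 input + 3 hidden + 1 output
```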
- alpha (regularisation):
- Strength of the L2 regularization term.
- Default: alpha = 0.0001
- activation (for hidden layers):
- One of 'identity', 'logistic', 'tanh', or 'relu'; default: 'relu'.
- solver (weight optimisation):
- One of 'lbfgs', 'sgd', or 'adam'; default: 'adam'.
- batch_size:
- If the solver is 'lbfgs', the classifier does not use minibatches.
- For the stochastic optimizers ('sgd', 'adam'), batch_size (int) sets the size of the minibatches.
- The default batch_size is 'auto', which means:
batch_size=min(200, n_samples)
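A small sketch of setting the parameter explicitly (values are illustrative); 'lbfgs' would ignore it, since it always runs full-batch.

```python
from sklearn.neural_network import MLPClassifier

# batch_size is only used by the stochastic solvers ('sgd', 'adam')
clf = MLPClassifier(solver='adam', batch_size=32, max_iter=300)
print(clf.get_params()['batch_size'])   # 32
```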
Attributes
- coefs_ (weight matrix coefficients):
- It is a list of shape (n_layers - 1,)
- The $i$th element in the list represents the weight matrix corresponding to layer $i$.
- intercepts_ (bias vector):
- It is a list of shape (n_layers - 1,)
- The $i$th element in the list represents the bias vector corresponding to layer $i + 1$.
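Both attributes can be inspected after fitting. A sketch on random data (illustrative; with a binary target, sklearn's output layer has a single logistic unit):

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.RandomState(0)
X = rng.rand(60, 4)                 # 4 input features
y = rng.randint(0, 2, 60)           # binary target

clf = MLPClassifier(hidden_layer_sizes=(15, 10, 5), max_iter=500, random_state=0)
clf.fit(X, y)

# One weight matrix and one bias vector per connection between layers
print([w.shape for w in clf.coefs_])        # [(4, 15), (15, 10), (10, 5), (5, 1)]
print([b.shape for b in clf.intercepts_])   # [(15,), (10,), (5,), (1,)]
```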
MLPRegressor