Face Recognition

This project demonstrates the implementation of a face recognition system using Siamese networks. Siamese networks are widely used in face recognition tasks because they can effectively measure the similarity between two images, making them ideal for verification and identification purposes.

Algorithm

Siamese Networks

A Siamese network consists of two identical subnetworks that share the same architecture and parameters. These twin networks process two separate inputs in parallel.
Siamese networks are primarily used for comparing pairs of data points. The inputs can be images, text, or any other kind of data that can be represented numerically.
Each twin network converts its input into a lower-dimensional representation known as an “embedding.” The embedding is a vector of numerical values that captures essential features of the input data. The network is trained to ensure that the embeddings of similar input pairs are close in the embedding space, and those of dissimilar pairs are far apart.
One crucial aspect of Siamese networks is that the twin networks share the same set of weights. This weight sharing ensures that the two networks learn to map inputs to embeddings in a consistent and compatible manner.
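
The weight sharing described above can be illustrated with a minimal sketch. It is not the project's SiameseNetwork class: the single linear layer with tanh, the dimensions, and the helper names (make_weights, embed, siamese_distance) are all assumptions chosen to keep the example small. The point it shows is that both inputs pass through the same function with the same weights before their embeddings are compared.

```python
import math
import random

def make_weights(in_dim, out_dim, seed=0):
    """Create one weight matrix (out_dim x in_dim) that both twins share."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(in_dim)] for _ in range(out_dim)]

def embed(x, weights):
    """Map an input vector to a lower-dimensional embedding (linear layer + tanh)."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in weights]

def euclidean_distance(a, b):
    """Euclidean distance between two embeddings of equal length."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

def siamese_distance(x1, x2, weights):
    """Embed both inputs with the SAME weights, then compare the embeddings."""
    return euclidean_distance(embed(x1, weights), embed(x2, weights))

# Hypothetical 8-dimensional inputs mapped to 3-dimensional embeddings.
shared = make_weights(in_dim=8, out_dim=3)
x = [0.1 * i for i in range(8)]
y = [0.1 * (7 - i) for i in range(8)]

print(siamese_distance(x, x, shared))  # identical inputs give 0.0
print(siamese_distance(x, y, shared))  # different inputs give a nonnegative distance
```

Because there is only one set of weights, the distance is symmetric in its two inputs, and any update to the weights changes both branches at once.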

Contrastive Loss

The contrastive loss is the loss function used to learn the similarity function.
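
In its commonly used form, the contrastive loss for a single pair of inputs can be written as follows. The margin m is an assumption here: it is part of the standard formulation, but this document does not name it or give its value.

```latex
L(W, Y, X_1, X_2) = (1 - Y)\,\frac{1}{2}\,D_w^{2} + Y\,\frac{1}{2}\,\bigl[\max(0,\; m - D_w)\bigr]^{2}
```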

The loss depends on D_w, the Euclidean distance between the outputs of the sister networks, and on the pair label Y: Y is 0 if the first and second image are from the same class, and 1 otherwise.
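
As a rough sketch of how this loss behaves, the function below computes the standard contrastive loss for one pair of embeddings using the label convention above (0 for same class, 1 for different). It is not the project's ContrastiveLoss class. The margin value of 2.0 and the function names are assumptions: same-class pairs are penalized by half the squared distance, and different-class pairs by half the squared shortfall below the margin.

```python
import math

def euclidean_distance(a, b):
    """Euclidean distance between two embeddings of equal length."""
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

def contrastive_loss(emb1, emb2, y, margin=2.0):
    """Contrastive loss for one pair; y = 0 for same class, y = 1 for different.

    The margin default is an assumed value, not taken from the project.
    """
    d = euclidean_distance(emb1, emb2)
    same_term = (1 - y) * 0.5 * d ** 2                 # pulls same-class pairs together
    diff_term = y * 0.5 * max(0.0, margin - d) ** 2    # pushes different-class pairs apart
    return same_term + diff_term

a = [0.0, 0.0]
b = [3.0, 4.0]  # Euclidean distance from a is 5.0

print(contrastive_loss(a, b, y=0))  # same-class pair far apart: 12.5
print(contrastive_loss(a, b, y=1))  # different-class pair beyond the margin: 0.0
print(contrastive_loss(a, a, y=1))  # different-class pair at distance 0: 2.0
```

Different-class pairs that are already farther apart than the margin contribute no loss, so training effort goes to the pairs that are still too close.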

Implementation

Siamese Network: The core of the project is the Siamese network model. This model consists of convolutional and fully connected layers that learn to extract discriminative features from face images. It is defined in the SiameseNetwork class.
Contrastive Loss: The training of the Siamese network is guided by the contrastive loss, defined in the ContrastiveLoss class. This loss function encourages the network to reduce the distance between embeddings of images of the same person while increasing the distance between embeddings of images of different people.
Custom Dataloader: The project uses the Olivetti Faces dataset, which is loaded and preprocessed by the OlivettiFaces class. It prepares pairs of face images with labels (0 for the same person, 1 for different people) for training and evaluation.
Training Loop: The training loop trains the Siamese network using the prepared dataset. It optimizes the model using the Adam optimizer and records the loss history.
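
The pair preparation step can be sketched as follows. This is an illustration, not the project's OlivettiFaces class: the make_pairs helper, the alternating same/different sampling, and the fixed seed are assumptions. The label layout reflects the Olivetti Faces dataset, which contains 400 images of 40 people with 10 images each. The sketch works on identity labels only and returns image indices with the pair label (0 for the same person, 1 for different people).

```python
import random

def make_pairs(labels, num_pairs, seed=0):
    """Sample (index_a, index_b, pair_label) triples from identity labels.

    pair_label is 0 when both images show the same person, 1 otherwise.
    Same-person and different-person pairs alternate to keep the set balanced.
    """
    rng = random.Random(seed)
    by_person = {}
    for idx, person in enumerate(labels):
        by_person.setdefault(person, []).append(idx)
    people = sorted(by_person)
    # Only people with at least two images can form a same-person pair.
    multi = [p for p in people if len(by_person[p]) >= 2]

    pairs = []
    for n in range(num_pairs):
        if n % 2 == 0:
            person = rng.choice(multi)
            i, j = rng.sample(by_person[person], 2)  # two distinct images
            pairs.append((i, j, 0))
        else:
            p1, p2 = rng.sample(people, 2)           # two distinct people
            pairs.append((rng.choice(by_person[p1]), rng.choice(by_person[p2]), 1))
    return pairs

# Identity labels laid out like Olivetti Faces: 40 people, 10 images each.
labels = [person for person in range(40) for _ in range(10)]
pairs = make_pairs(labels, num_pairs=6)
for i, j, y in pairs:
    print(i, j, y)
```

In a training loop, each triple would be turned into two images and a label, the two images passed through the shared network, and the resulting distance and label fed to the contrastive loss.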

Results

The model achieved an accuracy of 95.5%.
