Overview
This project implements a facial landmark detection system using the Xception model, trained on the ibug dataset. The goal is to accurately detect facial landmarks from images and provide a user-friendly web application for real-time detection.
This model will support real-time inference for applications such as:
- Face recognition
- Emotion detection
- Virtual makeup
- Augmented reality (AR)
data:image/s3,"s3://crabby-images/0e467/0e46733fcc5d6391d64a6506dfea109832faa62c" alt=""
Learning Process
data:image/s3,"s3://crabby-images/082d4/082d4161d14874e6355996305bfb11e9cb41ea0f" alt=""
Dataset used
- Commonly Used Datasets:
- iBUG 300-W: Thousands of images labeled with 68 facial landmarks (eyes, nose, mouth, jawline).
- Applications: Face recognition, emotion analysis, augmented reality.
- Link for the dataset : here
data:image/s3,"s3://crabby-images/207b8/207b8084c0943187c89948e30299cd72cb48e608" alt=""
Procedure
- Augmentation:
- Implemented a
FaceLandmarksAugmentation
class for tailored augmentation techniques:- Cropping
- Random cropping
- Random rotation
- Color jittering
- Key parameters include image dimensions, brightness, and rotation limits.
- Methods like
offset_crop
andrandom_rotation
adjust landmark coordinates accordingly.
- Implemented a
- Preprocessor:
- Initializes augmentation methods.
- Normalizes data.
- Dataset Class:
- Inherits from
Dataset
. - Handles image paths, landmark coordinates, and cropping information parsed from XML files.
- Splits data into training and validation sets.
- Utilizes
DataLoader
for batch processing.
- Inherits from
data:image/s3,"s3://crabby-images/30324/30324f31db8ad7dd6f84697e626f3cc68ccd2ae9" alt=""
Network Design
- Architecture:
- Modular CNN with depthwise separable convolutions to improve efficiency.
- Entry Block: Initial feature extraction.
- Middle Blocks: Residual connections.
- Exit Block: Outputs facial landmark coordinates.
- Enhancements:
- Batch normalization and LeakyReLU for better performance.
- Training Loop:
- Runs for 30 epochs.
- Computes training loss and performs backpropagation.
- Updates weights using an optimizer.
- Validates model after each epoch.
- Saves checkpoints to prevent overfitting and retain the best model.
data:image/s3,"s3://crabby-images/f1c7e/f1c7e298429705f6868b27ca64e8c30ac3b1849f" alt=""
Results
data:image/s3,"s3://crabby-images/a45ff/a45ff85dd393e1dd95590720248e3dc1ccb1024c" alt=""
data:image/s3,"s3://crabby-images/78418/784187d777c9813e70a0a2b8b6a61a8dd3240707" alt=""