Siamese networks are a neural network architecture designed for tasks that involve comparing two inputs for similarity. Instead of predicting a class label directly, they learn an embedding space in which similar inputs lie close together, so the decision reduces to measuring the distance between two embeddings. This architecture is widely used in applications like face verification, image similarity, signature verification, and more.
Key Concepts of Siamese Networks
Architecture:
- A Siamese network consists of two identical subnetworks that share the same architecture and weights.
- Each subnetwork processes one of the two inputs independently.
- The outputs of the subnetworks are combined using a distance metric to determine the similarity between the inputs.
Working:
- Two inputs (e.g., images, text, or feature vectors) are passed through the twin networks.
- The network generates embeddings (feature vectors) for each input.
- A similarity or distance metric (e.g., Euclidean distance, cosine similarity) compares the embeddings.
- Based on the distance, the network determines whether the inputs are similar or dissimilar.
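The steps above can be sketched as a toy forward pass. The linear encoder, its dimensions, and the random inputs below are all illustrative assumptions; the essential point is that both branches apply the *same* shared weights before the distance is computed:

```python
import numpy as np

rng = np.random.default_rng(0)

# Shared weights: both branches use the SAME matrix, which is what
# makes the network "Siamese". (Toy linear encoder for illustration;
# in practice this would be a CNN or other deep subnetwork.)
W = rng.standard_normal((4, 8))  # hypothetical 8-dim inputs -> 4-dim embeddings

def embed(x):
    """Map an input vector to an embedding using the shared weights."""
    return W @ x

def euclidean(a, b):
    """Distance metric used to compare the two embeddings."""
    return np.linalg.norm(a - b)

x1 = rng.standard_normal(8)
x2 = rng.standard_normal(8)

e1, e2 = embed(x1), embed(x2)
distance = euclidean(e1, e2)  # small distance -> similar inputs
```

A threshold on `distance` then yields the similar/dissimilar decision.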
Loss Function:
- Contrastive Loss: Encourages the embeddings of similar inputs to be closer and dissimilar inputs to be farther apart:

  L = (1 - y) * d^2 + y * max(0, m - d)^2

  where y is the label (1 for dissimilar, 0 for similar), d is the distance between the two embeddings, and m is the margin.
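A minimal sketch of contrastive loss for one pair, assuming Euclidean distance and the convention that label 1 marks a dissimilar pair:

```python
import numpy as np

def contrastive_loss(e1, e2, y, margin=1.0):
    """Contrastive loss for a single pair of embeddings.

    y = 0: similar pair   -> penalize large distances (pull together)
    y = 1: dissimilar pair -> penalize distances inside the margin (push apart)
    """
    d = np.linalg.norm(e1 - e2)
    return (1 - y) * d**2 + y * max(0.0, margin - d) ** 2
```

Note that a dissimilar pair already farther apart than the margin contributes zero loss, so the network is not pushed to separate it further.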
- Triplet Loss: Uses three inputs (anchor, positive, and negative) to ensure the anchor is closer to the positive than the negative by a margin.
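Triplet loss can be sketched the same way; the Euclidean distance and default margin below are illustrative choices:

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=1.0):
    """Triplet loss for one (anchor, positive, negative) triple.

    Zero when the anchor is already closer to the positive than to
    the negative by at least `margin`; positive otherwise.
    """
    d_pos = np.linalg.norm(anchor - positive)
    d_neg = np.linalg.norm(anchor - negative)
    return max(0.0, d_pos - d_neg + margin)
```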
Applications:
- Face Verification: Determines if two face images belong to the same person.
- Signature Verification: Verifies if two signatures are from the same person.
- Image Similarity: Measures the similarity between two images.
- One-shot Learning: Learns to recognize new classes with minimal examples by comparing against existing embeddings.
Advantages:
- Handles new categories without retraining by comparing embeddings.
- Suitable for problems with limited labeled data or a large number of classes.
Limitations:
- Training requires carefully constructed pairs of similar and dissimilar inputs.
- Performance depends on the quality of the embedding space and the choice of distance metric.
Example Workflow
Data Preparation:
- Create pairs of inputs with labels indicating similarity (e.g., (input1, input2, label)).
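One simple (assumed) scheme for building such pairs from a class-labeled dataset is to take all pairwise combinations and label each by whether the two classes match:

```python
import itertools

def make_pairs(samples, labels):
    """Build (input1, input2, label) pairs from a labeled dataset.

    Label 0 = same class (similar), label 1 = different class (dissimilar),
    matching the contrastive-loss convention used above.
    """
    pairs = []
    for i, j in itertools.combinations(range(len(samples)), 2):
        pairs.append((samples[i], samples[j], 0 if labels[i] == labels[j] else 1))
    return pairs
```

In practice the set of all combinations is usually subsampled so similar and dissimilar pairs are balanced.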
Network Design:
- Use convolutional layers (for images) or dense layers (for structured data) to create the subnetworks.
Training:
- Train the network with pairs of inputs using contrastive or triplet loss.
Inference:
- Generate embeddings for unseen inputs and compare them using the chosen metric.
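At inference time the decision reduces to thresholding the chosen metric; the threshold value here is an arbitrary placeholder that would be tuned on validation data:

```python
import numpy as np

def verify(e1, e2, threshold=0.5):
    """Return True if two embeddings are close enough to count as 'same'.

    Uses Euclidean distance; the threshold is a hypothetical value
    that would be calibrated on a validation set.
    """
    return bool(np.linalg.norm(e1 - e2) < threshold)
```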
Siamese networks are particularly effective in domains where comparisons are more meaningful than absolute predictions, making them a cornerstone in tasks involving similarity and verification.