Self-supervised Learning

<aside> ❓

What is Collapse?

Collapse happens when the model fails to learn diverse, informative embeddings and instead maps every input to (nearly) the same vector, treating “everything as the same”. To prevent collapse:

BYOL (Bootstrap Your Own Latent)

image.png
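A minimal sketch of the two BYOL ingredients usually credited with avoiding collapse without negative pairs: the online network's prediction is regressed onto the target network's projection (which is treated as a constant, i.e. stop-gradient), and the target weights follow the online weights as an exponential moving average. The function names `byol_loss` and `ema_update`, the plain-list "parameters", and the value of `tau` are illustrative assumptions, not taken from these notes.

```python
import math

def l2_normalize(v):
    # Assumes v is not the zero vector.
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]

def byol_loss(online_prediction, target_projection):
    """Normalized MSE between the two vectors, equal to 2 - 2 * cosine similarity."""
    p = l2_normalize(online_prediction)
    z = l2_normalize(target_projection)  # stop-gradient side: no update flows through it
    return 2.0 - 2.0 * sum(a * b for a, b in zip(p, z))

def ema_update(target_params, online_params, tau=0.99):
    """Move each target weight a small step toward the matching online weight."""
    return [tau * t + (1.0 - tau) * o for t, o in zip(target_params, online_params)]

# Two views whose embeddings point the same way give (almost) zero loss,
# orthogonal embeddings give a loss of about 2.
aligned = byol_loss([1.0, 0.0], [2.0, 0.0])
orthogonal = byol_loss([1.0, 0.0], [0.0, 3.0])
new_target = ema_update([0.0, 1.0], [1.0, 1.0], tau=0.9)
print(aligned, orthogonal, new_target)
```

In a real training loop only the online network receives gradients; the target is updated solely through `ema_update` after each optimizer step.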

SSL: DINO

image.png

image.png
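A rough sketch of the DINO objective, under the usual description of the method: the teacher output is centered (a running mean is subtracted) and sharpened with a low temperature, the student output uses a higher temperature, and the loss is the cross-entropy between the two distributions. Centering and sharpening together are what keep the outputs from collapsing. The names `dino_loss` and `update_center` and all default values are illustrative assumptions.

```python
import math

def softmax(logits, temperature):
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def log_softmax(logits, temperature):
    # Computed directly in log space to avoid log(0) on very small probabilities.
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    log_total = math.log(sum(math.exp(s - m) for s in scaled))
    return [s - m - log_total for s in scaled]

def dino_loss(student_logits, teacher_logits, center,
              student_temp=0.1, teacher_temp=0.04):
    """Cross-entropy H(teacher, student); the teacher side gets no gradient."""
    centered = [t - c for t, c in zip(teacher_logits, center)]
    teacher_probs = softmax(centered, teacher_temp)       # centering + sharpening
    student_log_probs = log_softmax(student_logits, student_temp)
    return -sum(pt * ls for pt, ls in zip(teacher_probs, student_log_probs))

def update_center(center, teacher_batch, momentum=0.9):
    """Running mean of teacher outputs over batches."""
    n = len(teacher_batch)
    batch_mean = [sum(row[i] for row in teacher_batch) / n for i in range(len(center))]
    return [momentum * c + (1.0 - momentum) * m for c, m in zip(center, batch_mean)]

center = [0.0, 0.0, 0.0]
teacher = [2.0, 0.0, 0.0]
agree = dino_loss([2.0, 0.0, 0.0], teacher, center)
disagree = dino_loss([0.0, 2.0, 0.0], teacher, center)
print(agree < disagree)
```

As in BYOL, the teacher weights themselves would be an exponential moving average of the student weights rather than being trained directly.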

Multi-crop

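Multi-crop essentially makes the model learn the same concept from different viewpoints: several crops of one image, taken at different scales, should map to consistent embeddings.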

image.png
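To make the idea concrete, here is a small sketch that cuts a few large "global" crops and several small "local" crops from one image. It assumes the image is a plain 2D list of pixel values. The crop counts and area ranges are illustrative assumptions, and a real pipeline would also resize each crop to a fixed resolution and apply colour augmentations, which are omitted here.

```python
import random

def random_crop(image, scale_range, rng):
    """Cut a random window covering roughly `scale` of the image area."""
    height, width = len(image), len(image[0])
    scale = rng.uniform(*scale_range)      # fraction of the area to keep
    side = scale ** 0.5                    # same fraction applied to each side
    crop_h = max(1, int(height * side))
    crop_w = max(1, int(width * side))
    top = rng.randint(0, height - crop_h)
    left = rng.randint(0, width - crop_w)
    return [row[left:left + crop_w] for row in image[top:top + crop_h]]

def multi_crop(image, n_global=2, n_local=6, seed=0):
    """Return (global_views, local_views) for one image."""
    rng = random.Random(seed)
    global_views = [random_crop(image, (0.5, 1.0), rng) for _ in range(n_global)]
    local_views = [random_crop(image, (0.05, 0.5), rng) for _ in range(n_local)]
    return global_views, local_views

# Toy 32x32 "image" whose pixel value encodes its position.
image = [[r * 32 + c for c in range(32)] for r in range(32)]
global_views, local_views = multi_crop(image)
print([len(v) for v in global_views], [len(v) for v in local_views])
```

In DINO, all crops are passed through the student while only the global crops go through the teacher, so the student has to match a whole-image view from a small local patch.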

Global Embedding Methods Summary

image.png

Breaking Down the Components of the Joint/Global-Embedding Architecture

This diagram represents the core architecture behind self-supervised learning (SSL) methods like SimCLR, MoCo, BYOL, VICReg, and DINO. Let’s go through each part in depth.


1️⃣ Data Augmentation: Generating Different Views


2️⃣ Encoder: Extracting Features