Shera Jafaritabar

GitHub logo View on GitHub

Project Overview

This self-supervised contrastive learning pipeline offers a unified framework for learning powerful image representations without labels. It supports five leading model families, each leveraging a distinct strategy to build invariance into the learned feature space:

Pipeline Capabilities

Examples of Learned Representations

Below we showcase how the contrastive models capture meaningful structure in image data. Each example includes a UMAP projection and nearest-neighbor retrievals (or hexbin histograms) to illustrate clustering by morphology or physical properties.

🪼 Jellyfish Galaxies

Data Source: Zooniverse – Cosmological Jellyfish
Using galaxy cutouts from this project, we trained a SimCLR model to learn morphology-aware embeddings.

UMAP Projection

2D UMAP of jellyfish galaxies

In this projection, galaxies with similar tail-like morphology cluster tightly, revealing the model’s ability to distinguish visual features purely from contrastive signals.

Nearest Neighbors Visualization

Nearest neighbors for jellyfish galaxies

For each query image, the top-10 nearest neighbors are shown with model-inferred “jellyfish” probability scores. High visual similarity and probabilities confirm robust clustering in embedding space.

🌌 X-ray Galaxy Clusters (TNG-Cluster)

Data Source: TNG-Cluster Simulations
We applied DINO on raw X-ray maps across multiple snapshots to uncover morphological groupings.

UMAP Projection

UMAP of X-ray cluster embeddings

Distinct regions correspond to different cluster morphologies—relaxed, merging, or cool-core systems—demonstrating DINO’s capacity to encode high-level astrophysical features.

Nearest Neighbors Visualization

Nearest neighbors for X-ray clusters

Query cluster (left) and its top-9 nearest neighbors reveal strong morphological consistency, supporting the embedding’s semantic organization.

Hexbin Histograms

Hexbin observables

Observables (e.g., X-ray luminosity, temperature) mapped onto UMAP bins.

Hexbin unobservables

Unobservables (e.g., merger stage, substructure metrics) reveal hidden physical correlations learned without direct supervision.