MeFEm: Medical Face Embedding model
Paper: arXiv 2602.14672
Vision Transformers pre-trained on face data for potential medical applications. Available in Small (MeFEm-S) and Base (MeFEm-B) sizes.
import torch
import timm

# Load model (MeFEm-S example)
model = timm.create_model(
    'vit_small_patch16_224',
    pretrained=False,
    num_classes=0,        # No classification head
    global_pool='token'   # Use CLS token (default)
)
model.load_state_dict(torch.load('mefem-s.pt', map_location='cpu'))
model.eval()

# Forward pass
x = torch.randn(1, 3, 224, 224)  # Your face image
embeddings = model(x)  # [1, 384] CLS token embeddings

# For all tokens (CLS + patches):
model = timm.create_model('vit_small_patch16_224', num_classes=0, global_pool='')
tokens = model(x)  # [1, 197, 384]

# For patch embeddings only:
tokens = model.forward_features(x)
patch_embeddings = tokens[:, 1:]  # [1, 196, 384]
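A common downstream use of such embeddings is comparing two faces by cosine similarity. The sketch below uses random vectors in place of real MeFEm-S outputs (the 384-dim CLS embeddings produced above); the similarity threshold and use case are illustrative assumptions, not part of the model release.

```python
import torch
import torch.nn.functional as F

# Random vectors stand in for MeFEm-S CLS embeddings of two face crops.
# In practice these would come from model(x) as shown above.
emb_a = torch.randn(1, 384)
emb_b = torch.randn(1, 384)

# Cosine similarity in [-1, 1]; higher means more similar embeddings.
sim = F.cosine_similarity(emb_a, emb_b, dim=-1)
print(sim.shape)  # torch.Size([1])
```

For retrieval over many faces, L2-normalising the embeddings once (`F.normalize(emb, dim=-1)`) lets similarity be computed as a plain matrix product.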
Face images from FaceCaption-15M, AVSpeech, and SHFQ datasets (~6.5M total). Images were cropped with expanded (2×) face bounding boxes.
Licensed under CC BY 4.0. Please cite the paper if you use this model:
@misc{borets2026mefemmedicalfaceembedding,
    title={MeFEm: Medical Face Embedding model},
    author={Yury Borets and Stepan Botman},
    year={2026},
    eprint={2602.14672},
    archivePrefix={arXiv},
    primaryClass={cs.CV},
    url={https://arxiv.org/abs/2602.14672},
}