8 31

Zachary Bessinger

zbessinger

https://www.zachbessinger.com

AI & ML interests

Multimodal Computer Vision

Recent Activity

upvoted a paper 3 days ago

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

upvoted a paper 3 days ago

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

upvoted a paper 3 days ago

Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

View all activity

Organizations

None yet

upvoted 6 papers 3 days ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published 9 days ago • 104

Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

Paper • 2603.03143 • Published 13 days ago • 137

Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Paper • 2603.12255 • Published 3 days ago • 76

liked a model 2 months ago

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • Updated Nov 26, 2025 • 2.55M • • 551

liked a Space 3 months ago

Vision Arena (Testing VLMs side-by-side)

🖼

560

Analyze images with multiple vision models for labels and boxes

upvoted a paper 4 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

liked 3 Spaces 4 months ago

Open VLM Leaderboard

🌎

VLMEvalKit Evaluation Results Collection

Transformers Timeline

🤗

Interactive timeline to explore the 🤗Transformers models

The Smol Training Playbook

📚

3.05k

The secrets to building world-class LLMs

liked a model 5 months ago

zai-org/GLM-4.6-FP8

Text Generation • Updated Oct 16, 2025 • 20k • • 98

liked 2 models 6 months ago

merve/smol-vision

Image-Text-to-Text • Updated Nov 5, 2025 • 192

kudzueye/boreal-qwen-image

Text-to-Image • Updated Sep 5, 2025 • 5.83k • • 125

upvoted a collection 9 months ago

Qwen2.5-Omni

Collection

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 6 items • Updated 14 days ago • 164

liked a model 9 months ago

TIGER-Lab/VLM2Vec-Qwen2VL-7B

Image-Text-to-Text • Updated May 3, 2025 • 1.87k • 10

liked a Space 9 months ago

MMEB Leaderboard

📊

103

The massive multimodal embedding benchmark

liked a model 9 months ago

DeepGlint-AI/UniME-LLaVA-OneVision-7B

Image-Text-to-Text • 8B • Updated May 7, 2025 • 91 • 3

liked a model about 1 year ago

ByteDance/Sa2VA-8B

Image-Text-to-Text • Updated Sep 8, 2025 • 872 • 65

Zachary Bessinger

AI & ML interests

Recent Activity

Organizations

zbessinger's activity

Vision Arena (Testing VLMs side-by-side)

Open VLM Leaderboard

Transformers Timeline

The Smol Training Playbook

MMEB Leaderboard