Harris Zhang

HanSolo9682

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Your Embedding Model is SMARTer Than You Think

upvoted 2 papers about 2 months ago

MAOAM: Unified Object and Material Selection with Vision-Language Models

Paper • 2606.04880 • Published Jun 2 • 10

Personal AI Agent for Camera Roll VQA

Paper • 2606.05275 • Published Jun 3 • 20

upvoted a collection about 2 months ago

SMART

Your Single-Vector Embedding Model is SMARTer Than You Think • 5 items • Updated May 26 • 2

upvoted 2 papers 2 months ago

Your Embedding Model is SMARTer Than You Think

Paper • 2605.24938 • Published May 24 • 25

From Plans to Pixels: Learning to Plan and Orchestrate for Open-Ended Image Editing

Paper • 2605.15181 • Published May 14 • 12

upvoted a paper 3 months ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Paper • 2604.13151 • Published Apr 14 • 25

upvoted a paper 4 months ago

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

Paper • 2603.18004 • Published Mar 18 • 14

upvoted a paper 9 months ago

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

Paper • 2511.03774 • Published Nov 5, 2025 • 13

upvoted a paper almost 2 years ago

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3, 2024 • 7