Beichen Zhang

BeichenZhang

AI & ML interests

None yet

Recent Activity

liked a dataset 11 days ago

internlm/DL3DV-2k

Organizations

None yet

upvoted a paper about 9 hours ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 1 day ago • 36

upvoted a paper 2 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 9 days ago • 184

liked 3 datasets 11 days ago

liked a model 11 days ago

internlm/ETCHR-FLUX.2-klein-9B

Image-to-Image • Updated 25 days ago • 144 • 8

upvoted a paper 25 days ago

ETCHR: Editing To Clarify and Harness Reasoning

Paper • 2605.23897 • Published 28 days ago • 13

updated a dataset 25 days ago

BeichenZhang/ETCHR-SFT-400K

Viewer • Updated 24 days ago • 405k • 528 • 2

published a dataset 25 days ago

BeichenZhang/ETCHR-SFT-400K

Viewer • Updated 24 days ago • 405k • 528 • 2

liked a model 29 days ago

internlm/CapRL-Qwen3VL-4B

Image-Text-to-Text • 4B • Updated Apr 16 • 414 • 12

liked a dataset 3 months ago

internlm/WildClawBench

Benchmark • Updated May 15 • 11.9k • 62

upvoted 2 papers 3 months ago

Visual-ERM: Reward Modeling for Visual Equivalence

Paper • 2603.13224 • Published Mar 13 • 21

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models

Paper • 2603.12252 • Published Mar 12 • 12

upvoted 2 papers 4 months ago

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

Paper • 2602.12205 • Published Feb 13 • 83

Unified Personalized Reward Model for Vision Generation

Paper • 2602.02380 • Published Feb 2 • 20

upvoted 3 papers 7 months ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published Dec 4, 2025 • 50

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

Paper • 2512.03036 • Published Dec 2, 2025 • 22

Think Visually, Reason Textually: Vision-Language Synergy in ARC

Paper • 2511.15703 • Published Nov 19, 2025 • 9

commented on a paper 7 months ago

Think Visually, Reason Textually: Vision-Language Synergy in ARC

Paper • 2511.15703 • Published Nov 19, 2025 • 9

upvoted a paper 8 months ago

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31, 2025 • 31