2 99 30

Xing Yun

xing0047

xing0047

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper 6 days ago

Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs

upvoted a paper 19 days ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

upvoted a paper about 2 months ago

ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models

View all activity

Organizations

upvoted a paper 6 days ago

Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs

Paper • 2605.00814 • Published 11 days ago • 21

upvoted a paper 19 days ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 20 days ago • 239

upvoted a paper about 2 months ago

ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models

Paper • 2603.19466 • Published Mar 19 • 41

published a model about 2 months ago

xing0047/etchat_dev

5B • Updated Apr 18, 2025 • 9

updated a model 3 months ago

xing0047/REF_20260223_201222

Updated Feb 25

published a model 3 months ago

xing0047/REF_20260223_201222

Updated Feb 25

upvoted a paper 3 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 523

upvoted 3 papers 4 months ago

SAMTok: Representing Any Mask with Two Words

Paper • 2601.16093 • Published Jan 22 • 43

HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding

Paper • 2601.14724 • Published Jan 21 • 75

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 204

updated a model 4 months ago

xing0047/transformers

Updated Jan 15

published a model 4 months ago

xing0047/transformers

Updated Jan 15

liked a model 4 months ago

xing0047/cca-llava-1.5-7b

Image-Text-to-Text • Updated Oct 28, 2024 • 37 • 4

liked a dataset 4 months ago

MLL-Lab/MindCube

Viewer • Updated Nov 20, 2025 • 4.28k • 471 • 8

upvoted 6 papers 4 months ago

NitroGen: An Open Foundation Model for Generalist Gaming Agents

Paper • 2601.02427 • Published Jan 4 • 46

Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model

Paper • 2510.12276 • Published Oct 14, 2025 • 149

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models

Paper • 2510.05034 • Published Oct 6, 2025 • 51

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 132

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 135

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 162

Xing Yun

AI & ML interests

Recent Activity

Organizations

xing0047's activity