arxiv:2602.21461
Weiming Ren
wren93
AI & ML interests
Multimodal Understanding, Generative Modelling
Recent Activity
upvoted a paper 12 minutes ago
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
authored a paper 19 days ago
VecGlypher: Unified Vector Glyph Generation with Language Models
upvoted a paper 19 days ago
Watch Before You Answer: Learning from Visually Grounded Post-Training