Wujian Peng

wjpoom

2 42 22

https://wjpoom.github.io/

wjpoom

AI & ML interests

None yet

Recent Activity

updated a Space 21 days ago

ShareLab-SII/uniar

liked a Space 27 days ago

ShareLab-SII/uniar

updated a model about 1 month ago

ShareLab-SII/UniAR-SFT

View all activity

Organizations

updated a Space 21 days ago

UniAR

🎨

Unified AR model for image understanding & generation

liked a Space 27 days ago

UniAR

🎨

Unified AR model for image understanding & generation

updated 2 models about 1 month ago

ShareLab-SII/UniAR-SFT

Image-to-Text • 10B • Updated Jun 23 • 272 • 1

ShareLab-SII/UniAR-RL

Image-to-Text • 10B • Updated Jun 23 • 229 • 1

authored a paper about 1 month ago

Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification

Paper • 2606.18249 • Published Jun 16 • 14

upvoted a paper about 2 months ago

Unified Multimodal Autoregressive Modeling with Shared Context-Visual Tokenizer is Key to Unification

Paper • 2606.18249 • Published Jun 16 • 14

published 2 models about 2 months ago

ShareLab-SII/UniAR-RL

Image-to-Text • 10B • Updated Jun 23 • 229 • 1

ShareLab-SII/UniAR-SFT

Image-to-Text • 10B • Updated Jun 23 • 272 • 1

updated a collection about 2 months ago

UniAR

Collection

Model checkpoints for UniAR: Unified Multimodal Autoregressive Modeling with Shared Context—Visual Tokenizer is Key to Unification. • 2 items • Updated Jun 16

upvoted a paper about 2 months ago

ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations

Paper • 2606.11188 • Published Jun 9 • 27

upvoted a paper 2 months ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published May 28 • 146

upvoted 2 papers 3 months ago

World Action Models: The Next Frontier in Embodied AI

Paper • 2605.12090 • Published May 12 • 71

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published May 12 • 195

upvoted a paper 4 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 150

upvoted a paper 5 months ago

CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization

Paper • 2603.06449 • Published Mar 6 • 6

upvoted a paper 9 months ago

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

Paper • 2510.23763 • Published Oct 27, 2025 • 62

upvoted a paper 10 months ago

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published Oct 15, 2025 • 48

upvoted a paper 11 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28, 2025 • 91

updated a model about 1 year ago

wjpoom/SPEC-CLIP-ViT-B-32

Updated Jun 16, 2025 • 1

Wujian Peng

AI & ML interests

Recent Activity

Organizations

wjpoom's activity

UniAR

UniAR