IXCLab@Shanghai AI Lab

community

https://github.com/OpenIXCLab

OpenIXCLab

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

yuhangzang authored a paper 5 days ago

CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning

yuhangzang authored a paper 5 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

myownskyW7 authored a paper 9 days ago

Visual-ERM: Reward Modeling for Visual Equivalence

View all activity

Papers

$\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models

View all Papers

yuhangzang

authored 2 papers 5 days ago

CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning

Paper • 2606.09393 • Published 17 days ago

Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

Paper • 2606.19338 • Published 8 days ago • 46

myownskyW7

authored 14 papers 9 days ago

SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction

Paper • 2605.20110 • Published May 19 • 4

DecQ: Detail-Condensing Queries for Enhanced Reconstruction and Generation in Representation Autoencoders

Paper • 2605.22777 • Published May 21 • 5

Channel-wise Vector Quantization

Paper • 2605.26089 • Published May 25 • 15

LoMo: Local Modality Substitution for Deeper Vision-Language Fusion

Paper • 2605.30265 • Published 28 days ago • 23

Not only where, But when: Temporal Scheduling for RLVR

Paper • 2605.25381 • Published May 25 • 6

AdaCodec: A Predictive Visual Code for Video MLLMs

Paper • 2606.02569 • Published 24 days ago • 5

CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning

Paper • 2606.09393 • Published 17 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 15 days ago • 200

myownskyW7

submitted a paper to Daily Papers 20 days ago

AdaCodec: A Predictive Visual Code for Video MLLMs

Paper • 2606.02569 • Published 24 days ago • 5

yuhangzang

authored a paper 20 days ago

OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs

Paper • 2606.03890 • Published 23 days ago • 31

rookiexiong

authored a paper 27 days ago

LoMo: Local Modality Substitution for Deeper Vision-Language Fusion

Paper • 2605.30265 • Published 28 days ago • 23

rookiexiong

submitted a paper to Daily Papers 27 days ago

LoMo: Local Modality Substitution for Deeper Vision-Language Fusion

Paper • 2605.30265 • Published 28 days ago • 23

AI & ML interests

Recent Activity

Papers

Team members 5

OpenIXCLab's activity