Zili Wang

MarkWang

5 11 4

MarkXCloud

AI & ML interests

Multi-modality learning and inference acceleration

Recent Activity

upvoted a paper about 2 months ago

Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

Paper • 2606.12191 • Published Jun 10 • 70

authored a paper 2 months ago

Joint Training of Multi-Token Prediction in Reinforcement Learning via Optimal Coefficient Calibration

Paper • 2605.28184 • Published May 27 • 6

upvoted a paper 2 months ago

Joint Training of Multi-Token Prediction in Reinforcement Learning via Optimal Coefficient Calibration

Paper • 2605.28184 • Published May 27 • 6

submitted a paper to Daily Papers 2 months ago

Joint Training of Multi-Token Prediction in Reinforcement Learning via Optimal Coefficient Calibration

Paper • 2605.28184 • Published May 27 • 6

upvoted 2 papers 9 months ago

Taming Modality Entanglement in Continual Audio-Visual Segmentation

Paper • 2510.17234 • Published Oct 20, 2025 • 5

Knowledge-based Visual Question Answer with Multimodal Processing, Retrieval and Filtering

Paper • 2510.14605 • Published Oct 16, 2025 • 5

upvoted a paper 11 months ago

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28, 2025 • 111

upvoted a paper 12 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14, 2025 • 146

liked a model 12 months ago

YannQi/R-4B

Image-Text-to-Text • 5B • Updated Sep 4, 2025 • 227k • 183

upvoted a paper about 1 year ago

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28, 2025 • 47

authored a paper about 1 year ago

Faster and Better LLMs via Latency-Aware Test-Time Scaling

Paper • 2505.19634 • Published May 26, 2025

upvoted an article about 1 year ago

Vision Language Models (Better, faster, stronger)

Article • merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 614

authored a paper over 1 year ago

Continuous Speculative Decoding for Autoregressive Image Generation

Paper • 2411.11925 • Published Nov 18, 2024 • 16

upvoted a paper over 1 year ago

Continuous Speculative Decoding for Autoregressive Image Generation

Paper • 2411.11925 • Published Nov 18, 2024 • 16

commented on a paper over 1 year ago

Continuous Speculative Decoding for Autoregressive Image Generation

Paper • 2411.11925 • Published Nov 18, 2024 • 16

authored 3 papers almost 2 years ago

Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis

Paper • 2409.06135 • Published Sep 10, 2024 • 16

Layerwise Recurrent Router for Mixture-of-Experts

Paper • 2408.06793 • Published Aug 13, 2024 • 32

AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation

Paper • 2408.01708 • Published Aug 3, 2024 • 4

commented on a paper almost 2 years ago

AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation

Paper • 2408.01708 • Published Aug 3, 2024 • 4

authored a paper about 2 years ago

A Closer Look into Mixture-of-Experts in Large Language Models

Paper • 2406.18219 • Published Jun 26, 2024 • 17
