1 22

Wenjun Wang

juezhi

wwjzhy

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring

submitted a paper 2 days ago

E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring

authored a paper 2 days ago

Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs

View all activity

Organizations

upvoted a paper 2 days ago

E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring

Paper • 2605.16882 • Published 6 days ago • 1

submitted a paper to Daily Papers 2 days ago

E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring

Paper • 2605.16882 • Published 6 days ago • 1

authored 7 papers 2 days ago

Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs

Paper • 2409.10994 • Published Sep 17, 2024 • 1

Unconstrained Model Merging for Enhanced LLM Reasoning

Paper • 2410.13699 • Published Oct 17, 2024 • 1

InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning

Paper • 2502.11573 • Published Feb 17, 2025 • 9

InfiR2: A Comprehensive FP8 Training Recipe for Reasoning-Enhanced Language Models

Paper • 2509.22536 • Published Sep 26, 2025 • 2

Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training

Paper • 2605.09608 • Published 12 days ago • 51

FeatCal: Feature Calibration for Post-Merging Models

Paper • 2605.13030 • Published 9 days ago • 7

E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring

Paper • 2605.16882 • Published 6 days ago • 1

upvoted a paper 5 days ago

FeatCal: Feature Calibration for Post-Merging Models

Paper • 2605.13030 • Published 9 days ago • 7

upvoted 2 papers 9 days ago

Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training

Paper • 2605.09608 • Published 12 days ago • 51

Model Merging Scaling Laws in Large Language Models

Paper • 2509.24244 • Published 11 days ago • 44

upvoted a collection 28 days ago

DeepSeek-V4

Collection

4 items • Updated 28 days ago • 651

upvoted 2 papers 4 months ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published Dec 18, 2025 • 222

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 328

upvoted a paper 5 months ago

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 27

published a dataset 5 months ago

juezhi/limo_multi_solution

Preview • Updated Dec 9, 2025 • 10

updated a dataset 5 months ago

juezhi/limo_multi_solution

Preview • Updated Dec 9, 2025 • 10

upvoted a paper 6 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 215

upvoted a paper 7 months ago

InfiR2: A Comprehensive FP8 Training Recipe for Reasoning-Enhanced Language Models

Paper • 2509.22536 • Published Sep 26, 2025 • 2

Wenjun Wang

AI & ML interests

Recent Activity

Organizations

juezhi's activity