Mishig Davaadorj

mishig

huggingface

AI & ML interests

NP-completeness, grammars, universality

Recent Activity

updated a dataset about 14 hours ago

hf-doc-build/doc-builder-embeddings-tracker

updated a bucket about 23 hours ago

huggingchat/papers-content

updated a Space 3 days ago

mishig/jepawiki

commented a paper 5 days ago

There Will Be a Scientific Theory of Deep Learning

Paper • 2604.21691 • Published 13 days ago • 1 •

New activity in deepseek-ai/DeepSeek-V4-Pro 12 days ago

Technical Report Summary

#129 opened 12 days ago by

New activity in mishig/traces-replay 15 days ago

share?

#1 opened 15 days ago by

commented 12 papers 27 days ago

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published 29 days ago • 119 •

Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning

Paper • 2604.05404 • Published 29 days ago • 42 •

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 30 days ago • 235 •

Learning to Retrieve from Agent Trajectories

Paper • 2604.04949 • Published Mar 30 • 70 •

Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision

Paper • 2604.04934 • Published 30 days ago • 46 •

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 30 days ago • 203 •

ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation

Paper • 2604.03922 • Published about 1 month ago • 53 •

Self-Execution Simulation Improves Coding Models

Paper • 2604.03253 • Published Mar 11 • 35 •

Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing

Paper • 2604.02288 • Published Apr 2 • 32 •

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 501 •

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

Paper • 2604.01591 • Published Apr 2 • 42 •

LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models

Paper • 2603.28301 • Published Mar 30 • 82 •

commented 2 papers 28 days ago

Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis

Paper • 2603.06507 • Published Mar 6 • 1 •

Test-Time Scaling Makes Overtraining Compute-Optimal

Paper • 2604.01411 • Published Apr 1 • 28 •

commented 2 papers 29 days ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 30 days ago • 203 •

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 184 •

commented a paper about 1 month ago

Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

Paper • 2505.22954 • Published May 29, 2025 • 15 •