Project-O1

community

AI & ML interests

None defined yet.

Recent Activity

Snyhlxde authored a paper 14 days ago

AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications

zsqzz authored a paper 19 days ago

Who Prices Cognitive Labor in the Age of Agents? Compute-Anchored Wages

zsqzz authored a paper 19 days ago

The Many Faces of On-Policy Distillation: Pitfalls, Mechanisms, and Fixes

View all activity

authored a paper 14 days ago

AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications

Paper • 2602.22769 • Published Feb 26 • 10

authored 2 papers 19 days ago

Who Prices Cognitive Labor in the Age of Agents? Compute-Anchored Wages

Paper • 2605.05558 • Published 25 days ago • 3

The Many Faces of On-Policy Distillation: Pitfalls, Mechanisms, and Fixes

Paper • 2605.11182 • Published 22 days ago • 5

submitted a paper to Daily Papers 19 days ago

The Many Faces of On-Policy Distillation: Pitfalls, Mechanisms, and Fixes

Paper • 2605.11182 • Published 22 days ago • 5

submitted a paper to Daily Papers 21 days ago

Who Prices Cognitive Labor in the Age of Agents? Compute-Anchored Wages

Paper • 2605.05558 • Published 25 days ago • 3

authored 3 papers 22 days ago

Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey

Paper • 2602.06052 • Published Jan 14 • 6

Probing the Knowledge Boundary: An Interactive Agentic Framework for Deep Knowledge Extraction

Paper • 2602.00959 • Published Feb 1

Agentic AI Systems Should Be Designed as Marginal Token Allocators

Paper • 2605.01214 • Published about 1 month ago • 4

submitted a paper to Daily Papers 27 days ago

Agentic AI Systems Should Be Designed as Marginal Token Allocators

Paper • 2605.01214 • Published about 1 month ago • 4

authored a paper 5 months ago

OpenTinker: Separating Concerns in Agentic Reinforcement Learning

Paper • 2601.07376 • Published Jan 12 • 7

submitted a paper to Daily Papers 5 months ago

OpenTinker: Separating Concerns in Agentic Reinforcement Learning

Paper • 2601.07376 • Published Jan 12 • 7

authored 2 papers 5 months ago

Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench

Paper • 2512.02942 • Published Dec 2, 2025 • 5

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Paper • 2512.14681 • Published Dec 16, 2025 • 43

submitted a paper to Daily Papers 6 months ago

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Paper • 2512.14681 • Published Dec 16, 2025 • 43

authored a paper 7 months ago

Multi-Agent Evolve: LLM Self-Improve through Co-evolution

Paper • 2510.23595 • Published Oct 27, 2025 • 13

authored 5 papers 7 months ago

Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs

Paper • 2310.16355 • Published Oct 25, 2023

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 43

Toward Inference-optimal Mixture-of-Expert Large Language Models

Paper • 2404.02852 • Published Apr 3, 2024

LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch

Paper • 2501.07124 • Published Jan 13, 2025

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17, 2025 • 50