Structured Distillation of Web Agent Capabilities Enables Generalization Paper • 2604.07776 • Published 25 days ago • 21
LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published Mar 11 • 44
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 107
FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents Paper • 2510.03204 • Published Oct 3, 2025 • 7
LineRetriever: Planning-Aware Observation Reduction for Web Agents Paper • 2507.00210 • Published Jun 30, 2025 • 6
Maintaining MTEB: Towards Long Term Usability and Reproducibility of Embedding Benchmarks Paper • 2506.21182 • Published Jun 26, 2025 • 2
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories Paper • 2504.08942 • Published Apr 11, 2025 • 28
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning Paper • 2504.07128 • Published Apr 2, 2025 • 87
SafeArena: Evaluating the Safety of Autonomous Web Agents Paper • 2503.04957 • Published Mar 6, 2025 • 21
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models Paper • 2502.14802 • Published Feb 20, 2025 • 13
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 48
BM25S: Orders of magnitude faster lexical search via eager sparse scoring Paper • 2407.03618 • Published Jul 4, 2024 • 14