Pratyay Banerjee's picture

In a Training Loop 🔄

Pratyay Banerjee

Neilblaze

·

https://neilblaze.live

AI & ML interests

IR, NLP, Pattern Recognition, xAI, Interpretability, Evals

Recent Activity

upvoted a paper about 7 hours ago

OpenSkill: Open-World Self-Evolution for LLM Agents

upvoted a paper about 7 hours ago

Rethinking the Divergence Regularization in LLM RL

upvoted a paper about 7 hours ago

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

View all activity

Organizations

upvoted 7 papers about 7 hours ago

OpenSkill: Open-World Self-Evolution for LLM Agents

Paper • 2606.06741 • Published 8 days ago • 26

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 4 days ago • 28

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper • 2606.09730 • Published 4 days ago • 49

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published 7 days ago • 52

LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

Paper • 2606.06087 • Published 8 days ago • 59

KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks

Paper • 2606.03458 • Published 10 days ago • 60

Agents' Last Exam

Paper • 2606.05405 • Published 9 days ago • 261

upvoted an article 1 day ago

Article

Introducing North Mini Code: Cohere’s First Model For Developers

CohereLabs

•

2 days ago

• 54

upvoted a collection 3 days ago

Deepseek Papers

Deepseek papers collection • 31 items • Updated 4 days ago • 350

upvoted 2 collections 5 days ago

Laguna XS.2

Designed for agentic coding and long-horizon work on a local machine. Apache 2.0. • 5 items • Updated May 7 • 25

Gemma 4 QAT

Gemma 4 QAT (Quantization-Aware Training) for 3x less memory use and near original accuracy. • 16 items • Updated 5 days ago • 81

upvoted 3 collections 6 days ago

Gemma 4 QAT Q4_0

19 items • Updated 6 days ago • 111

Gemma 4 QAT Mobile

4 items • Updated 6 days ago • 31

Bonsai Image

6 items • Updated 7 days ago • 85

upvoted 6 papers 7 days ago

MemTrain: Self-Supervised Context Memory Training

Paper • 2606.03197 • Published 10 days ago • 17

Joint Agent Memory and Exploration Learning via Novelty Signals

Paper • 2606.01528 • Published 11 days ago • 15

Skill is Not One-Size-Fits-All: Model-Aware Skill Alignment for LLM Agents

Paper • 2605.30723 • Published 14 days ago • 16

Self-Distilled Policy Gradient

Paper • 2606.04036 • Published 10 days ago • 24

When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs

Paper • 2605.24202 • Published 21 days ago • 17

Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents

Paper • 2605.30621 • Published 15 days ago • 22