-
CyberSecEvalTest
📈71Evaluate LLMs' cybersecurity risks and capabilities
-
meta-llama/Llama-Guard-3-8B
Text Generation • 8B • Updated • 181k • • 277 -
meta-llama/Prompt-Guard-86M
Text Classification • Updated • 29.8k • 318 -
protectai/deberta-v3-base-prompt-injection-v2
Text Classification • 0.2B • Updated • 330k • • 99
Shyam Sunder Kumar
theainerd
AI & ML interests
Natural Language Processing
Recent Activity
upvoted a paper about 9 hours ago
Fish Audio S2 Technical Report liked
a dataset about 9 hours ago
TuringEnterprises/Open-RL liked
a dataset about 9 hours ago
HuggingFaceFW/finephrase Organizations
Agents
-
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 95 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 102 -
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper • 2501.11425 • Published • 109 -
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26
Large Language Models Utils
Utils useful for LLM
- Running106
Predict Memory
🧮106Calculate and visualize model memory usage from config
- Running on CPU UpgradeFeatured1.01k
Model Memory Utility
🚀1.01kCalculate VRAM needed to train and run Hugging Face models
- Running78
Transformers Timeline
🤗78Interactive timeline to explore the 🤗Transformers models
- Running on CPU UpgradeFeatured3.04k
The Smol Training Playbook
📚3.04kThe secrets to building world-class LLMs
Reasoning
-
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 94 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
Evolving Deeper LLM Thinking
Paper • 2501.09891 • Published • 115 -
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Paper • 2501.12599 • Published • 126
Safety & Security
- Running71
CyberSecEvalTest
📈71Evaluate LLMs' cybersecurity risks and capabilities
-
meta-llama/Llama-Guard-3-8B
Text Generation • 8B • Updated • 181k • • 277 -
meta-llama/Prompt-Guard-86M
Text Classification • Updated • 29.8k • 318 -
protectai/deberta-v3-base-prompt-injection-v2
Text Classification • 0.2B • Updated • 330k • • 99
Large Language Models Utils
Utils useful for LLM
- Running106
Predict Memory
🧮106Calculate and visualize model memory usage from config
- Running on CPU UpgradeFeatured1.01k
Model Memory Utility
🚀1.01kCalculate VRAM needed to train and run Hugging Face models
- Running78
Transformers Timeline
🤗78Interactive timeline to explore the 🤗Transformers models
- Running on CPU UpgradeFeatured3.04k
The Smol Training Playbook
📚3.04kThe secrets to building world-class LLMs
Agents
-
Agent Laboratory: Using LLM Agents as Research Assistants
Paper • 2501.04227 • Published • 95 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 102 -
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper • 2501.11425 • Published • 109 -
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper • 2501.10893 • Published • 26
Reasoning
-
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 94 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
Evolving Deeper LLM Thinking
Paper • 2501.09891 • Published • 115 -
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Paper • 2501.12599 • Published • 126