16 22

Mayurkumar Surani

Mayurkumar

AI & ML interests

Machine Learning , AI, NLP, Computer Vision, Reinforcement Learning

Recent Activity

liked a model 14 days ago

MiniMaxAI/MiniMax-M2.7

liked a model 3 months ago

zai-org/GLM-OCR

liked a model 3 months ago

unsloth/Kimi-K2.5-GGUF

View all activity

Organizations

None yet

liked a model 14 days ago

MiniMaxAI/MiniMax-M2.7

Text Generation • 229B • Updated 9 days ago • 496k • • 1.08k

liked 2 models 3 months ago

zai-org/GLM-OCR

Image-to-Text • Updated 14 days ago • 8.04M • • 1.67k

unsloth/Kimi-K2.5-GGUF

1T • Updated Jan 28 • 120k • 257

liked a Space 3 months ago

The Smol Training Playbook

📚

3.13k

The secrets to building world-class LLMs

liked a model 5 months ago

tencent/HunyuanOCR

Image-Text-to-Text • 1.0B • Updated Jan 13 • 184k • 747

liked a model 6 months ago

inclusionAI/Ling-1T

Text Generation • 1000B • Updated 16 days ago • 919 • • 539

liked 3 models 8 months ago

google/embeddinggemma-300m

TheBloke/Open_Gpt4_8x7B_v0.2-GGUF

47B • Updated Jan 12, 2024 • 1.76k • 22

tencent/HunyuanVideo-Foley

Text-to-Audio • Updated Sep 29, 2025 • 333 • 163

liked a model 9 months ago

moonshotai/Kimi-K2-Instruct

Text Generation • 1T • Updated 6 days ago • 409k • • 2.36k

upvoted an article 11 months ago

Article

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

May 23, 2025

•

172

liked a model 12 months ago

ostris/Flex.2-preview

Text-to-Image • Updated Apr 25, 2025 • 518 • 390

liked a model about 1 year ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1, 2025 • 69.7k • 3.59k

upvoted a collection about 1 year ago

Qwen2.5-1M

Collection

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Dec 31, 2025 • 127

liked a Space about 1 year ago

Chat With Janus-Pro-7B

🌍

2.02k

A unified multimodal understanding and generation model.

upvoted a paper over 1 year ago

Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5, 2024 • 92

upvoted a paper almost 2 years ago

Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models

Paper • 2407.12327 • Published Jul 17, 2024 • 79

liked a model almost 2 years ago

Qwen/Qwen2-72B-Instruct

Text Generation • 73B • Updated Oct 8, 2024 • 71.8k • • 718

updated a collection almost 2 years ago

LLM

Collection

3 items • Updated May 26, 2024