1 38 10

Yan Varakin

ZDPLI

https://www.researchgate.net/profile/Yan-Varakin

ZDPLI

AI & ML interests

All areas of NLP, computational mathematics, reinforcement learning, robotics.

Recent Activity

upvoted an article 18 days ago

Mimicking Consciousness in LLMs: Ascending the Dimensions of Thought with Recurrent Processing

upvoted a paper about 1 month ago

3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model

upvoted a paper about 1 month ago

Demystifing Video Reasoning

View all activity

Organizations

upvoted an article 18 days ago

Article

Mimicking Consciousness in LLMs: Ascending the Dimensions of Thought with Recurrent Processing

Feb 20, 2025

•

upvoted 3 papers about 1 month ago

upvoted a paper 3 months ago

Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization

Paper • 2601.21358 • Published Jan 29 • 7

upvoted an article 9 months ago

Article

Activation Steering: A New Frontier in AI Control—But Does It Scale?

Feb 2, 2025

•

upvoted an article 10 months ago

Article

Gemma 3n fully available in the open-source ecosystem!

Jun 26, 2025

•

121

liked a Space 10 months ago

Lingshu 7B

🩻

Chat with Lingshu 7B, a multimodal medical model

updated a Space 11 months ago

SkinLesionClassifierHAM10K

📈

Diagnose skin conditions from images

upvoted 2 articles 12 months ago

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Apr 5, 2023

•

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

•

upvoted a paper 12 months ago

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30, 2025 • 55

upvoted an article 12 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

291

upvoted 4 papers 12 months ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1, 2025 • 36

LLMs for Engineering: Teaching Models to Design High Powered Rockets

Paper • 2504.19394 • Published Apr 27, 2025 • 13

AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization

Paper • 2504.21659 • Published Apr 30, 2025 • 14

Llama-Nemotron: Efficient Reasoning Models

Paper • 2505.00949 • Published May 2, 2025 • 43

published a Space 12 months ago

SkinLesionClassifierHAM10K

📈

Diagnose skin conditions from images

upvoted a paper 12 months ago

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29, 2025 • 95

updated a Space 12 months ago

DermaScanBeta

🌍

Updated version of DermaScan system

Yan Varakin

AI & ML interests

Recent Activity

Organizations

ZDPLI's activity

Mimicking Consciousness in LLMs: Ascending the Dimensions of Thought with Recurrent Processing

Activation Steering: A New Frontier in AI Control—But Does It Scale?

Gemma 3n fully available in the open-source ecosystem!

Lingshu 7B

SkinLesionClassifierHAM10K

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Fine-tune Llama 2 with DPO

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

SkinLesionClassifierHAM10K

DermaScanBeta