KVAE 2.0 Collection KVAE 2.0 is a family of video tokenizers with a time compression ratio of 4 and spatial compression ratios of 8 and 16 • 2 items • Updated 7 days ago • 2
Interpreting CLIP with Hierarchical Sparse Autoencoders Paper • 2502.20578 • Published Feb 27, 2025 • 1
SOM Directions are Better than One: Multi-Directional Refusal Suppression in Language Models Paper • 2511.08379 • Published Nov 11, 2025 • 4
Effective Reasoning Chains Reduce Intrinsic Dimensionality Paper • 2602.09276 • Published Feb 9 • 11
LLaDA2.1: Speeding Up Text Diffusion via Token Editing Paper • 2602.08676 • Published Feb 9 • 71
AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders Paper • 2602.05027 • Published Feb 4 • 63
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 230
Cross-Frame Representation Alignment for Fine-Tuning Video Diffusion Models Paper • 2506.09229 • Published Jun 10, 2025 • 7
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published Jan 6 • 177
Gamayun's Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM Paper • 2512.21580 • Published Dec 25, 2025 • 8
PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing Paper • 2303.10845 • Published Mar 20, 2023 • 3
Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story Paper • 2511.15210 • Published Nov 19, 2025 • 91
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published Mar 17, 2025 • 95
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published Mar 5, 2025 • 233
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20, 2025 • 175
Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients Paper • 2406.17660 • Published Jun 25, 2024 • 5
An Image is Worth 32 Tokens for Reconstruction and Generation Paper • 2406.07550 • Published Jun 11, 2024 • 60
Make It Count: Text-to-Image Generation with an Accurate Number of Objects Paper • 2406.10210 • Published Jun 14, 2024 • 78