MihailSlutsky

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 2 days ago

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Paper • 2605.00658 • Published 5 days ago • 77

From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills

Paper • 2604.24026 • Published 9 days ago • 16

upvoted a paper 4 days ago

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Paper • 2604.26752 • Published 7 days ago • 97

upvoted 2 papers 7 days ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published 9 days ago • 116

From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company

Paper • 2604.22446 • Published 12 days ago • 118

liked a model 7 days ago

google/siglip-so400m-patch14-384

Zero-Shot Image Classification • 0.9B • Updated Sep 26, 2024 • 2.15M • 673

liked 2 datasets 9 days ago

pickapic-anonymous/pickapic_v1

Viewer • Updated May 4, 2023 • 616k • 3.14k • 12

MizzenAI/HPDv3

Viewer • Updated Aug 26, 2025 • 1.15M • 990 • 25

liked a model 15 days ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 739k • 12.8k

liked a dataset 16 days ago

ymhao/HPDv2

Updated Feb 21, 2024 • 928 • 27

liked 2 datasets about 1 month ago

pcuenq/coco-2017-mirror

Preview • Updated Jul 4, 2025 • 395 • 2

phiyodr/coco2017

Viewer • Updated Mar 21, 2024 • 123k • 2.21k • 25

liked a Space about 2 months ago

VBench Leaderboard

📊

351

Submit video model evaluation results to a public benchmark

liked a dataset 3 months ago

longvideobench/LongVideoBench

Viewer • Updated Oct 14, 2024 • 6.68k • 29.6k • 40

upvoted a paper 3 months ago

Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities

Paper • 2601.21937 • Published Jan 29 • 19

liked 2 datasets 3 months ago

SpatialVID/SpatialVID

Viewer • Updated Mar 24 • 2.72M • 12.4k • 42

FelixYuan/SpatialVID-HQ

Viewer • Updated Mar 24 • 365k • 3.86k • 29

liked a model 3 months ago

robbyant/lingbot-world-base-cam

Image-to-Video • Updated Feb 2 • 331

liked a dataset 3 months ago

theairlabcmu/TartanGround

Updated Oct 14, 2025 • 17.8k • 3

upvoted a paper 4 months ago

Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

Paper • 2601.08955 • Published Jan 13 • 13
