CV Depth Anything with Any Prior Paper ⢠2505.10565 ⢠Published May 15, 2025 ⢠13 ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking Paper ⢠2505.08581 ⢠Published May 13, 2025 ⢠9
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking Paper ⢠2505.08581 ⢠Published May 13, 2025 ⢠9
VLMs PointArena: Probing Multimodal Grounding Through Language-Guided Pointing Paper ⢠2505.09990 ⢠Published May 15, 2025 ⢠12 Style Customization of Text-to-Vector Generation with Image Diffusion Priors Paper ⢠2505.10558 ⢠Published May 15, 2025 ⢠16 Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis Paper ⢠2505.10046 ⢠Published May 15, 2025 ⢠9 X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real Paper ⢠2505.07096 ⢠Published May 11, 2025 ⢠5
PointArena: Probing Multimodal Grounding Through Language-Guided Pointing Paper ⢠2505.09990 ⢠Published May 15, 2025 ⢠12
Style Customization of Text-to-Vector Generation with Image Diffusion Priors Paper ⢠2505.10558 ⢠Published May 15, 2025 ⢠16
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis Paper ⢠2505.10046 ⢠Published May 15, 2025 ⢠9
X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real Paper ⢠2505.07096 ⢠Published May 11, 2025 ⢠5
CV Depth Anything with Any Prior Paper ⢠2505.10565 ⢠Published May 15, 2025 ⢠13 ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking Paper ⢠2505.08581 ⢠Published May 13, 2025 ⢠9
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking Paper ⢠2505.08581 ⢠Published May 13, 2025 ⢠9
VLMs PointArena: Probing Multimodal Grounding Through Language-Guided Pointing Paper ⢠2505.09990 ⢠Published May 15, 2025 ⢠12 Style Customization of Text-to-Vector Generation with Image Diffusion Priors Paper ⢠2505.10558 ⢠Published May 15, 2025 ⢠16 Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis Paper ⢠2505.10046 ⢠Published May 15, 2025 ⢠9 X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real Paper ⢠2505.07096 ⢠Published May 11, 2025 ⢠5
PointArena: Probing Multimodal Grounding Through Language-Guided Pointing Paper ⢠2505.09990 ⢠Published May 15, 2025 ⢠12
Style Customization of Text-to-Vector Generation with Image Diffusion Priors Paper ⢠2505.10558 ⢠Published May 15, 2025 ⢠16
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis Paper ⢠2505.10046 ⢠Published May 15, 2025 ⢠9
X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real Paper ⢠2505.07096 ⢠Published May 11, 2025 ⢠5