Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

VISIONx @ NYU

university
https://www.sainingxie.com/
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

georgysavva  updated a collection about 9 hours ago
Solaris-Data
georgysavva  updated a collection about 9 hours ago
Solaris-Data
bytetriper  new activity about 11 hours ago
nyu-visionx/RAE-dinov2-wReg-base-ViTXL-n08:Update config for diffusers AutoencoderRAE refactor
View all activity

Papers

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding

View all Papers

Ellis Brown's profile picture Peter Tong's profile picture Manoj Middepogu's profile picture Sai Charitha Akula's profile picture Penghao Wu's profile picture Jihan Yang's profile picture Saining Xie's profile picture Bingda Tang's profile picture BoYang Zheng's profile picture Sayak Paul's profile picture Shusheng Yang's profile picture Chenyu, Li's profile picture Anjali W Gupta's profile picture Xichen Pan's profile picture Pinzhi Huang's profile picture Nanye Ma's profile picture Jaskirat Singh's profile picture Ziteng Wang's profile picture Junwan Kim's profile picture Georgy Savva's profile picture
nyu-visionx 's Papers 4
Submitted by
BoYang Zheng
52

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

nyu-visionx VISIONx @ NYU
208 2
Submitted by
Ellis Brown
5

SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding

nyu-visionx VISIONx @ NYU
9 2
Submitted by
Jihan Yang
8

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts

nyu-visionx VISIONx @ NYU
2
Submitted by
Peter Tong
166

Diffusion Transformers with Representation Autoencoders

nyu-visionx VISIONx @ NYU
1.77k 6
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs