Abstract
Hyperbolic Scene Graph (HSG) improves scene graph modeling by learning embeddings in hyperbolic space, enhancing hierarchical structure quality and retrieval performance through natural encoding of hierarchical relationships.
Scene graph representations enable structured visual understanding by modeling objects and their relationships, and have been widely used for multiview and 3D scene reasoning. Existing methods such as MSG learn scene graph embeddings in Euclidean space using contrastive learning and attention based association. However, Euclidean geometry does not explicitly capture hierarchical entailment relationships between places and objects, limiting the structural consistency of learned representations. To address this, we propose Hyperbolic Scene Graph (HSG), which learns scene graph embeddings in hyperbolic space where hierarchical relationships are naturally encoded through geometric distance. Our results show that HSG improves hierarchical structure quality while maintaining strong retrieval performance. The largest gains are observed in graph level metrics: HSG achieves a PP IoU of 33.17 and the highest Graph IoU of 33.51, outperforming the best AoMSG variant (25.37) by 8.14, highlighting the effectiveness of hyperbolic representation learning for scene graph modeling. Code: https://github.com/AIGeeksGroup/HSG.
Community
Although this is a relatively foundational work, we do open source it: https://github.com/AIGeeksGroup/HSG. IMO, scene graphs are a very important domain for spatial and embodied intelligence and should not be ignored.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Uncertainty-guided Compositional Alignment with Part-to-Whole Semantic Representativeness in Hyperbolic Vision-Language Models (2026)
- A Hyperbolic Perspective on Hierarchical Structure in Object-Centric Scene Representations (2026)
- ARGENT: Adaptive Hierarchical Image-Text Representations (2026)
- ReLaGS: Relational Language Gaussian Splatting (2026)
- Contrastive Language-Colored Pointmap Pretraining for Unified 3D Scene Understanding (2026)
- Riemannian and Symplectic Geometry for Hierarchical Text-Driven Place Recognition (2026)
- SGR3 Model: Scene Graph Retrieval-Reasoning Model in 3D (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Get this paper in your agent:
hf papers read 2604.17454 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper