2D-3D-Semantic Data for Indoor Scene Understanding

Alexander Sax*, Iro Armeni*, Amir Zamir, Jitendra Malik, Leonidas Guibas, Silvio Savarese

June 2017

PDF Code Project Project Site

Abstract

We present a dataset of large-scale indoor spaces that provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annotations. The dataset covers over 6,000 m2 and contains over 70,000 RGB images, along with the corresponding depths, surface normals, semantic annotations, global XYZ images (all in forms of both regular and 360◦ equirectangular images) as well as camera information. It also includes registered raw and semantically annotated 3D meshes and point clouds. The dataset enables development of joint and cross-modal learning models and potentially unsupervised approaches utilizing the regularities present in large-scale indoor spaces.

Type

Report

The dataset is available using the links above. The code link above provides some utilities for interacting with the dataset in both Python and C++.

Robustness

2D-3D-Semantic Data for Indoor Scene Understanding

Abstract

Alexander Sax*

Related