Reconstructing texture-less surfaces poses unique challenges in computer vision, primarily due to the lack of specialized datasets that cater to the nuanced needs of depth and normals estimation in the absence of textural information. We introduce "Shape2.5D," a novel, large-scale dataset designed to address this gap. Comprising 1.17 million frames spanning over 39,772 3D models and 48 unique objects, our dataset provides depth and surface normal maps for texture-less object reconstruction. The proposed dataset includes synthetic images rendered with 3D modeling software to simulate various lighting conditions and viewing angles. It also includes a real-world subset comprising 4,672 frames captured with a depth camera. Our comprehensive benchmarks demonstrate the dataset's ability to support the development of algorithms that robustly estimate depth and normals from RGB images and perform voxel reconstruction. Our open-source data generation pipeline allows the dataset to be extended and adapted for future research.
The trained models that were used to evaluate the dataset to report benchmarking results are available for
download here (480 MB). You can download this archive and use the
eval.sh
bash script in the source code repository to reproduce the results reported in the paper
yourself.
Alternatively, if you would like to train the networks from scratch, you can use the train.py
script in the same repository.
Category | Objects | Num. Objects | Download |
---|---|---|---|
animals |
asian_dargon ,
bunny , cats ,
dragon , duck , pig
|
6 | Link (27 GB) |
clothing |
cape ,
dress ,
hoodie ,
jacket ,
shirt ,
suit ,
tracksuit ,
tshirt
|
8 | Link (37 GB) |
furniture |
armchair ,
bed ,
chair ,
rocking_chair ,
sofa ,
table
|
6 | Link (33 GB) |
misc |
diego ,
kettle ,
plants ,
skeleton ,
teapot
|
5 | Link (29 GB) |
statues |
armadillo ,
buddha ,
lucy ,
roman ,
thai_statue
|
5 | Link (21 GB) |
vehicles |
bicycle ,
car ,
jeep ,
ship ,
spaceship
|
5 | Link (18 GB) |
shapenet |
plane ,
bench ,
cabinet ,
car ,
chair
display
lamp
speaker
rifle
sofa
table
phone
watercraft
|
13 | Link (18 GB) |
real |
chair ,
tshirt ,
hoody ,
lamp ,
|
6 | Link (5 GB) |
These models were obtained from several sources in the public domain.
Models obtained from this repository include 5 Stanford models and 2 XYZ RGB models. These include
bunny
, dragon
, buddha
, armadillo
, lucy
,
asian_dargon
, and
thai
.
This repository was published by Keenan Crane of Carnegie Mellon University under the CC0 1.0 Universal (CC0 1.0) Public Domain
License. duck
,
pig
, skeleton
and diego
were obtained from here.
The teapot
is Martin Newell’s Utah Teapot, and the remaining 24
models
were all obtained for free from CGTrader with a Royalty Free
License.
A
complete list of sources for each individual model can be found here.
@article{khan2024shape,
author={Khan, Muhammad Saif Ullah and Sinha, Sankalp and Stricker, Didier and Liwicki, Marcus and Afzal, Muhammad Zeshan},
journal={IEEE Access},
title={Shape2.5D: A Dataset of Texture-less Surfaces for Depth and Normals Estimation},
year={2024},
volume={},
number={},
pages={1-1},
doi={10.1109/ACCESS.2024.3492703}
}