We formulate the estimation of dense depth maps from video sequences as a problem of intrinsic image estimation. Our approach synergistically integrates the estimation of multiple intrinsic images including depth, albedo, shading, optical flow, and surface contours. We build upon an example-based framework for depth estimation that uses label transfer from a database of RGB and depth pairs. We combine this with a method that extracts consistent albedo and shading from video. In contrast to raw RGB values, albedo and shading provide a richer, more physical, foundation for depth transfer. Additionally we train a new contour detector to predict surface boundaries from albedo, shading, and pixel values and use this to improve the estimation of depth boundaries. We also integrate sparse structure from motion with our method to improve the metric accuracy of the estimated depth maps. We evaluate our Intrinsic Depth method quantitatively by estimating depth from videos in the NYU RGB-D and SUN3D datasets. We find that combining the estimation of multiple intrinsic images improves depth estimation relative to the baseline method.
Here we publish our database and testing examples.
Please click the link text in order to download.
Data | Size | Descriptions |
Database | 328265M (MD5) | Our database plus an extra sequence. Note that we take a subset of the raw data. |
Non-rigid examples (full sequences) |
9195M (MD5) | Full sequences that contain non-rigid examples. These are a subset of our data above. |
Non-rigid examples (clips only) |
4076M (MD5) | Data for the non-rigid examples. See above for full sequences containing these clips. |
SUN3D examples | 11700M (MD5) | Data for the examples from the SUN3D dataset |
Training and testing lists | 23K (MD5) | Text files with names of the clips in the training and testing sets |
Outdoor examples | 2885M (MD5) | Data for the outdoor examples in the video |
Here we publish the results and evaluation code.
Please click the link text in order to download.
Data | Size | Descriptions |
Results | 56928M (MD5) | Results from Intrinsic Depth, Depth Transfer, and the fully-metric method for all examples |
Results for non-rigid examples |
8380M (MD5) | Results only for the non-rigid examples |
Evaluation code | 12K (MD5) | Evaluation code for the results above |
Coming soon (planned to come after a release of Intrinsic Video code)