Leveraging Geometry for Shape Estimation from a Single RGB Image
Authors
Langer, Florian
Budvytis, Ignas
Cipolla, Roberto
Publication Date
2021-11-10
Conference Name
32nd British Machine Vision Conference
Type
Conference Object
This Version
AM
Citation
Langer, F., Budvytis, I., & Cipolla, R. (2021). Leveraging Geometry for Shape Estimation from a Single RGB Image. 32nd British Machine Vision Conference. https://doi.org/10.17863/CAM.80293
Abstract
Predicting 3D shapes and poses of static objects from a single RGB image is
an important research area in modern computer vision. Its applications range
from augmented reality to robotics and digital content creation. Typically, this
task is performed by directly predicting object shape and pose, which is
inaccurate. A promising research direction ensures meaningful shape predictions
by retrieving CAD models from large scale databases and aligning them to the
objects observed in the image. However, existing work does not take the object
geometry into account, leading to inaccurate object pose predictions,
especially for unseen objects. In this work we demonstrate how cross-domain
keypoint matches from an RGB image to a rendered CAD model allow for more
precise object pose predictions compared to ones obtained through direct
predictions. We further show that keypoint matches can not only be used to
estimate the pose of an object, but also to modify the shape of the object
itself. This is important as the accuracy that can be achieved with object
retrieval alone is inherently limited by the available CAD models. Allowing
shape adaptation bridges the gap between the retrieved CAD model and the
observed shape. We demonstrate our approach on the challenging Pix3D dataset.
The proposed geometric shape prediction improves the AP mesh over the
state-of-the-art from 33.2 to 37.8 on seen objects and from 8.2 to 17.1 on
unseen objects. Furthermore, we demonstrate more accurate shape predictions
without closely matching CAD models when following the proposed shape
adaptation. Code is publicly available at
https://github.com/florianlanger/leveraging_geometry_for_shape_estimation .
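As a rough illustration of how keypoint matches between an image and a CAD model can determine an object pose, the minimal sketch below recovers a rotation and translation from 2D-3D matches with a Direct Linear Transform. It is not the authors' implementation (the repository linked above contains that). The function name `pose_from_matches`, the known intrinsics matrix `K`, and the synthetic data are assumptions made for illustration. This simple solver needs at least six non-coplanar matches and does not handle outliers.

```python
import numpy as np


def pose_from_matches(points_3d, pixels, K):
    """Estimate R, t with pixel ~ K (R X + t) from >= 6 non-coplanar matches.

    Hypothetical helper: a plain DLT solve, not the method used in the paper.
    """
    X = np.asarray(points_3d, dtype=float)
    uv = np.asarray(pixels, dtype=float)
    n = len(X)

    # Remove the camera intrinsics to get normalised image coordinates.
    uv1 = np.hstack([uv, np.ones((n, 1))])
    xn = (np.linalg.inv(K) @ uv1.T).T

    # Each match contributes two linear equations in the 12 entries of [R|t].
    Xh = np.hstack([X, np.ones((n, 1))])
    rows = []
    for (x, y, _), P in zip(xn, Xh):
        rows.append(np.concatenate([P, np.zeros(4), -x * P]))
        rows.append(np.concatenate([np.zeros(4), P, -y * P]))
    A = np.array(rows)

    # The null vector of A is [R|t] up to an unknown scale and sign.
    _, _, Vt = np.linalg.svd(A)
    M = Vt[-1].reshape(3, 4)

    # Project the left 3x3 block onto the nearest orthogonal matrix.
    U, S, Vt_r = np.linalg.svd(M[:, :3])
    R = U @ Vt_r
    t = M[:, 3] / S.mean()

    # Resolve the sign ambiguity: a proper rotation has determinant +1.
    if np.linalg.det(R) < 0:
        R, t = -R, -t
    return R, t


# Synthetic check: project known 3D keypoints, then recover the pose.
rng = np.random.default_rng(0)
model_points = rng.uniform(-1.0, 1.0, size=(8, 3))
angle = 0.3
R_true = np.array([
    [np.cos(angle), 0.0, np.sin(angle)],
    [0.0, 1.0, 0.0],
    [-np.sin(angle), 0.0, np.cos(angle)],
])
t_true = np.array([0.1, -0.2, 4.0])
K = np.array([[500.0, 0.0, 320.0], [0.0, 500.0, 240.0], [0.0, 0.0, 1.0]])

cam = model_points @ R_true.T + t_true
proj = (K @ cam.T).T
pixels = proj[:, :2] / proj[:, 2:]

R_est, t_est = pose_from_matches(model_points, pixels, K)
print(np.abs(R_est - R_true).max(), np.abs(t_est - t_true).max())
```

In practice, matches between a photograph and a rendered CAD model are noisy and contain outliers, so a robust estimator (for example, a RANSAC loop around a minimal solver) would normally replace this plain least-squares solve.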
Keywords
cs.CV
Identifiers
External DOI: https://doi.org/10.17863/CAM.80293
This record's URL: https://www.repository.cam.ac.uk/handle/1810/332862