Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/122927
Full metadata record
dc.contributor.author: Nekrasov, V.
dc.contributor.author: Dharmasiri, T.
dc.contributor.author: Spek, A.
dc.contributor.author: Drummond, T.
dc.contributor.author: Shen, C.
dc.contributor.author: Reid, I.
dc.contributor.editor: Howard, A.
dc.contributor.editor: Althoefer, K.
dc.contributor.editor: Arai, F.
dc.contributor.editor: Arrichiello, F.
dc.contributor.editor: Caputo, B.
dc.contributor.editor: Castellanos, J.
dc.contributor.editor: Hauser, K.
dc.contributor.editor: Isler, V.
dc.contributor.editor: Kim, J.
dc.contributor.editor: Liu, H.
dc.contributor.editor: Oh, P.
dc.contributor.editor: Santos, V.
dc.contributor.editor: Scaramuzza, D.
dc.contributor.editor: Ude, A.
dc.contributor.editor: Voyles, R.
dc.contributor.editor: Yamane, K.
dc.contributor.editor: Okamura, A.
dc.date.issued: 2019
dc.identifier.citation: IEEE International Conference on Robotics and Automation, 2019 / Howard, A., Althoefer, K., Arai, F., Arrichiello, F., Caputo, B., Castellanos, J., Hauser, K., Isler, V., Kim, J., Liu, H., Oh, P., Santos, V., Scaramuzza, D., Ude, A., Voyles, R., Yamane, K., Okamura, A. (ed./s), vol. 2019-May, pp. 7101-7107
dc.identifier.isbn: 153866027X
dc.identifier.isbn: 9781538660270
dc.identifier.issn: 1050-4729
dc.identifier.issn: 2577-087X
dc.identifier.uri: http://hdl.handle.net/2440/122927
dc.description.abstract: Deployment of deep learning models in robotics as sensory information extractors can be a daunting task to handle, even using generic GPU cards. Here, we address three of its most prominent hurdles, namely, i) the adaptation of a single model to perform multiple tasks at once (in this work, we consider depth estimation and semantic segmentation crucial for acquiring geometric and semantic understanding of the scene), while ii) doing it in real-time, and iii) using asymmetric datasets with uneven numbers of annotations per each modality. To overcome the first two issues, we adapt a recently proposed real-time semantic segmentation network, making changes to further reduce the number of floating point operations. To approach the third issue, we embrace a simple solution based on hard knowledge distillation under the assumption of having access to a powerful 'teacher' network. We showcase how our system can be easily extended to handle more tasks, and more datasets, all at once, performing depth estimation and segmentation both indoors and outdoors with a single model. Quantitatively, we achieve results equivalent to (or better than) current state-of-the-art approaches with one forward pass costing just 13ms and 6.5 GFLOPs on 640×480 inputs. This efficiency allows us to directly incorporate the raw predictions of our network into the SemanticFusion framework [1] for dense 3D semantic reconstruction of the scene.
dc.description.statementofresponsibility: Vladimir Nekrasov, Thanuja Dharmasiri, Andrew Spek, Tom Drummond, Chunhua Shen and Ian Reid
dc.language.iso: en
dc.publisher: IEEE
dc.relation.ispartofseries: IEEE International Conference on Robotics and Automation ICRA
dc.rights: ©2019 IEEE
dc.source.uri: https://ieeexplore.ieee.org/xpl/conhome/8780387/proceeding
dc.title: Real-time joint semantic segmentation and depth estimation using asymmetric annotations
dc.type: Conference paper
dc.contributor.conference: IEEE International Conference on Robotics and Automation (ICRA) (20 May 2019 - 24 May 2019 : Montreal, Canada)
dc.identifier.doi: 10.1109/ICRA.2019.8794220
dc.relation.grant: http://purl.org/au-research/grants/arc/CE140100016
dc.relation.grant: http://purl.org/au-research/grants/arc/FL130100102
pubs.publication-status: Published
dc.identifier.orcid: Nekrasov, V. [0000-0001-9653-7539]
dc.identifier.orcid: Reid, I. [0000-0001-7790-6423]
Appears in Collections: Aurora harvest 8; Computer Science publications
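The abstract's third hurdle, training on asymmetric datasets, is handled via hard knowledge distillation: where a modality lacks ground-truth annotations, the teacher network's predictions stand in as training targets. The paper does not publish this routine here, so the following is a minimal illustrative sketch (function and variable names are assumptions), taking missing depth annotations to be marked as NaN:

```python
import numpy as np

def hard_distillation_targets(gt, teacher_pred):
    """Build training targets for an asymmetrically annotated modality.

    Wherever the ground-truth map `gt` has a missing annotation (NaN),
    substitute the teacher network's prediction as a 'hard' target;
    annotated pixels keep their ground-truth values.
    """
    return np.where(np.isnan(gt), teacher_pred, gt)

# Toy 2x2 depth map with two missing annotations.
gt = np.array([[1.0, np.nan],
               [np.nan, 2.0]])
teacher = np.array([[1.1, 0.9],
                    [1.8, 2.2]])
targets = hard_distillation_targets(gt, teacher)
# -> [[1.0, 0.9], [1.8, 2.0]]
```

The student network is then trained against `targets` with an ordinary per-pixel loss, so batches mixing annotated and unannotated samples need no special handling.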


