Pyramid Scene Parsing Network in 3D: improving semantic segmentation of point clouds with multi-scale contextual information

Fang, Hao; Lafarge, Florent

Published 2019 | Version v1

Journal article Metadata-only

Pyramid Scene Parsing Network in 3D: improving semantic segmentation of point clouds with multi-scale contextual information

Contributors

Others:

Geometric Modeling of 3D Environments (TITANE) ; Inria Sophia Antipolis - Méditerranée (CRISAM) ; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)
The authors thank CSTB for financial support and Sven Oesau for technical discussions.

Analyzing and extracting geometric features from 3D data is a fundamental step in 3D scene understanding. Recent works demonstrated that deep learning archi-tectures can operate directly on raw point clouds, i.e. without the use of intermediate grid-like structures. These architectures are however not designed to encode contextual information in-between objects efficiently. Inspired by a global feature aggregation algorithm designed for images, we propose a 3D pyramid module to enrich pointwise features with multi-scale contextual information. Our module can be easily coupled with 3D semantic segmantation methods operating on 3D point clouds. We evaluated our method on three large scale datasets with four baseline models. Experimental results show that the use of enriched features brings significant improvements to the semantic segmentation of indoor and outdoor scenes.

Abstract

International audience

Additional details

URL: https://hal.inria.fr/hal-02159279
URN: urn:oai:HAL:hal-02159279v1

Origin repository: UNICA

	All versions	This version
Views	6	6
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Pyramid Scene Parsing Network in 3D: improving semantic segmentation of point clouds with multi-scale contextual information

Creators

Contributors

Others:

Description

Abstract

Additional details

Identifiers

Origin repository