Fast and Regularized Reconstruction of Building Façades From Street-View Images Using Binary Integer Programming

Hu, Han; Wang, L.; Zhang, M.; Ding, Yulin; Zhu, Qing

doi:10.5194/isprs-annals-v-2-2020-365-2020

Cited by 12 publications

(8 citation statements)

References 33 publications


“…In addition, a set of suitable window bounding boxes {B i } is marked as the same cluster, and a single component instance C k is created for a window cluster. Automatic clustering is possible, e.g., according to the geometry information Hu et al (2020); however, we find the accuracy of automatic clustering is not good enough, and interactive processing for this step is also quite efficient.…”

Section: Interactive Modeling System With Window Instances (mentioning)

confidence: 91%

“…Therefore, we assume the details of windows are visible and apparent in the texture image. The bounding boxes of windows can be detected using a standard two-stage object detection method (Ren et al, 2015) at very high confidence (Figure 2c), as in our previous work (Hu et al, 2020); more sophisticated approaches for façade parsing are also possible (Liu et al, 2020;Ma et al, 2020). In addition, we also allow interactive sketching of the window regions in cases of occlusion.…”

Section: Overview Of the Approach (mentioning)

confidence: 94%

“…Two typical strategies in image processing are widely adopted, e.g., object detection (Ren et al, 2015) and semantic segmentation (Long et al, 2015). For the former, the rectangular regions of the windows could be detected using a typical detector (Ren et al, 2015) and used for façade reconstruction (Hu et al, 2020;Hensel et al, 2019;Kong and Fan, 2020); however, there is still ample space to improve the localization accuracy, e.g., the intersection of union (IoU). For the latter, pixel-wise segmentation results can be learned end-to-end (Mathias et al, 2016;Gadde et al, 2018); fusing with point clouds (Gadde et al, 2018) and multi-view voting (Ma et al, 2020) can also be adopted to improve the segmentation results.…”

Section: Related Work (mentioning)

confidence: 99%
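The statement above measures localization accuracy by IoU, which is usually expanded as intersection over union. As a minimal sketch of that metric, assuming axis-aligned window bounding boxes given as `(x_min, y_min, x_max, y_max)`: the function name `iou` and the box layout are illustrative choices and are not taken from any of the cited papers.

```python
from typing import Tuple

# Hypothetical box layout for illustration: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Return the intersection over union of two axis-aligned boxes."""
    # Corners of the overlapping rectangle, if there is one.
    ix_min = max(a[0], b[0])
    iy_min = max(a[1], b[1])
    ix_max = min(a[2], b[2])
    iy_max = min(a[3], b[3])

    # Clamp to zero so disjoint boxes yield an empty intersection.
    inter_w = max(0.0, ix_max - ix_min)
    inter_h = max(0.0, iy_max - iy_min)
    inter = inter_w * inter_h

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter

    # Guard against degenerate (zero-area) inputs.
    return inter / union if union > 0 else 0.0
```

For two 2 by 2 boxes offset by one unit in each direction, the overlap has area 1 and the union has area 7, so `iou((0, 0, 2, 2), (1, 1, 3, 3))` evaluates to 1/7. A detected window box is typically counted as correct when its IoU with the ground-truth box exceeds a chosen threshold.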

“…A straightforward approach for regularization explicitly offsets adjacent points or boundaries in a least-squares optimization (Arikan et al, 2013;Zhu et al, 2020b;Xie et al, 2018). Because least-squares optimization cannot model the logic operators, such as sameColumn and sameWidth (Dehbi et al, 2017), clustering (Fan et al, 2021), learned Markov Logic Networks (Dehbi et al, 2017), or binary integer programming (Hu et al, 2020;Hensel et al, 2019;Monszpart et al, 2015) are exploited to snap the feature points on the façade.…”

Section: Related Work (mentioning)

confidence: 99%

“…With the advent of aerial oblique images and ground mobile mapping systems, city-scale photo-realistic 3D models with façade visibility are readily available (Verdie et al, 2015;Han et al, 2021). Detailed analysis and parsing of façade windows (Fan et al, 2021;Hu et al, 2020), which enriches the 3D models with semantic information conformal to the LOD-3 (Level-of-Details) protocol in CityGML (Gröger and Plümer, 2012), have recently raised extensive attention in the community. Although the LOD-3 models already contain computational information for various applications, the taxonomy and refined structure of windows are often ignored (Zhu et al, 2020a).…”

Section: Introduction (mentioning)

confidence: 99%


Semi-Supervised Adversarial Recognition of Refined Window Structures for Inverse Procedural Façade Modeling

Huang, Liang, Ding et al. 2022

Preprint


Deep learning methods are notoriously data-hungry, which requires a large number of labeled samples. Unfortunately, the large amount of interactive sample labeling efforts has dramatically hindered the application of deep learning methods, especially for 3D modeling tasks, which require heterogeneous samples. To alleviate the work of data annotation for learned 3D modeling of façades, this paper proposed a semi-supervised adversarial recognition strategy embedded in inverse procedural modeling. Beginning with textured LOD-2 (Level-of-Details) models, we use the classical convolutional neural networks to recognize the types and estimate the parameters of windows from image patches. The window types and parameters are then assembled into procedural grammar. A simple procedural engine is built inside an existing 3D modeling software, producing fine-grained window geometries. To obtain a useful model from a few labeled samples, we leverage the generative adversarial network to train the feature extractor in a semi-supervised manner. The adversarial training strategy can also exploit unlabeled data to make the training phase more stable. Experiments using publicly available façade image datasets reveal that the proposed training strategy can obtain about 10% improvement in classification accuracy and 50% improvement in parameter estimation under the same network structure. In addition, performance gains are more pronounced when testing against unseen data featuring different façade styles.

