(1) Background: The smart city management system, with GIS technology as its core, is based on realistic visualization of multiple types of 3D model data syntheses. However, the efficiency barriers to achieving smooth and continuous visualization from outdoor scenes to small indoor scenes remain a challenge. (2) Methods: This paper uses the visibility prediction method to obtain potential visual sets at three levels—outdoor, indoor and outdoor connection, and indoor—and constructs an R-tree spatial index structure for organizing potential visual sets. By integrating these potential visible sets with spatial indexes, scene visualization can be carried out effectively. (3) Results: A near-reality indoor and outdoor scene was used for experimentation, resulting in stable 10% fluctuation visual frame rates around 90 FPS. (4) Conclusions: Spatial indexing methods that combine potential visible sets can effectively solve the continuity and stability problem of indoor and outdoor scene visualization in smart city management systems.