Mental Image Directed Semantic Theory (MIDST) has proposed an omnisensory mental image model and its description language, Lmd. This language can provide multimedia expressions with intermediate semantic descriptions in predicate logic. This paper describes systematic and efficient computation guided by Lmd expressions and 3D map data, here called direct knowledge of space, in cross-media operations between linguistic and pictorial expressions, treated as spatial language understanding.