“…Since fuzzy rule-based systems simulate human reasoning, it is natural to use them in scene description. From the spatial relationship neural networks, five spatial relation membership values (LEFT, ABOVE, RIGHT, BELOW, and SURROUND) between two individual objects were obtained by the method described in [16,17]. However, when a human being describes a scene, he/she will describe it using natural language instead of providing strict numeric values.…”