Fig. 1. Comparison of the ability of a simulated environment and a real dozer to grade an area studded with piles. Top row: RGB images of our experimental setup (see Section III-C) showing the scaled dozer facing the sand piles. Middle row: Representative height-maps of states in the real environment. Depth observations were projected onto world coordinates using orthographic projections [4]. Bottom row: Similar states observed in the simulated environment. In the middle and bottom rows, the right column is the initial state space, and all the others, states after actions were taken in the simulated and real environments. This figure clearly depicts the resemblance between the simulated and real height-maps in the grading task.