The simulation of geophysical mass flows, including debris flows, rock and snow avalanches, has become an important tool in engineering hazard assessment. Especially the runout and deposition behaviour of observed and expected mass flows are of interest. When being confronted with the evaluation of model performance and sensitivity, there are no standard, objective approaches. In this contribution, we review methods that have been used in literature and outline a new approach to quantitatively compare 2D simulations of observed and simulated deposition pattern. Our proposed method is based on the comparison of normalized partial areas which can be plotted in a ternary diagram to visualize the degree of over-and under-estimation. Results can be summed up by a single metric between -1 (no fit) and 1 (perfect fit). This study shall help developers and end-users of simulation models to better understand model behaviour and provides a possibility for comparison of model results, independent of simulation platform and type of mass flow.