“…User study [42,43,56,65,[69][70][71][72]80,83,84,87,101,103,116,120] Similarly, Figure 6a shows the number of papers included in this study that employ each of the evaluation approaches. Notably, a large majority of the papers employ offline evaluation, although we should mention that one of the quality criteria applied is the use of publicly available datasets, so we are aware that this work is biased towards the presence of such studies.…”