“…From a statistical point of view, systematic deviations of the, possibly unknown, real distributions of the variables represented in the data can lead to inaccurate estimations and constitute a statistical bias. For example, representation disparities in the data of the users [29], items [2,20,65], or their recorded interactions [85] can compromise the Categories of bias depending on the location in the AI workflow where bias originates [64] Bias location Due to Reference Bias at source External bias [5,6,18,61,67] Functional bias [2,27,29,54,65,74,85] Bias at collection Sampling [34,43,47] Querying [19,76] Data pre-processing Annotation [3,14,17,20,21,24,31,38,39,41,46,51,57] Aggregation [88] Data analysis Inference and prediction [22,53,80] quality and fairness of RS. Searching for information based only on the distributions of a specific dataset can lead to irrelevant results or results biased to other meanings of the words used in the query [19,76].…”