“…The ME models have three issues: (1) the gating mechanism does not explicitly leverage the input-output dependencies of the data. Rather, it performs probabilistic inputspace partitioning, based on assumed data distributions such as the multinomial distribution (Jordan & Jacobs, 1994), Gaussian distribution (Yuan & Neubauer, 2009), Dirichlet process (Rasmussen & Ghahramani, 2002), Gaussian process (Tresp, 2001), etc; (2) in ME models strong experts are often needed to gain good performance (Yuksel et al, 2012); (3) the structure of the ME models, namely the tree depth and the number of experts, is often optimized through extra procedures, such as pruning (Waterhouse & Robinson, 1995) and Bayesian model selection (Bishop & Svenskn, 2002;Kanaujia & Metaxas, 2006). This increases the complexity of model learning.…”