“…In addition, Π needs to satisfy a product bin condition, i.e., ∀z n 1 = (z 1 , .., z n ) ∈ R d·n every event A ∈ π n (z n 1 ) is expressed by [7], A = A 1 × A 2 , where A 1 ∈ B(R p ) and A 2 ∈ B(R q ). With this, the learning-estimation process involves three phases: first, to use the empirical data to partition R d by π n (Z n 1 ), second, to use again the data to estimate P X,Y and P X × P Y restricted to the sigma field σ(π n (Z n 1 )) 1 , and finally, to consider the plug-in technique to get an empirical MI estimate on (R d , σ(π n (Z n 1 ))) [12]. Concerning the phase 2, the product bin condition is needed to estimate P X,Y as well as the reference measure P X × P Y only based on the iid realizations of the joint distribution P X,Y [7], [12].…”