“…For instance, one could use cross-lingual Latent Semantic Indexing (Dumais et al, 1996), probabilistic Principal Component Analysis (Tipping and Bishop, 1999), or a probabilistic interpretation of non-negative matrix factorization (Lee and Seung, 1999;Gaussier and Goutte, 2005;Ding et al, 2008) on concatenated documents in aligned document pairs. Other more recent models include matching canonical correlation analysis (Haghighi et al, 2008;Daumé III and Jagarlamudi, 2011) and multilingual probabilistic topic models (Ni et al, 2009;De Smet and Moens, 2009;Mimno et al, 2009;Boyd-Graber and Blei, 2009;Zhang et al, 2010;Fukumasu et al, 2012).…”