The McGurk effect is a well known perceptual phenomenon in which listeners perceive [da], when presented with visual instances of [ga] syllables combined with the audio of [ba] syllables. However, the underlying cognitive mechanisms are not yet fully understood. In this study, we investigated the McGurk effect from the perspective of two learning theories – the Exemplar theory and Discriminative Learning. We hypothesized that the McGurk effect arises from distributional differences of the [ba, da, ga] syllables in a lexicon of a given language. We tested this hypothesis using computational implementations of these theories, simulating learning on the basis of lexica in which we varied the distributions of these syllables systematically. These simulations support our hypothesis. For both learning theories, we found that the probability of observing the McGurk effect in our simulations was greater, when lexica contained a larger percentage of [da] and [ba] instances. Crucially, the probability of the McGurk effect was inversely proportional to the percentage of [ga], what ever the percentage of [ba] and [da]. To validate our results, we inspected the distributional properties of [ba, da, ga] in different languages for which the McGurk effect was or was not previously attested. Our results mirror the findings for these languages. Our study shows that the McGurk effect – an instance of multi-modal perceptual integration – arises from distributional properties in different languages and thus depends on language learning.