This article argues that museum visiting and the act of 'spectatorship', both of which are often assumed to be ocularcentric, are multimodal events. Anchored in Goffman's dramaturgy and frame analysis theory, as well as Kress's multimodal and social semiotic theory of representation and communication, this article presents an apposite interpretative and methodological framework to account for what has not been widely addressed by museum studies; that is, the multimodality of the museum experience. By drawing upon audio-visual excerpts of museum encounters, this analysis brings to the fore the embodied visiting and viewing practices of visitors in museum galleries. Specifically, this article highlights the range of modes of communication and representation, beyond gazing and looking, which are employed, negotiated and regulated within the social context of the visit. The article suggests that visitors' experiences are embodied and performative interactions with the exhibits and other visitors.The analysis draws upon methodological developments within sociology and in particular Erving Goffman's dramaturgy and frame analysis theory (1963; 1971), as well as Gunther Kress's (2010) multimodal and social semiotic theory of representation and communication. By drawing upon multimodality, we show how talk, gesture, gaze and elements of the material context blend together and contribute to the production of meaning. We expand on existing sociocultural research into museums, stressing that museum encounters are embodied and multimodal events, during which physical movement, gesture, and gaze may reveal aspects of meaning making not apparent through analysis of the verbal mode alone. By treating nonverbal modes of expression as resources of meaning-making activity, we suggest that action, experience and communication may be brought into fruitful dialogue, if not integrated, to foreground the multiplicity of ways through which people communicate (Bezemer and Mavers 2011).It is through bringing together these two theories, ethnomethodology and multimodal social semiotics that we engage in a much needed interdisciplinary conversation (Dicks 2014). The virtue of these perspectives is that, by contrast to structural and deterministic sociological approaches, they permit us to theorize the agency of visitors as co-producers of meaning. The article proposes an appropriate interpretative and methodological framework which illuminates the social worlds of museums. Both the theoretical framework and the methodological tools employed allow the traditional mind-body dualism to be overcome in order to explore the modes and performances of visitors' encounters, as they arise in and through interaction with people and exhibits. The approach adopted in this article allows us to better understand this mediation through the exhibits, as well as through other fellow and co-present visitors. By raising social interaction to prominence, this article (i) foregrounds the social worlds of museums; and (ii) challenges notions of the 'static' visito...