Dynamic magnetic resonance imaging (MRI) is an emerging technique for studying speech production. In general, vocal tract image sequences are acquired during the speech of a word or phoneme. Sequences allow the identification of shapes taken by the vocal tract during speech production. However, there is no prior knowledge about the spatial and temporal resolution requirements, which are expected to vary for different speech tasks. Available approaches try to enhance the resolution of the images by empowering the acquisition devices, which can be very expensive. In this paper, we propose an alternative approach to enhance temporal resolution based solely on the observed image sequences. We use a previous non-rigid image registration method, which provides an intuitive background for temporal resolution enhancement. Based on a motion compensated interpolation (MCI) approach, intermediate images are coherent with the movement present in the whole sequence. Results indicate the effectiveness of our approach.