Abstract. This paper contributes to the modeling of audiovisual information, with a particular focus on the description needs for composing video elements (character, shot, scene, etc.) with other media (text, sound, image, etc.) inside a multimedia document. The model has been tested through an authoring and presentation tool called VideoMadeus. The results are illustrated with several example documents in which spatio-temporal synchronization of video is required.