“…The latter is of course preferable, as it avoids cost-intensive and error-prone manual work. Yet it involves various challenging research topics, such as automatic indexing [1,2,3], person identification [4,5,6], speech recognition [7], understanding [8], and summarisation [9]. In this work we address the first step towards automatic analysis: finding shot and scene boundaries in videos.…”