Since the image based intelligent video monitoring system has limited viewing angle, the blind monitoring zone will easily appear when the target is not in the field range of the camera. In order to solve the above problem, an intelligent video monitoring system with auditory function is proposed in this article on the basis of the advantages of sound localization. Firstly, the linear microphone array is acquired and the time delay estimation technology is adopted for sound localization; secondly, the camera is driven to turn to the sound source position to acquire video information; finally, the image detection program is adopted to locate and track the target in a real-time manner, and meanwhile the system feasibility is verified through the simulation experiment. The result shows that the system has good localization and tracking accuracy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.