Abstract—A robust face detection technique along with mouth localization, processing every frame in real time (video rate), is presented. Moreover, it is exploited for on-site motion analysis to verify "liveness" as well as to achieve lip reading of digits. A methodological novelty is the suggested quantized angle features ("quangles"), designed for illumination invariance without the need for preprocessing (e.g., histogram equalization). This is achieved by using both the gradient direction and the double-angle direction (the structure tensor angle), and by ignoring the magnitude of the gradient. Boosting techniques are applied in a quantized feature space. A major benefit is reduced processing time: the training of effective cascaded classifiers is feasible in very short time, less than 1 h for data sets of order 10^4. Scale invariance is implemented through the use of an image scale pyramid. We propose "liveness" verification barriers as applications for which a significant amount of computation is avoided when estimating motion. Novel strategies to avert advanced spoofing attempts (e.g., replayed videos that include person utterances) are demonstrated. We present favorable results on face detection for the YALE face test set and competitive results for the CMU-MIT frontal face test set as well as on "liveness" verification barriers.
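As a rough illustration of the quantized angle idea, the sketch below (not the authors' implementation; the bin count and the gradient operator are assumptions) computes the per-pixel gradient direction and its double-angle counterpart, quantizes both into discrete bins, and discards the gradient magnitude. Because direction is unchanged under positive affine brightness changes, the resulting labels are illumination invariant without any preprocessing.

```python
import numpy as np

def quangle_features(image, n_bins=8):
    """Quantized angle ("quangle") features, sketched: quantize the
    gradient direction and the double-angle direction into n_bins
    labels each, ignoring the gradient magnitude."""
    img = image.astype(np.float64)
    gy, gx = np.gradient(img)                    # simple central differences
    theta = np.arctan2(gy, gx)                   # gradient direction in [-pi, pi]
    double = np.mod(2.0 * theta, 2.0 * np.pi)    # double-angle direction in [0, 2*pi)
    # Quantize each angle into n_bins discrete labels; the magnitude
    # of the gradient is never used, only its direction.
    q_single = np.floor((theta + np.pi) / (2.0 * np.pi) * n_bins).astype(int) % n_bins
    q_double = np.floor(double / (2.0 * np.pi) * n_bins).astype(int) % n_bins
    return q_single, q_double
```

Since the gradient of a*I + b (a > 0) points in the same direction as the gradient of I, the quantized labels of a brightness-rescaled image coincide with those of the original, which is the claimed invariance.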
Abstract. This paper presents a speaker-independent audio-visual digit recognition system that utilizes speech and visual lip signals. The extracted visual features are based on line-motion estimation from video sequences with low resolution (128×128 pixels) and increase the robustness of audio recognition. The core experiments investigate lip-motion biometrics both as a stand-alone modality and as a modality merged into a speech recognition system. The system uses Support Vector Machines and shows favourable experimental results, with digit recognition rates of 83% to 100% on the XM2VTS database, depending on the amount of available visual information.
Abstract. This paper proposes a new robust bi-modal audio-visual digit and speaker recognition system based on lip-motion and speech biometrics. To increase the robustness of digit and speaker recognition, we propose a method using speaker lip-motion information extracted from video sequences with low resolution (128×128 pixels). We investigate a biometric system for digit recognition and speaker identification based on line-motion estimation combined with speech information and Support Vector Machines. The acoustic and visual features are fused at the feature level, showing favourable results, with digit recognition rates of 83% to 100% and speaker recognition of 100% on the XM2VTS database.
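Feature-level fusion as described above can be sketched minimally as follows (the per-modality standardization step is an assumption, not stated in the abstract): frame-aligned acoustic and visual feature vectors are normalized so that neither modality dominates, then concatenated into one joint vector per frame, which would subsequently be fed to an SVM classifier.

```python
import numpy as np

def zscore(x):
    """Standardize each feature column to zero mean, unit variance."""
    return (x - x.mean(axis=0)) / (x.std(axis=0) + 1e-8)

def fuse_features(audio_feats, visual_feats):
    """Feature-level fusion, sketched: normalize each modality, then
    concatenate the frame-aligned vectors into one joint vector."""
    if audio_feats.shape[0] != visual_feats.shape[0]:
        raise ValueError("modalities must be frame-aligned before fusion")
    return np.hstack([zscore(audio_feats), zscore(visual_feats)])
```

The joint vectors returned here are what a feature-level (early) fusion system hands to a single classifier, in contrast to decision-level (late) fusion, where each modality is classified separately and only the scores are combined.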
The present chapter reports on the use of lip motion as a stand-alone biometric modality as well as a modality integrated with audio speech for identity recognition, using digit recognition as a support. First, the authors estimate motion vectors from images of lip movements. The motion is modeled as the distribution of apparent line velocities in the movement of brightness patterns in an image. Then, they construct compact lip-motion features from the regional statistics of the local velocities. These can be used alone or merged with audio features to recognize identity or the uttered digit. The authors present person recognition results using the XM2VTS database, which contains video and audio data of 295 people. Furthermore, they present results on digit recognition when the system is used in a text-prompted mode to verify the liveness of the user. Such user challenges are intended to reduce the replay-attack risks of the audio system.
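The compact lip-motion features can be illustrated roughly as follows (the grid size and the choice of mean/standard-deviation statistics are assumptions for this sketch): the estimated velocity field over the mouth region is split into a grid of cells, and per-cell statistics of the local velocities are concatenated into one short feature vector.

```python
import numpy as np

def regional_motion_features(vx, vy, grid=(4, 4)):
    """Compact lip-motion features, sketched: split the velocity field
    (components vx, vy) into grid cells and keep the mean and standard
    deviation of each component per cell."""
    h, w = vx.shape
    gh, gw = grid
    feats = []
    for i in range(gh):
        for j in range(gw):
            rs, re = i * h // gh, (i + 1) * h // gh
            cs, ce = j * w // gw, (j + 1) * w // gw
            for comp in (vx, vy):
                cell = comp[rs:re, cs:ce]
                feats.extend([cell.mean(), cell.std()])
    return np.asarray(feats)
```

For a 128×128 field and a 4×4 grid this yields a 64-dimensional vector (16 cells × 2 components × 2 statistics), small enough to use alone or to concatenate with audio features.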