Recently, several digital watermarking techniques have been proposed for hiding data in the frequency domain of audio files in order to protect their copyrights. In general, there is a trade-off between the quality of the watermarked audio and the robustness of the watermarks against signal processing operations such as compression. In previous research, we improved both simultaneously by formulating a multi-objective optimization problem for deciding the positions of watermarks in the frequency domain of the audio data, and we obtained a near-optimum solution to the problem using a wavelet transform and a genetic algorithm. However, obtaining the near-optimum solution was very time consuming. To fundamentally overcome this issue, we have developed an authentication method for digital audio that uses a discrete wavelet transform. In contrast to digital watermarking, the proposed method inserts no additional information into the original audio; instead, the audio is authenticated using features extracted by the wavelet transform together with characteristic coding. Accordingly, the copyright-protected original audio can always be used as-is. The experimental results show that the method authenticates audio with high tolerance to all tested types of MP3, AAC, and WMA compression. In addition, the processing time of the method is acceptable for everyday use.
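The general idea of wavelet-based authentication without embedding can be sketched as follows. This is a minimal illustration, not the paper's method: the Haar transform, the sign-based binary signature standing in for "characteristic coding", and the 0.9 agreement threshold are all assumptions chosen for clarity.

```python
import numpy as np

def haar_dwt(signal):
    """One level of the Haar discrete wavelet transform.
    Returns (approximation, detail) coefficients."""
    s = np.asarray(signal, dtype=float)
    if len(s) % 2:                       # pad to an even length
        s = np.append(s, s[-1])
    approx = (s[0::2] + s[1::2]) / np.sqrt(2)
    detail = (s[0::2] - s[1::2]) / np.sqrt(2)
    return approx, detail

def feature_code(signal, levels=3):
    """Illustrative 'characteristic code': the sign pattern of the
    low-frequency approximation after several DWT levels. Coarse
    approximation signs tend to survive lossy compression."""
    a = np.asarray(signal, dtype=float)
    for _ in range(levels):
        a, _ = haar_dwt(a)
    return (a >= 0).astype(int)

def authenticate(original, received, levels=3, threshold=0.9):
    """Accept the received audio as authentic when the two codes
    agree on at least `threshold` of their bits."""
    c1 = feature_code(original, levels)
    c2 = feature_code(received, levels)
    n = min(len(c1), len(c2))
    return bool(np.mean(c1[:n] == c2[:n]) >= threshold)
```

Because the signature is derived from coarse spectral structure rather than exact sample values, moderate recompression changes few of its bits, while unrelated audio agrees only at chance level.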
We developed a method for recognizing the emotions (discomfort, hunger, or sleepiness) expressed in a baby's cries. A 32-point fast Fourier transform is applied to sound-form clips, detected by our previously reported method, that are used as training data. The power of the sound form judged to be a silent region is subtracted from the power of each frequency element, and each resulting power becomes one element of the feature vector. We perform principal component analysis (PCA) on the feature vectors of the training data. The baby's emotion is recognized by applying the nearest-neighbor criterion to the feature vector obtained from each test sound-form clip after projecting it onto the PCA space derived from the training data. The emotion occurring most frequently among the recognition results for a sound-form clip is then judged to be the emotion expressed by the baby's cry. We successfully applied the proposed method to the recognition of babies' emotions. The present investigation concerns the first stage of the development of a robotic baby caregiver that can detect a baby's emotions; in this first stage, we have developed a method for detecting those emotions. We expect that the proposed method could be used in robots that help take care of babies.