Shinji Sako scite author profile

Shinji Sako

5Publications

34Citation Statements Received

8Citation Statements Given

How they've been cited

How they cite others

Affiliations

Nagoya Institute of Technology, The University of Tokyo

Publications

Order By: Most citations

Online Handwritten Kanji Recognition Based on Inter-stroke Grammar

Ota

Yamamoto

Sako

et al. 2007

View full text Add to dashboard Cite

This paper presents a new approach to online recognition of handwritten Kanji characters focusing on their hierarchical structure. Stochastic context-free grammar (SCFG) is introduced to represent the Kanji character generating process in combination with Hidden Markov Models (HMM) representing Kanji substrokes and to improve the recognition accuracy of important and frequently used Kanji characters in which inter-stroke relative positions play important roles. Combining the stroke likelihood and the relative-position likelihood between character-parts in the parsing process is expected to compensate their ambiguities. By modeling relative positions and share the models across distinct Kanji categories, a small training data can yield effective results and enables us to recognize Kanji simply by defining the SCFG rules to represent their structures without training data. Experimental results on an online handwritten Kanji database from JAIST (Japan Advanced Institute of Science and Technology) showed significant improvements in the recognition rates of some important Kanji with relatively fewer strokes and also showed little difference between the trained-and the non-trained Kanji in recognition rates.

show abstract

Recognition of JSL finger spelling using convolutional neural networks

Hosoe

Sako

Kwolek

2017

View full text Add to dashboard Cite

Subunit Modeling for Japanese Sign Language Recognition Based on Phonetically Depend Multi-stream Hidden Markov Models

Sako

Kitamura

2013

View full text Add to dashboard Cite

We work on automatic Japanese sign Language (JSL) recognition using Hidden Markov Model (HMM). An important issue for modeling sign is that how to determine the constituent element of sign (i.e., subunit) like "phoneme" in spoken language. We focused on special feature of sign language that JSL is composed of three types of phonological elements which is hand local information, position, and movement. In this paper, we propose an efficiently method of generating subunit using multi-stream HMM which is correspond to phonological elements. An isolated word recognition experiment has confirmed the effectiveness of our proposed method.

show abstract

Orpheus: Automatic Composition System Considering Prosody of Japanese Lyrics

Fukayama

Nakatsuma

Sako

et al. 2009

View full text Add to dashboard Cite

We present an algorithm for song composition using prosody of Japanese lyrics. Since Japanese is a "pitch accent" language, listener's apprehension is strongly affected by the pitch motions of the speaker. For example, the meaning of Japanese word "ha-shi" changes with the pitch. It means "bridge" with an upward pitch motion, and "chopsticks" with the motion inversed. A melody attached to the lyrics cause an effect similar to the pitch accent. Therefore we can assume that pitches of Japanese lyrics give constraints on pitch motions of the melody. Furthermore, chord progression, rhythm and accompaniment give constraints on the transitions and occurrences of the melody notes. If a certain melody for the lyrics were obtained, the melody would satisfy these constraints. Conversely, we can compose a song by finding the melody which optimally meets the condition. Implementation and Experimental ResultsOrpheus is an automatic composition system that we implemented using melody composition algorithm based on prosody. This system computes melody from the lyrics input with choices of chord progressions, rhythm patterns, and accompaniment instruments. We used Galatea-Talk[4] text-to-speech engine to analyze the prosody of Japanese lyrics, and HMM singing voice synthesizer[5] to generate the vocal part. We also implemented the system as a web-based application 1 . We did two experiments to evaluate the system. Firstly, we asked a classical music composer to evaluate 59 generated songs in five-grade evaluation. Secondly, we uploaded our system to get comments from a large number of users on the internet. During a year of operation, about 56,000 songs were generated by the users and 1378 people answered the questions about Orpheus and the generated songs. The results are shown in Fig. 1 and Fig. 2. Judging from the results, about 70.8% commented that the generated songs are attractive, and 84.9% of the users had fun trying this system.

show abstract

HMM-based text-to-audio-visual speech synthesis

Sako¹,

Tokuda²,

Masuko³

et al. 2000

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Shinji Sako

Online Handwritten Kanji Recognition Based on Inter-stroke Grammar

Recognition of JSL finger spelling using convolutional neural networks

Subunit Modeling for Japanese Sign Language Recognition Based on Phonetically Depend Multi-stream Hidden Markov Models

Orpheus: Automatic Composition System Considering Prosody of Japanese Lyrics

HMM-based text-to-audio-visual speech synthesis

Contact Info

Product

Resources

About