The 2005 AMI System for the Transcription of Speech in Meetings

Hain, Thomas; Burget, Lukáš; Dines, John; Garau, Giulia; Karafiát, Martin; Lincoln, Mike; McCowan, Iain; Moore, Darren; Wan, Vincent; Ordelman, Roeland; Renals, Steve

doi:10.1007/11677482_38

Cited by 45 publications

(40 citation statements)

References 19 publications

“…Table IV shows experimental results on the benefit of multiple microphones. Experiments were conducted using different acoustic data from the same meetings, including beamforming (as described in [5]) with varying numbers of microphones. All acoustic models were trained using standard maximum-likelihood estimation on the AMI corpus only and results were obtained on two different 6-hour test sets, one only with segments with overlapped speech and one without.…”

Section: A. AMI Corpus (mentioning)

confidence: 99%

“…Hence, the approach used needs to allow for variation, even though the NIST paradigm allows systems to make explicit use of meeting room characteristics, the AMIDA systems have not done so. Instead the audio signal is enhanced by beam-forming based on time-delay-of-arrival (TDOA) that can be used with any number of microphones [5]. After noise-filtering of the audio channels the TDOA is estimated.…”

Section: B. Far Field (mentioning)

confidence: 99%
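
The TDOA-based enhancement mentioned in the citation statement above can be illustrated with a minimal sketch. It is not the AMIDA implementation: it assumes plain cross-correlation for the delay estimate, integer-sample circular shifts, and simple delay-and-sum averaging. The function names `estimate_delay` and `delay_and_sum` and the `max_lag` parameter are hypothetical.

```python
import numpy as np

def estimate_delay(reference, channel, max_lag):
    """Return the lag (in samples) by which `channel` trails `reference`.

    Assumption: plain cross-correlation over integer lags, using circular
    shifts for simplicity.
    """
    lags = np.arange(-max_lag, max_lag + 1)
    # Score each candidate lag by undoing it and correlating with the reference.
    scores = [np.dot(reference, np.roll(channel, -lag)) for lag in lags]
    return int(lags[int(np.argmax(scores))])

def delay_and_sum(channels, max_lag=32):
    """Align every channel to channel 0 and average the aligned signals."""
    reference = channels[0]
    delays = [estimate_delay(reference, ch, max_lag) for ch in channels]
    aligned = [np.roll(ch, -d) for ch, d in zip(channels, delays)]
    return np.mean(aligned, axis=0), delays
```

Because the delays are estimated per channel from the signals themselves, the same routine works for any number of microphones and needs no knowledge of the room geometry, which is the property the quoted passage emphasises.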

“…The transcription systems presented here are the joint effort of the AMI/AMIDA group, in a close international collaboration that participated in the RT evaluations every year since 2005 [5] with very competitive results in 2007 and 2009, on far and near-field tasks [6]. The rest of the paper is organized as follows: We first discuss data resources available for meeting processing and their properties, with special focus on the AMI corpus.…”

mentioning

confidence: 99%

Transcribing Meetings With the AMIDA Systems

Hain, Burget, Dines, et al. 2012

IEEE Trans. Audio Speech Lang. Process.

Self Cite

112

“…The system is based on AMI-LVCSR system used in NIST RT'05 evaluation [13]. Here we only summarize the main features of the system: Data -The task is the recognition of meetings defined in NIST RT'05 evaluation.…”

Section: Methods (mentioning)

confidence: 99%

Probabilistic and Bottle-Neck Features for LVCSR of Meetings

Grézl, Karafiát, Kontar, et al. 2007

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07

Self Cite

282

167

In recent years, probabilistic features became an integral part of state-of-the-art LVCSR systems. In this work, we are exploring the possibility of obtaining the features directly from neural net without the necessity of converting output probabilities to features suitable for subsequent GMM-HMM system. We experimented with 5-layer MLP with bottle-neck in the middle layer. After training such a neural net, we used outputs of the bottle-neck as features for GMM-HMM recognition system. The benefits are twofold: first, improvement was gained when these features are used instead of the probabilistic features, second, the size of the system was reduced, as only part of the neural net is used. The experiments were performed on meetings recognition task defined in NIST RT'05 evaluation.
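
The bottle-neck idea in this abstract can be sketched as follows. This is a rough illustration under stated assumptions, not the authors' system: the layer sizes, the sigmoid activation, the random untrained weights, and the names `init_mlp` and `bottleneck_features` are all hypothetical. It shows only that feature extraction stops at the middle layer, so the later layers are never evaluated.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def init_mlp(layer_sizes, seed=0):
    """Create random (untrained) weights for an MLP; one (W, b) pair per connection."""
    rng = np.random.default_rng(seed)
    return [(rng.normal(0.0, 0.1, size=(n_in, n_out)), np.zeros(n_out))
            for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:])]

def bottleneck_features(frames, params, bottleneck_index):
    """Run the forward pass only up to the bottle-neck layer and return its outputs."""
    h = frames
    for weights, bias in params[:bottleneck_index]:
        h = sigmoid(h @ weights + bias)
    return h

# Hypothetical 5-layer network: input, hidden, bottle-neck, hidden, output.
layer_sizes = [39, 64, 30, 64, 45]
params = init_mlp(layer_sizes)
frames = np.random.default_rng(1).standard_normal((10, 39))  # 10 acoustic frames
features = bottleneck_features(frames, params, bottleneck_index=2)
```

In this sketch only the first two of the four weight matrices are used at feature-extraction time, which mirrors the abstract's remark that the system shrinks because only part of the network is needed. The resulting 30-dimensional vectors would then be passed to a GMM-HMM recogniser, which is not shown here.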

“…Delays and scale factors are then used in the final stage implementing superdirective beam-forming. More details can be found in [21].…”

Section: Multiple Distant Microphones (mentioning)

confidence: 99%

Recognition and understanding of meetings: the AMI and AMIDA projects

Renals, Hain, Bourlard 2007

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)

Self Cite

111

The AMI and AMIDA projects are concerned with the recognition and interpretation of multiparty meetings. Within these projects we have: developed an infrastructure for recording meetings using multiple microphones and cameras; released a 100 hour annotated corpus of meetings; developed techniques for the recognition and interpretation of meetings based primarily on speech recognition and computer vision; and developed an evaluation framework at both component and system levels. In this paper we present an overview of these projects, with an emphasis on speech recognition and content extraction.
