The 2007 AMI(DA) System for Meeting Transcription

Hain, Thomas; Burget, Lukáš; Dines, John; Garau, Giulia; Karafiát, Martin; Leeuwen, David A. van; Lincoln, Mike; Wan, Vincent

doi:10.1007/978-3-540-68585-2_39

Cited by 31 publications

(31 citation statements)

References 15 publications

(19 reference statements)


“…Equally test sets for seminar style meetings from the CHIL corpora have been provided. In the past we have shown that the performance of the AMIDA systems on seminar data is strongly correlated to conference room meeting transcription [6], [12]. Hence, we have excluded such data from this paper.…”

Section: B Test Sets and Test Conditions

mentioning

confidence: 99%

“…The second task is segmentation and speaker clustering. In work on the RT'07 system we found a considerable mismatch between requirements for diarization and ASR [6]. ASR systems require clusters of reasonable size and can cope with segments that contain significant amounts of silence.…”

Section: B Far Field

mentioning

confidence: 99%

“…The transcription systems presented here are the joint effort of the AMI/AMIDA group, in a close international collaboration that participated in the RT evaluations every year since 2005 [5] with very competitive results in 2007 and 2009, on far and near-field tasks [6]. The rest of the paper is organized as follows: We first discuss data resources available for meeting processing and their properties, with special focus on the AMI corpus.…”

mentioning

confidence: 99%


Transcribing Meetings With the AMIDA Systems

Hain, Burget, Dines, et al. 2012

IEEE Trans. Audio Speech Lang. Process.

Self Cite

112


“…of the system would go beyond of the scope of this paper, the interested reader is referred to [28]. Table 7 shows details for various stages in the system, from the initial decoding with unadapted models to the output of the best branch in the system.…”

Section: Performance

mentioning

confidence: 99%

Recognition and understanding of meetings: the AMI and AMIDA projects

Renals, Hain, Bourlard 2007

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)

Self Cite

111


The AMI and AMIDA projects are concerned with the recognition and interpretation of multiparty meetings. Within these projects we have: developed an infrastructure for recording meetings using multiple microphones and cameras; released a 100 hour annotated corpus of meetings; developed techniques for the recognition and interpretation of meetings based primarily on speech recognition and computer vision; and developed an evaluation framework at both component and system levels. In this paper we present an overview of these projects, with an emphasis on speech recognition and content extraction.


“…Several scientific projects [18], [7] and challenges [8], [10] have been launched during the last decade targeting intelligent interfaces for indoors smart environments. Distant speech recognition (DSR) via distributed microphones is examined in most of them.…”

Section: Introduction

mentioning

confidence: 99%

On the improvement of modulation features using multi-microphone energy tracking for robust distant speech recognition

Rodomagoulakis, Maragos 2017

2017 25th European Signal Processing Conference (EUSIPCO)


Abstract: In this work, we investigate robust speech energy estimation and tracking schemes aiming at improved energy-based multiband speech demodulation and feature extraction for multi-microphone distant speech recognition. Based on the spatial diversity of the speech and noise recordings of a multi-microphone setup, the proposed Multichannel, Multiband Demodulation (MMD) scheme includes: 1) energy selection across the microphones that are less affected by noise and 2) cross-signal energy estimation based on the cross-Teager energy operator. Instantaneous modulations of speech resonances are estimated on the denoised energies. Second-order frequency modulation features are measured and combined with MFCCs achieving improved distant speech recognition on simulated and real data recorded in noisy and reverberant domestic environments.
