2006
DOI: 10.1007/11677482_38
|View full text |Cite
|
Sign up to set email alerts
|

The 2005 AMI System for the Transcription of Speech in Meetings

Abstract: In this paper we describe the 2005 AMI system for the transcription of speech in meetings used for participation in the 2005 NIST RT evaluations. The system was designed for participation in the speech to text part of the evaluations, in particular for transcription of speech recorded with multiple distant microphones and independent headset microphones. System performance was tested on both conference room and lecture style meetings. Although input sources are processed using different front-ends, the recogni… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
39
0

Year Published

2006
2006
2012
2012

Publication Types

Select...
5
2
1

Relationship

7
1

Authors

Journals

citations
Cited by 45 publications
(40 citation statements)
references
References 19 publications
1
39
0
Order By: Relevance
“…Table IV shows experimental results on the benefit of multiple microphones. Experiments were conducted using different acoustic data from the same meetings, including beamforming (as described in [5]) with varying numbers of microphones. All acoustic models were trained using standard maximum-likelihood estimation on the AMI corpus only and results were obtained on two different 6-hour test sets, one only with segments with overlapped speech and one without.…”
Section: A Ami Corpusmentioning
confidence: 99%
See 2 more Smart Citations
“…Table IV shows experimental results on the benefit of multiple microphones. Experiments were conducted using different acoustic data from the same meetings, including beamforming (as described in [5]) with varying numbers of microphones. All acoustic models were trained using standard maximum-likelihood estimation on the AMI corpus only and results were obtained on two different 6-hour test sets, one only with segments with overlapped speech and one without.…”
Section: A Ami Corpusmentioning
confidence: 99%
“…Hence, the approach used needs to allow for variation, even though the NIST paradigm allows systems to make explicit use of meeting room characteristics, the AMIDA systems have not done so. Instead the audio signal is enhanced by beam-forming based on time-delay-of-arrival (TDOA) that can be used with any number of microphones [5]. After noise-filtering of the audio channels the TDOA is estimated.…”
Section: B Far Fieldmentioning
confidence: 99%
See 1 more Smart Citation
“…The system is based on AMI-LVCSR system used in NIST RT'05 evaluation [13]. Here we only summarize the main features of the system: Data -The task is the recognition of meetings defined in NIST RT'05 evaluation.…”
Section: Methodsmentioning
confidence: 99%
“…Delays and scale factors are then used in the final stage implementing superdirective beam-forming. More details can be found in [21].…”
Section: Multiple Distant Microphonesmentioning
confidence: 99%