2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698) 2003
DOI: 10.1109/icme.2003.1221328
|View full text |Cite
|
Sign up to set email alerts
|

Pitch and timbre manipulations using cortical representation of sound

Abstract: The sound received at the ears is processed by humans using signalprocessing that separates the signal along intensity, pitch and timbre dimensions. Conventional Fourier-based signal processing, while endowed with fast algorithms, is unable to easily represent signal along these attributes. In this paper we use a recently proposed cortical representation to represent and manipulate sound. We briefly overview algorithms for obtaining, manipulating and inverting cortical representation of a sound and describe al… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2005
2005
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 8 publications
0
3
0
Order By: Relevance
“…The constant-Q factor characteristic of modulation filters also lead to higher resolution at lower modulation frequencies, hence, providing an arrangement that helps in identifying any deviation from general (or Neutral ) speech modulation rate (2-4 Hz) [51]. Our choice of constant-Q filters in both stages is also inspired from the study of early auditory and cortical stages of mammalian auditory cortex [31,52]. We term our proposed feature as constant-Q transform based modulation spectral feature (CQT-MSF).…”
Section: Motivation and Contributionsmentioning
confidence: 99%
See 1 more Smart Citation
“…The constant-Q factor characteristic of modulation filters also lead to higher resolution at lower modulation frequencies, hence, providing an arrangement that helps in identifying any deviation from general (or Neutral ) speech modulation rate (2-4 Hz) [51]. Our choice of constant-Q filters in both stages is also inspired from the study of early auditory and cortical stages of mammalian auditory cortex [31,52]. We term our proposed feature as constant-Q transform based modulation spectral feature (CQT-MSF).…”
Section: Motivation and Contributionsmentioning
confidence: 99%
“…This shows the importance of temporal modulation for pitch representation, and hence, SER. Temporal modulation extracted by MSF represent tempo [68], pitch, and timber [52], all of which are related to emotion information in speech. Fig.…”
Section: Constant-q Based Modulation Spectral Features (Cqt-msf)mentioning
confidence: 99%
“…If two natural sounds with different timbres are given, there may arise a situation where a sound that interpolates the timbre has to be synthesized [11]. The cortical model is a computational model to observe how the brain is able to obtain and integrate the multitude of cues like loudness, location, timbre, and pitch arriving at the ears [12]. Two transient sounds from the same type of acoustical interaction under different conditions can be used in the morphing operation to generate physically plausible intermediate sounds [13].…”
Section: Introductionmentioning
confidence: 99%