2021
DOI: 10.3390/app112411926
A General Framework for Visualization of Sound Collections in Musical Interfaces

Abstract: While audio data play an increasingly central role in computer-based music production, interaction with large sound collections in most available music creation and production environments is very often still limited to scrolling long lists of file names. This paper describes a general framework for devising interactive applications based on the content-based visualization of sound collections. The proposed framework allows for a modular combination of different techniques for sound segmentation, analysis, and…

Cited by 4 publications (5 citation statements)
References 26 publications
“…The original waveforms are firstly analyzed by the method of Mel-frequency cepstral coefficients (MFCCs), [38,39] and then processed by the feature extraction method to reduce the dimension of each sample while keeping most of the information. [40,41] Details of the pre-processing are shown in Supporting Information. [34] After pre-processing, each audio segment is represented by a two-dimension feature vector and is ready to be analyzed by quantum processor.…”
Section: Results
confidence: 99%
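The pre-processing quoted above (MFCC analysis followed by dimensionality reduction to a two-dimensional feature vector per segment) can be sketched in Python. The cited paper does not name its reduction method, so this sketch assumes PCA; the `pca_2d` helper and the random stand-in for real MFCC frames are illustrative, not from the source.

```python
import numpy as np

def pca_2d(features):
    """Project feature vectors to 2D via PCA (SVD on the centered data)."""
    centered = features - features.mean(axis=0)
    # rows of Vt are the principal directions, ordered by variance
    U, S, Vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ Vt[:2].T

rng = np.random.default_rng(0)
# stand-in for per-segment MFCC vectors (e.g. 13 coefficients per segment)
mfccs = rng.normal(size=(100, 13))
points = pca_2d(mfccs)
print(points.shape)  # (100, 2): one 2D feature vector per audio segment
```

In practice the MFCC matrix would come from an audio analysis library rather than a random generator; only the reduction step is shown here.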
“…The value is then assumed to be a folder of audio samples. The samples are analyzed using the FluidCorpusMap library (Roma et al 2021). This library performs analysis and dimensionality reduction of the audio features and maps them to a grid using an assignment algorithm.…”
Section: Methods
confidence: 99%
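The grid-mapping step described here (dimensionality-reduced points assigned to unique grid cells via an assignment algorithm) can be sketched as follows. This is a minimal illustration assuming the Hungarian algorithm from SciPy and a unit-square grid; the `map_to_grid` function and the squared-distance cost are illustrative choices, not FluidCorpusMap's actual API.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def map_to_grid(points, side):
    """Assign each 2D point a unique grid cell, minimizing total squared distance."""
    # grid cell centres laid out in the unit square
    xs, ys = np.meshgrid(np.linspace(0, 1, side), np.linspace(0, 1, side))
    cells = np.column_stack([xs.ravel(), ys.ravel()])
    # normalise the points into the unit square
    p = (points - points.min(axis=0)) / np.ptp(points, axis=0)
    # cost matrix: squared distance from every point to every cell
    cost = ((p[:, None, :] - cells[None, :, :]) ** 2).sum(axis=2)
    rows, cols = linear_sum_assignment(cost)  # Hungarian algorithm
    return cells[cols]  # one unique cell per point

rng = np.random.default_rng(1)
pts = rng.normal(size=(16, 2))
grid_pos = map_to_grid(pts, side=4)  # 16 points onto a 4x4 grid
```

The assignment guarantees that no two sounds share a cell, which is what makes a dense collection browsable as a regular grid rather than an overlapping scatterplot.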
“…Corpus-based concatenative synthesis systems such as CataRT (Schwarz, Beller, Verbrugghe and Britton 2006) have traditionally been based on 2D scatterplots using scalar descriptors as axes. Several systems have explored dimensionality reduction and layout mapping for descriptor-based visualisation of sound collections in two dimensions (see Roma, Xambó, Green and Tremblay 2021, and references therein). These systems have generally been controlled through input devices such as mice, tablets, or other sensors.…”
Section: Introduction
confidence: 99%
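The interaction described here, selecting the sound unit nearest a cursor position in a 2D scatterplot of scalar descriptors, reduces to a nearest-neighbour lookup. A minimal sketch assuming Euclidean distance over two descriptors; the `nearest_unit` function and the descriptor values are hypothetical, not taken from CataRT.

```python
import numpy as np

def nearest_unit(units, cursor):
    """Return the index of the sound unit closest to the cursor in descriptor space."""
    d = ((units - cursor) ** 2).sum(axis=1)
    return int(np.argmin(d))

# hypothetical 2D descriptors (e.g. spectral centroid, loudness) for four units
units = np.array([[0.1, 0.2], [0.8, 0.9], [0.5, 0.5], [0.2, 0.8]])
print(nearest_unit(units, np.array([0.45, 0.55])))  # -> 2
```

A real system would trigger playback of the selected unit and typically use a spatial index for large collections, but the selection logic is this distance query.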