2018
DOI: 10.48550/arxiv.1811.09678
Preprint

Speech recognition with quaternion neural networks

Titouan Parcollet,
Mirco Ravanelli,
Mohamed Morchid
et al.

Abstract: Neural network architectures are at the core of powerful automatic speech recognition (ASR) systems. However, while recent research focuses on novel model architectures, the acoustic input features remain almost unchanged. Traditional ASR systems rely on multidimensional acoustic features such as the Mel filter bank energies, along with their first- and second-order derivatives, to characterize the time-frames that compose the signal sequence. Considering that these components describe three different views of the…

Cited by 4 publications (5 citation statements)
References: 28 publications
“…They found that Q achieved faster convergence on the training loss as well as higher classification accuracy on the test set compared to R. Gaudet and Maida [10] made a similar comparison with image classification on the CIFAR-10 and CIFAR-100 datasets and image segmentation on the KITTI Road Segmentation dataset [9], but this time with Q having a quarter of the number of parameters as R. They reported that on both tasks, quaternion models gave higher accuracy than real and complex networks while having a lower parameter count. Similar advantages for quaternion neural networks over real networks were also found by Parcollet et al [29] for speech recognition.…”
Section: Quaternions (supporting, confidence: 78%)
“…Using higher-dimensional data embeddings, such as complex numbers or quaternions, has been successfully shown to reduce model parameters while maintaining accuracy [39,38,10,27]. Quaternions are a 4-dimensional extension to the complex numbers introduced by the mathematician William Rowan Hamilton in 1843 [27], and quaternion neural networks have been built for a variety of ML tasks [43,29,10,28,32,4]. Converting a real model to quaternion can lead to a 75% reduction in model parameters (which is explained in more detail in Sec.…”
Section: Introduction (mentioning, confidence: 99%)
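The 75% parameter reduction mentioned in the statement above follows from the weight sharing imposed by the Hamilton product: a quaternion dense layer reuses the same four real components in each 4x4 sub-block of the equivalent real weight matrix. The following is a minimal sketch (not taken from the cited paper's code) illustrating that counting argument; the layer sizes and function names are purely illustrative.

```python
# Minimal sketch: why a quaternion dense layer needs ~4x fewer parameters
# than a real dense layer covering the same number of real features.
# Assumed setup: n input and m output quaternion units, with quaternion
# weights stored as four real arrays (r, i, j, k) of shape (m, n).
import numpy as np

def quaternion_dense_params(n, m):
    # One quaternion weight per (input, output) pair -> 4 * n * m real parameters.
    return 4 * n * m

def real_dense_params(n, m):
    # An equivalent real layer maps 4n real inputs to 4m real outputs.
    return (4 * n) * (4 * m)

def hamilton_block(wr, wi, wj, wk):
    """Expand quaternion weights into the equivalent (4m, 4n) real matrix.

    The Hamilton product ties the 16 sub-blocks to only 4 free components,
    which is where the ~75% parameter reduction comes from.
    """
    return np.block([
        [wr, -wi, -wj, -wk],
        [wi,  wr, -wk,  wj],
        [wj,  wk,  wr, -wi],
        [wk, -wj,  wi,  wr],
    ])

if __name__ == "__main__":
    n, m = 256, 256  # 256 quaternion units = 1024 real-valued features per side
    q, r = quaternion_dense_params(n, m), real_dense_params(n, m)
    print(q, r, 1 - q / r)  # -> 262144 1048576 0.75
```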
“…Tay et al explored the use of quaternion networks for lightweight and efficient neural natural language processing in [43]. Parcollet et al investigated the use of quaternion-valued convolutional and recurrent neural networks on speech recognition in [44]. Parcollet et al studied the use of quaternion neural networks for theme identification of telephone conversations in [45].…”
Section: Previous Work On Quaternion Neural Network (mentioning, confidence: 99%)
“…Specifically, a quaternion number, containing one real part and three imaginary parts, and the corresponding quaternion-based neural networks [39][40][41][42] are expected to enhance the performance on processing of data with more degrees of freedom than the conventional real-number and complex-number systems. There have been various proposals about quaternion-based neural networks in ML techniques and applications in computer science, such as the quaternion convolutional neural network (qCNN) [38,43,44], quaternion recurrent neural network [45], quaternion generative adversarial networks [46], quaternion-valued variational autoencoder [47], quaternion graph neural networks [48], quaternion capsule networks [49] and quaternion neural networks for the speech recognitions [50]. However, the ML-related applications of the quaternion-based neural networks on solving problems in physics are still limited, especially in the topological phase detections, even though the quaternion-related concepts have been applied in some fields in physics [51][52][53].…”
Section: Introduction (mentioning, confidence: 99%)
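For reference, the "one real part and three imaginary parts" mentioned in the statement above is the standard quaternion algebra; a brief recap (standard textbook definitions, not taken from the cited works):

```latex
q = r + x\,\mathbf{i} + y\,\mathbf{j} + z\,\mathbf{k},
\qquad \mathbf{i}^2 = \mathbf{j}^2 = \mathbf{k}^2 = \mathbf{i}\mathbf{j}\mathbf{k} = -1.

% Hamilton product of q_1 = r_1 + x_1\mathbf{i} + y_1\mathbf{j} + z_1\mathbf{k}
% and q_2 = r_2 + x_2\mathbf{i} + y_2\mathbf{j} + z_2\mathbf{k}:
q_1 \otimes q_2 =
  (r_1 r_2 - x_1 x_2 - y_1 y_2 - z_1 z_2)
+ (r_1 x_2 + x_1 r_2 + y_1 z_2 - z_1 y_2)\,\mathbf{i}
+ (r_1 y_2 - x_1 z_2 + y_1 r_2 + z_1 x_2)\,\mathbf{j}
+ (r_1 z_2 + x_1 y_2 - y_1 x_2 + z_1 r_2)\,\mathbf{k}.
```

Quaternion neural networks replace the usual real-valued matrix-vector products with this Hamilton product, which is what lets a single 4-component weight act on all four components of each input unit.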