Advances in computing technologies have broadened the horizon for vision-based surveillance, monitoring and control. However, most existing vision-based crowd analysis and classification systems are constrained by complex and unreliable feature learning over visual artefacts or video streams, especially under extreme conditions. Retrieving event-sensitive or crowd-type-sensitive spatio-temporal features for different crowd types under extreme conditions is a highly complex task, and despite numerous efforts in vision-based approaches, the absence of acoustic cues often creates ambiguity in crowd classification. In this research, a novel audio-based feature learning model is developed for crowd analysis and classification. The audio samples (extracted from the input video) were processed with static (fixed-size) sampling, pre-emphasis, block framing and Hann windowing, followed by extraction of acoustic features including GTCC, GTCC-Delta, GTCC-Delta-Delta, MFCC, spectral entropy, spectral flux, spectral slope and harmonics-to-noise ratio (HNR). Finally, the extracted acoustic features were classified using a random forest ensemble classifier. The audio-based classification model yielded a classification accuracy of 92.67%, precision of 93.80%, sensitivity of 82.91%, specificity of 90.48% and an F-measure of 0.9239.
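To make the described pipeline concrete, the following is a minimal sketch, assuming librosa and scikit-learn are available; it covers pre-emphasis, Hann-windowed framing, a subset of the listed features (MFCC with deltas, spectral entropy, spectral flux) and the random forest stage. GTCC, spectral slope and HNR are omitted here since they require a gammatone filterbank and a pitch-based harmonicity estimate not bundled with librosa; the function name extract_features and all parameter values are illustrative assumptions, not the paper's settings.

```python
import numpy as np
import librosa
from sklearn.ensemble import RandomForestClassifier

def extract_features(path, sr=16000, frame_len=1024, hop=512):
    """Pre-emphasis, Hann-windowed framing, then per-frame acoustic
    features aggregated into one fixed-size vector per audio clip."""
    y, sr = librosa.load(path, sr=sr)
    y = librosa.effects.preemphasis(y, coef=0.97)   # pre-emphasis filter

    # Hann-windowed STFT (librosa applies a Hann window by default).
    S = np.abs(librosa.stft(y, n_fft=frame_len, hop_length=hop))

    # MFCCs with first- and second-order deltas (the GTCC deltas in the
    # paper would be computed analogously on gammatone cepstra).
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13,
                                n_fft=frame_len, hop_length=hop)
    d1 = librosa.feature.delta(mfcc)
    d2 = librosa.feature.delta(mfcc, order=2)

    # Spectral entropy per frame from the normalized power spectrum.
    p = S ** 2
    p = p / (p.sum(axis=0, keepdims=True) + 1e-12)
    entropy = -(p * np.log2(p + 1e-12)).sum(axis=0)

    # Spectral flux: frame-to-frame change in the magnitude spectrum.
    flux = np.sqrt((np.diff(S, axis=1) ** 2).sum(axis=0))

    # Aggregate frame-level features into clip-level mean/std statistics.
    def stats(x):
        return np.hstack([x.mean(axis=1), x.std(axis=1)])

    return np.hstack([stats(mfcc), stats(d1), stats(d2),
                      stats(entropy[None, :]), stats(flux[None, :])])

# Hypothetical usage: X stacks one feature vector per clip, y_labels holds
# the crowd-type labels; the ensemble size is an illustrative choice.
# X = np.vstack([extract_features(p) for p in clip_paths])
clf = RandomForestClassifier(n_estimators=200, random_state=0)
# clf.fit(X_train, y_train); y_pred = clf.predict(X_test)
```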