2022
DOI: 10.1109/jiot.2022.3143171
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal Event Processing: A Neural-Symbolic Paradigm for the Internet of Multimedia Things

Abstract: Modern distributed computing infrastructure need to process vast quantities of data streams generated by a growing number of participants with information generated in multiple formats. With the Internet of Multimedia Things (IoMT) becoming a reality, new approaches are needed to process realtime multimodal event data streams. Existing approaches to event processing have limited consideration for the challenges of multimodal events, including the need for complex content extraction, increased computational and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 12 publications
(4 citation statements)
references
References 47 publications
0
1
0
Order By: Relevance
“…NeSy visual semantic models have found useful applications in the representation of multimedia streams for realtime multimodal event processing in the Internet of Multimedia Things (IoMT) [19,54]. These models blend DNNs for object and attribute detection with symbolic rules to understand spatiotemporal relations among objects.…”
Section: Other Tasksmentioning
confidence: 99%
“…NeSy visual semantic models have found useful applications in the representation of multimedia streams for realtime multimodal event processing in the Internet of Multimedia Things (IoMT) [19,54]. These models blend DNNs for object and attribute detection with symbolic rules to understand spatiotemporal relations among objects.…”
Section: Other Tasksmentioning
confidence: 99%
“…In this discipline, methods regard people who text about occurrences on social media as sensors. Events are also discovered using space-time scan statistics (STSS) out without aid of the text utilizing only space and time [15]. STSS perceives text in a space-time cube, which moves a cylindrical window over all imaginable space-time locations with a height (time) and variable radius (space).…”
Section: International Journal On Recent and Innovation Trends In Com...mentioning
confidence: 99%
“…› Multimedia event processing (MEP) uses graphbased approaches for representing multimedia streams for real-time event processing in the middleware for the Internet of Multimedia Things. 10 MEP approaches use graph-based semantic models for representing video streams; deep learning models are used to detect objects and symbolic rules are employed to identify relationships between objects, which are required for matching high-level video events queried by users.…”
Section: Applicationsmentioning
confidence: 99%