“…Graphical models and graph neural networks (GNNs) have emerged as powerful tools in the field of computer vision [49,51,38,48,27,40,57,64,65,54,41,59], offering a versatile framework for representing and processing complex visual data. These approaches leverage the inherent structure and relationships within images [55,53,37,50,58,11] and videos [45,43,2], allowing for more nuanced and context-aware analysis.…”