“…To address the context limitation, recent state-of-the-art methods use more past frames as feature memory [36,13,64,21,28,58,16]. Particularly, Space-Time Memory (STM) [36] is popular and has been extended by many follow-up works [43,8,18,54,50,31,9,44,33]. Among these extensions, we use STCN [9] as our working memory backbone as it is simple and effective.…”