“…The state-of-the-art EVIO system, UltimateSLAM [43], operates by independently tracking visual features from pseudo-images reconstructed from events and optional images from a conventional camera, and fusing the tracks with inertial measurements using an existing optimization backend [23]. Here, we go one step further and directly apply an off-the-shelf VIO system (specifically, VINS-Mono [37], which is state-of-the-art [11]) to videos reconstructed from events using either our approach, MR, or HF, and evaluate against UltimateSLAM. As is standard [55,39,43], we use sequences from the Event Camera Dataset [31], which contain events, frames, and IMU measurements from a DAVIS240C [7] sensor.…”