“…The availability of commodity RGB-D sensors [25,48,59] led to significant progress in estimating 3D hand pose given depth or RGB-D input [17,24,39,40]. Recently, the community has shifted its focus to RGB-based methods [20,37,45,60,80]. To overcome the lack of 3D annotated data, many methods employed synthetic training images [9,33,37,38,80].…”