In this paper, we present a mesh-based motion estimation scheme for image sequence. Nodal motion vectors optimization is performed by using a multi-resolution differential method. Because our final aim is mesh tracking throughout a video sequence with optimized reconstruction, neither backward tracking nor forward tracking is well suited. One motivation of our work is to take advantage of both forward tracking (which enables tracking) and backward tracking (for its efficiency) in a "backward in forward" method. For the optimization of the nodal motion vectors, we also propose a novel approach with multi-resolution and several hierarchy levels, which, in addition, makes it possible scalable representation. This is achieved with a progressive representation defined according to a rate distortion criterion. Results are presented to illustrate the proposed methods.