Collective motion takes many forms in nature; schools of fish, flocks of birds, and swarms of locusts to name a few. Commonly, during collective motion the individuals of the group avoid collisions. These collective motion and collision avoidance behaviors are based on input from the environment such as smell, air pressure, and vision, all of which are processed by the individual and defined action. In this work, a novel vision-based collective motion with collision avoidance model (i.e., VCMCA) that is simulating the collective evolution learning process is proposed. In this setting, a learning agent obtains a visual signal about its environment, and throughout trial-and-error over multiple attempts, the individual learns to perform a local collective motion with collision avoidance which emerges into a global collective motion with collision avoidance dynamics. The proposed algorithm was evaluated on the case of locusts’ swarms, showing the evolution of these behaviors in a swarm from the learning process of the individual in the swarm. Thus, this work proposes a biologically-inspired learning process to obtain multi-agents multi-objective dynamics.