SummaryBased on the Q‐learning, this paper investigates the tracking problem for linear discrete‐time modal persistent dwell time switched systems. The modal persistent dwell time switching rule is introduced to coordinate the switching conditions of each switching subsystems. Moreover, two algorithms are given to realize the optimization of switched systems. Then, the stability and convergence are proved. Finally, a numerical simulation example and a practical example with an electro‐hydraulic system are given to verify the effectiveness of the proposed approach.