In recent years, research on the intelligent fault diagnosis of rotating machinery has made remarkable progress, bringing considerable economic benefits to industrial production. However, in the industrial environment, the accuracy and stability of the diagnostic model face severe challenges due to the extremely limited fault data. Data augmentation methods have the capability to increase both the quantity and diversity of data without altering the key characteristics of the original data, which is particularly important for the development of intelligent fault diagnosis of rotating machinery under limited data conditions (IFD-RM-LDC). Despite the abundant achievements in research on data augmentation methods, there is a lack of systematic reviews and clear future development directions. Therefore, this paper systematically reviews and discusses data augmentation methods for IFD-RM-LDC. Firstly, existing data augmentation methods are categorized into three groups: synthetic minority over-sampling technique (SMOTE)-based methods, generative model-based methods, and data transformation-based methods. Then, these three methods are introduced in detail and discussed in depth: SMOTE-based methods synthesize new samples through a spatial interpolation strategy; generative model-based methods generate new samples according to the distribution characteristics of existing samples; data transformation-based methods generate new samples through a series of transformation operations. Finally, the challenges faced by current data augmentation methods, including their limitations in generalization, real-time performance, and interpretability, as well as the absence of robust evaluation metrics for generated samples, have been summarized, and potential solutions to address these issues have been explored.