Premature babies have several immature functions and begin their life under high medical supervision. Since the sleep organization diers across postmenstrual age, its analysis may give a good indication of the degree of brain maturation. However, sleep analysis (polysomnography or behavioral observation) is dicult to install, time consuming and cannot systematically be used. In this context, development of new ways to automatically monitor the neonates, using contactless modalities, is necessary. Therefore, this study presents an innovative non-invasive approach to semi-automatize the classication of infant behavioral sleep states. Methods First, three descriptors were extracted from audio and video recordings: vocalizations, motion and eye state of the baby. For this purpose, an original semi-automatic algorithm for the estimation of the eye state was proposed. Secondly, the three descriptors were used in order to obtain an estimation of the behavioral sleep states. Five classiers (K-Nearest Neighbors, Linear Discriminant Analysis, Support Vector Machine, Random Forest and Multi-Layer Perceptron) were compared to an expert annotation. Results Firstly, the comparison of the semi-automatic eye state estimation to manual annotations of 10 videos led to a mean accuracy of 99.4%. Secondly, sleep stage classication was performed. Best results were obtained with Random Forest, for Quiet Alert and Active Alert stages, with 93.5% and 99.0% of accuracy respectively. Conclusion The proposed method provides a high capacity to identify alert sleep stages but the dierentiation between Quiet Sleep and Active Sleep only by behavioral observations still remains a dicult task to achieve. Signicance Results presented in this paper are new since no similar approach was proposed in the literature in the context of neonatal intensive care unit. They augur well for the automatic sleep organization assessment to improve newborn care.