In recent years, deep learning-based algorithms have been immensely employed and tested in a variety of real-world applications. The efficacy of such algorithms has been thoroughly examined in a practical setting. In this paper, CNN-based deep learning approaches are utilized to recognize faces in real-time to identify faces with and without mask. We employ pre-trained algorithms (YOLOv2 and SSD) to identify people wearing a face mask, which enables a machine to perform recognition tasks while evolving through a learning method. Meanwhile, if there is more than one person in the scene, the one with the max score will be selected for classification. Thus, a hybrid approach that combines YOLOv2 and SSD algorithms to work in parallel is developed for masked-face extraction. Likewise, the Viola-Jones algorithm is used here to detect faces without mask and randomly select a single region of interest (ROI) to be stored for classification. All pre-processing algorithms work separately in parallel as reconstruction steps for preprocessing to crop the ROI and store images for training and testing dataset. Followed by developing a lightweight computational complexity CNN model for face mask recognition to identify whether the selected person's face is wearing a mask or not. The dataset contains numerous variations in appearance and viewpoint to capture different scenarios with and without mask faces. On average, the proposed face mask detection architecture realizes recall and F1 score of 98.3% and 98.31%, respectively. The training performance, on the other hand, has improved by 32.1% for training time as compared to AlexNet and 40.43% of storage space (model size) reduction compared to EfficientNet-B0. The presented framework architecture is an efficient face mask and unmask detector and can be employed as a robust medical assistant face detector in the healthcare sector for automated tracking of a patient, visitor, or staff member wearing a mask or not.