(1) Background: Applying deep learning to cancer diagnosis from medical images is one of the research hotspots in artificial intelligence and computer vision. Given the rapid development of deep learning methods, the very high accuracy and timeliness that cancer diagnosis demands, and the inherent particularity and complexity of medical imaging, a comprehensive review of relevant studies is necessary to help readers better understand the current state of research and its main ideas. (2) Methods: Five types of radiological images, namely X-ray, ultrasound (US), computed tomography (CT), magnetic resonance imaging (MRI), and positron emission computed tomography (PET), together with histopathological images, are reviewed in this paper. The basic architecture of deep learning and classical pretrained models are comprehensively reviewed. In particular, advanced neural network techniques that have emerged in recent years, including transfer learning, ensemble learning (EL), graph neural networks, and vision transformers (ViT), are introduced. Overfitting prevention methods are summarized, including batch normalization, dropout, weight initialization, and data augmentation. Applications of deep learning in medical image-based cancer analysis are surveyed. (3) Results: Deep learning has achieved great success in medical image-based cancer diagnosis, showing good results in image classification, image reconstruction, image detection, image segmentation, image registration, and image synthesis. However, the lack of high-quality labeled datasets limits the role of deep learning, which also faces challenges in rare cancer diagnosis, multi-modal image fusion, model explainability, and generalization. (4) Conclusions: There is a need for more public standard databases for cancer. Pretrained models based on deep neural networks still have room for improvement, and special attention should be paid to research on multimodal data fusion and the supervised paradigm. Technologies such as ViT, ensemble learning, and few-shot learning will bring surprises to cancer diagnosis based on medical images.
Current sign language recognition algorithms either can identify only static gestures or need data gloves, position sensors, and other auxiliary equipment, so they are used only for laboratory research and some special occasions. They are therefore not suited to widespread adoption. A new idea for sign language recognition based on key frames is presented in this paper. Dynamic sign language can be regarded as a series of static gestures, which can be called key frames. By detecting and identifying the key frame sequence, sign language can be recognized rapidly. An algorithm for key frame detection designed specifically for dynamic sign language is therefore proposed. This adaptive method uses image difference and classification theory from pattern recognition to extract key frames from video. Apart from a PC, the entire process requires only a camera, which makes it very easy to use. Experiments show that the key frames obtained in this way have good stability and accuracy, so real-time recognition of dynamic sign language can be realized.
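The abstract does not give the details of the key frame detector, so the following is only a minimal sketch of the general idea of using image difference to find the static gestures in a video. It assumes grayscale frames stored as nested lists, uses the mean inter-frame difference as a stand-in adaptive threshold, and picks the middle frame of each low-motion run as a key frame. The function names `frame_difference` and `detect_key_frames` and the thresholding rule are illustrative assumptions, not the authors' method, and the classification step mentioned in the abstract is not modeled.

```python
from typing import List, Sequence

# A grayscale frame as rows of pixel intensities (assumed representation).
Frame = Sequence[Sequence[float]]


def frame_difference(a: Frame, b: Frame) -> float:
    """Mean absolute pixel difference between two equally sized frames."""
    total = 0.0
    count = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            total += abs(pa - pb)
            count += 1
    return total / count if count else 0.0


def detect_key_frames(frames: Sequence[Frame]) -> List[int]:
    """Return the index of the middle frame of each low-motion run.

    Assumption: a held (static) gesture shows up as a run of consecutive
    frames whose inter-frame difference is at or below the mean difference.
    """
    if len(frames) < 2:
        return list(range(len(frames)))

    # diffs[i] measures the change between frame i and frame i + 1.
    diffs = [frame_difference(frames[i], frames[i + 1])
             for i in range(len(frames) - 1)]
    threshold = sum(diffs) / len(diffs)  # hypothetical adaptive threshold

    key_frames: List[int] = []
    run_start = None
    for i, d in enumerate(diffs):
        if d <= threshold:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            # The low-motion run covers frames run_start .. i inclusive.
            key_frames.append((run_start + i) // 2)
            run_start = None
    if run_start is not None:
        # The run extends to the last frame of the video.
        key_frames.append((run_start + len(frames) - 1) // 2)
    return key_frames
```

In practice the frames would come from a camera stream, and each detected key frame would then be passed to a static gesture classifier.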
Nowadays, predicting students' performance is one of the most specific topics for learning environments such as universities and schools, since it leads to the development of effective mechanisms that can enhance academic outcomes and avoid destruction. In Education 4.0, Artificial Intelligence (AI) can play a key role in identifying new factors in students' performance, implementing personalized learning, answering routine student questions, and applying learning analytics and predictive modeling. Redefining Education 4.0 to recognize creative and innovative students is a new challenge, and students' outcomes are difficult to determine. Hence, this paper proposes a Hybridized Deep Neural Network (HDNN) to predict student performance in Education 4.0. The proposed HDNN method is used to determine the dynamics that are likely to influence students' performance. The deep neural network monitors, predicts, and evaluates students' performance in an Education 4.0 environment. The findings show that the proposed HDNN method achieves better prediction accuracy than other popular methods.
Over the years, the continuous development of new technology has promoted research in posture recognition and greatly expanded its fields of application. The purpose of this paper is to introduce the latest methods of posture recognition and review the techniques and algorithms used in recent years, such as scale-invariant feature transform, histogram of oriented gradients, support vector machine (SVM), Gaussian mixture model, dynamic time warping, hidden Markov model (HMM), lightweight networks, and convolutional neural networks (CNN). We also investigate improved CNN methods, such as stacked hourglass networks, multi-stage pose estimation networks, convolutional pose machines, and high-resolution nets. The general process and datasets of posture recognition are analyzed and summarized, and several improved CNN methods and three main recognition techniques are compared. In addition, the applications of advanced neural networks in posture recognition, such as transfer learning, ensemble learning, graph neural networks, and explainable deep neural networks, are introduced. We found that CNN has achieved great success in posture recognition and is favored by researchers. Still, more in-depth research is needed in feature extraction, information fusion, and other aspects. Among classification methods, HMM and SVM are the most widely used, and lightweight networks are gradually attracting the attention of researchers. In addition, because 3D benchmark datasets are lacking, data generation is a critical research direction.
Artificial Intelligence (AI) refers to methods that enable computer systems to handle tasks requiring human-like intelligence, such as recognition, language translation, and visual interpretation. Subjective image quality assessment (IQA) is difficult to implement in real-time systems, so an objective IQA model that evaluates image quality in a meaningful way is needed. Intuitively, human eyes are not sensitive to distortion and damage in areas with lower visual saliency (VS), so VS is closely related to IQA. With this in mind, an effective IQA method is proposed that involves two steps. First, the local quality map of a distorted image is computed using the structural similarity function of its feature attributes, such as brightness, chrominance, and gradient. Second, the local quality map is weighted with VS to obtain the objective evaluation of image quality. The VS is modeled by extracting the saliency of low-level features of the image, removing interfering information from these saliency values based on an a priori threshold, and combining the remaining effective information to construct the saliency map. Fuzzy image processing gathers features and segments as fuzzy sets while processing images. Experiments on the two largest databases with six classical IQA metrics demonstrate that VS-weighted IQA metrics outperform their unweighted counterparts, and that the proposed IQA method has higher computational accuracy than the other IQA metrics at moderate computational complexity, especially for two types of distorted images: local block-wise (Block) and fast-fading (FTF).
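The second step described above, weighting a local quality map with visual saliency, can be illustrated with a short sketch. The abstract does not state the pooling formula, so this example assumes the common saliency-weighted average, score = sum(VS * Q) / sum(VS), and treats the a priori threshold as a simple cutoff below which saliency is set to zero. The function name `saliency_weighted_score`, the `threshold` parameter, and the fallback to a plain mean are hypothetical choices made for illustration.

```python
from typing import Sequence

# A 2D map as rows of floats (assumed representation).
Map2D = Sequence[Sequence[float]]


def saliency_weighted_score(quality_map: Map2D,
                            saliency_map: Map2D,
                            threshold: float = 0.0) -> float:
    """Pool a local quality map into one score using saliency weights.

    Saliency values below `threshold` are treated as interference and
    contribute zero weight (an assumed reading of the a priori threshold).
    """
    weighted_sum = 0.0
    weight_total = 0.0
    plain_sum = 0.0
    count = 0
    for q_row, s_row in zip(quality_map, saliency_map):
        for q, s in zip(q_row, s_row):
            weight = s if s >= threshold else 0.0
            weighted_sum += weight * q
            weight_total += weight
            plain_sum += q
            count += 1
    if weight_total == 0.0:
        # No salient pixels survive: fall back to the unweighted mean.
        return plain_sum / count if count else 0.0
    return weighted_sum / weight_total
```

With this pooling, distortion in regions of low saliency has little effect on the final score, which matches the intuition stated in the abstract.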