Alzheimer’s disease (AD) is a progressive and irreversible neurodegenerative disease. Neuroimaging based on magnetic resonance imaging (MRI) is one of the most intuitive and reliable methods to perform AD screening and diagnosis. Clinical head MRI detection generates multimodal image data, and to solve the problem of multimodal MRI processing and information fusion, this paper proposes a structural and functional MRI feature extraction and fusion method based on generalized convolutional neural networks (gCNN). The method includes a three-dimensional residual U-shaped network based on hybrid attention mechanism (3D HA-ResUNet) for feature representation and classification for structural MRI, and a U-shaped graph convolutional neural network (U-GCN) for node feature representation and classification of brain functional networks for functional MRI. Based on the fusion of the two types of image features, the optimal feature subset is selected based on discrete binary particle swarm optimization, and the prediction results are output by a machine learning classifier. The validation results of multimodal dataset from the AD Neuroimaging Initiative (ADNI) open-source database show that the proposed models have superior performance in their respective data domains. The gCNN framework combines the advantages of these two models and further improves the performance of the methods using single-modal MRI, improving the classification accuracy and sensitivity by 5.56% and 11.11%, respectively. In conclusion, the gCNN-based multimodal MRI classification method proposed in this paper can provide a technical basis for the auxiliary diagnosis of Alzheimer’s disease.
Lung cancer is one of the tumors with the highest incidence rate and mortality rate in the world. It is also the malignant tumor with the fastest growing number of patients, which seriously threatens human life. How to improve the accuracy of diagnosis and treatment of lung cancer and the survival prognosis is particularly important. Machine learning is a multi-disciplinary interdisciplinary specialty, covering the knowledge of probability theory, statistics, approximate theory and complex algorithm. It uses computer as a tool and is committed to simulating human learning methods, and divides the existing content into knowledge structures to effectively improve learning efficiency and being able to integrate computer science and statistics into medical problems. Through the introduction of algorithm to absorb the input data, and the application of computer analysis to predict the output value within the acceptable accuracy range, identify the patterns and trends in the data, and finally learn from previous experience, the development of this technology brings a new direction for the diagnosis and treatment of lung cancer. This article will review the performance and application prospects of different types of machine learning algorithms in the clinical diagnosis and survival prognosis analysis of lung cancer.
To solve the safety problems caused by the restriction of interaction space and the singular configuration of rehabilitation robot in terminal traction upper limb rehabilitation training, a trajectory planning and tracking control scheme for rehabilitation training is proposed. The human-robot safe interaction space was obtained based on kinematics modeling and rehabilitation theory, and the training trajectory was planned based on the occupational therapy in rehabilitation medicine. The singular configuration of the rehabilitation robot in the interaction space was avoided by exponential adaptive damped least square method. Then, a nonlinear controller for the upper limb rehabilitation robot was designed based on the backstepping control method. Radial basis function neural network was used to approximate the robot model information online to achieve model-free control. The stability of the controller was proved by Lyapunov stability theory. Experimental results demonstrate the effectiveness and superiority of the proposed singular avoidance control scheme.
Fetal electrocardiogram (ECG) signals provide important clinical information for early diagnosis and intervention of fetal abnormalities. In this paper, we propose a new method for fetal ECG signal extraction and analysis. Firstly, an improved fast independent component analysis method and singular value decomposition algorithm are combined to extract high-quality fetal ECG signals and solve the waveform missing problem. Secondly, a novel convolutional neural network model is applied to identify the QRS complex waves of fetal ECG signals and effectively solve the waveform overlap problem. Finally, high quality extraction of fetal ECG signals and intelligent recognition of fetal QRS complex waves are achieved. The method proposed in this paper was validated with the data from the PhysioNet computing in cardiology challenge 2013 database of the Complex Physiological Signals Research Resource Network. The results show that the average sensitivity and positive prediction values of the extraction algorithm are 98.21% and 99.52%, respectively, and the average sensitivity and positive prediction values of the QRS complex waves recognition algorithm are 94.14% and 95.80%, respectively, which are better than those of other research results. In conclusion, the algorithm and model proposed in this paper have some practical significance and may provide a theoretical basis for clinical medical decision making in the future.
Deep learning method can be used to automatically analyze electrocardiogram (ECG) data and rapidly implement arrhythmia classification, which provides significant clinical value for the early screening of arrhythmias. How to select arrhythmia features effectively under limited abnormal sample supervision is an urgent issue to address. This paper proposed an arrhythmia classification algorithm based on an adaptive multi-feature fusion network. The algorithm extracted RR interval features from ECG signals, employed one-dimensional convolutional neural network (1D-CNN) to extract time-domain deep features, employed Mel frequency cepstral coefficients (MFCC) and two-dimensional convolutional neural network (2D-CNN) to extract frequency-domain deep features. The features were fused using adaptive weighting strategy for arrhythmia classification. The paper used the arrhythmia database jointly developed by the Massachusetts Institute of Technology and Beth Israel Hospital (MIT-BIH) and evaluated the algorithm under the inter-patient paradigm. Experimental results demonstrated that the proposed algorithm achieved an average precision of 75.2%, an average recall of 70.1% and an average F1-score of 71.3%, demonstrating high classification accuracy and being able to provide algorithmic support for arrhythmia classification in wearable devices.
Pneumoconiosis ranks first among the newly-emerged occupational diseases reported annually in China, and imaging diagnosis is still one of the main clinical diagnostic methods. However, manual reading of films requires high level of doctors, and it is difficult to discriminate the staged diagnosis of pneumoconiosis imaging, and due to the influence of uneven distribution of medical resources and other factors, it is easy to lead to misdiagnosis and omission of diagnosis in primary healthcare institutions. Computer-aided diagnosis system can realize rapid screening of pneumoconiosis in order to assist clinicians in identification and diagnosis, and improve diagnostic efficacy. As an important branch of deep learning, convolutional neural network (CNN) is good at dealing with various visual tasks such as image segmentation, image classification, target detection and so on because of its characteristics of local association and weight sharing, and has been widely used in the field of computer-aided diagnosis of pneumoconiosis in recent years. This paper was categorized into three parts according to the main applications of CNNs (VGG, U-Net, ResNet, DenseNet, CheXNet, Inception-V3, and ShuffleNet) in the imaging diagnosis of pneumoconiosis, including CNNs in pneumoconiosis screening diagnosis, CNNs in staging diagnosis of pneumoconiosis, and CNNs in segmentation of pneumoconiosis foci to conduct a literature review. It aims to summarize the methods, advantages and disadvantages, and optimization ideas of CNN applied to the images of pneumoconiosis, and to provide a reference for the research direction of further development of computer-aided diagnosis of pneumoconiosis.
With the rapid improvement of the perception and computing capacity of mobile devices such as smart phones, human activity recognition using mobile devices as the carrier has been a new research hot-spot. The inertial information collected by the acceleration sensor in the smart mobile device is used for human activity recognition. Compared with the common computer vision recognition, it has the following advantages: convenience, low cost, and better reflection of the essence of human motion. Based on the WISDM data set collected by smart phones, the inertial navigation information and the deep learning algorithm-convolutional neural network (CNN) were adopted to build a human activity recognition model in this paper. The K nearest neighbor algorithm (KNN) and the random forest algorithm were compared with the CNN network in the recognition accuracy to evaluate the performance of the CNN network. The classification accuracy of CNN model reached 92.73%, which was much higher than KNN and random forest. Experimental results show that the CNN algorithm model can achieve more accurate human activity recognition and has broad application prospects in predicting and promoting human health.
With the wide application of deep learning technology in disease diagnosis, especially the outstanding performance of convolutional neural network (CNN) in computer vision and image processing, more and more studies have proposed to use this algorithm to achieve the classification of Alzheimer’s disease (AD), mild cognitive impairment (MCI) and normal cognition (CN). This article systematically reviews the application progress of several classic convolutional neural network models in brain image analysis and diagnosis at different stages of Alzheimer’s disease, and discusses the existing problems and gives the possible development directions in order to provide some references.
Colorectal cancer (CRC) is a common malignant tumor that seriously threatens human health. CRC presents a formidable challenge in terms of accurate identification due to its indistinct boundaries. With the widespread adoption of convolutional neural networks (CNNs) in image processing, leveraging CNNs for automatic classification and segmentation holds immense potential for enhancing the efficiency of colorectal cancer recognition and reducing treatment costs. This paper explores the imperative necessity for applying CNNs in clinical diagnosis of CRC. It provides an elaborate overview on research advancements pertaining to CNNs and their improved models in CRC classification and segmentation. Furthermore, this work summarizes the ideas and common methods for optimizing network performance and discusses the challenges faced by CNNs as well as future development trends in their application towards CRC classification and segmentation, thereby promoting their utilization within clinical diagnosis.
The processing mechanism of the human brain for speech information is a significant source of inspiration for the study of speech enhancement technology. Attention and lateral inhibition are key mechanisms in auditory information processing that can selectively enhance specific information. Building on this, the study introduces a dual-branch U-Net that integrates lateral inhibition and feedback-driven attention mechanisms. Noisy speech signals input into the first branch of the U-Net led to the selective feedback of time-frequency units with high confidence. The generated activation layer gradients, in conjunction with the lateral inhibition mechanism, were utilized to calculate attention maps. These maps were then concatenated to the second branch of the U-Net, directing the network’s focus and achieving selective enhancement of auditory speech signals. The evaluation of the speech enhancement effect was conducted by utilising five metrics, including perceptual evaluation of speech quality. This method was compared horizontally with five other methods: Wiener, SEGAN, PHASEN, Demucs and GRN. The experimental results demonstrated that the proposed method improved speech signal enhancement capabilities in various noise scenarios by 18% to 21% compared to the baseline network across multiple performance metrics. This improvement was particularly notable in low signal-to-noise ratio conditions, where the proposed method exhibited a significant performance advantage over other methods. The speech enhancement technique based on lateral inhibition and feedback-driven attention mechanisms holds significant potential in auditory speech enhancement, making it suitable for clinical practices related to artificial cochleae and hearing aids.