The clinical manifestations of patients with schizophrenia and patients with depression not only have a certain similarity, but also change with the patient's mood, and thus lead to misdiagnosis in clinical diagnosis. Electroencephalogram (EEG) analysis provides an important reference and objective basis for accurate differentiation and diagnosis between patients with schizophrenia and patients with depression. In order to solve the problem of misdiagnosis between patients with schizophrenia and patients with depression, and to improve the accuracy of the classification and diagnosis of these two diseases, in this study we extracted the resting-state EEG features from 100 patients with depression and 100 patients with schizophrenia, including information entropy, sample entropy and approximate entropy, statistical properties feature and relative power spectral density (rPSD) of each EEG rhythm (δ, θ, α, β). Then feature vectors were formed to classify these two types of patients using the support vector machine (SVM) and the naive Bayes (NB) classifier. Experimental results indicate that: ① The rPSD feature vector P performs the best in classification, achieving an average accuracy of 84.2% and a highest accuracy of 86.3%; ② The accuracy of SVM is obviously better than that of NB; ③ For the rPSD of each rhythm, the β rhythm performs the best with the highest accuracy of 76%; ④ Electrodes with large feature weight are mainly concentrated in the frontal lobe and parietal lobe. The results of this study indicate that the rPSD feature vector P in conjunction with SVM can effectively distinguish depression and schizophrenia, and can also play an auxiliary role in the relevant clinical diagnosis.
By analyzing the mechanism of pronunciation, traditional acoustic parameters, including fundamental frequency, Mel frequency cepstral coefficients (MFCC), linear prediction cepstrum coefficient (LPCC), frequency perturbation, amplitude perturbation, and nonlinear characteristic parameters, including entropy (sample entropy, fuzzy entropy, multi-scale entropy), box-counting dimension, intercept and Hurst, are extracted as feature vectors for identification of pathological voice. Seventy-eight normal voice samples and 73 pathological voice samples for /a/, and 78 normal samples and 80 pathological samples for /i/ are recognized based on support vector machine (SVM). The results showed that compared with traditional acoustic parameters, nonlinear characteristic parameters could be well used to distinguish between healthy and pathological voices, and the recognition rates for /a/ were all higher than those for /i/ except for multi-scale entropy. That is why the /a/ sound data is used widely in related research at home and abroad for obtaining better identification of pathological voices. Adopting multi-scale entropy for /i/ could obtain higher recognition rate than /a/ between healthy and pathological samples, which may provide some useful inspiration for evaluating vocal compensatory function.
The research of sleep staging is not only the basis of diagnosing sleep related diseases, but also the precondition of evaluating sleep quality, and has important clinical significance. In recent years, the research of automatic sleep staging based on computer has become a hotspot and made some achievements. Feature extraction and feature classification are two key technologies in automatic sleep staging system. In order to achieve effective automatic sleep staging, we proposed a new automatic sleep staging method which combines the energy features and least squares support vector machines (LS-SVM). Firstly, we used FIR band-pass filter to extract the energy features of Pz-Oz channel sleep electroencephalogram (EEG) signals, and compared them with those from wavelet packet transform method. Then we designed an LS-SVM classifier to realize the automatic sleep stage classification. The research showed that FIR band-pass filter (with the Kaiser window) performed better than wavelet packet transform (WPT) for energy feature extraction just in terms of the data from the Sleep-EDF Database and the LS-SVM classifier (with the RBF Kernel function) designed was good, and the automatic sleep staging method proposed in this paper was better than many similar methods from other studies with an average accuracy of 88.89% and had a very prosperous application future.
At present the prediction method of epilepsy patients is very time-consuming and vulnerable to subjective factors, so this paper presented an automatic recognition method of epilepsy electroencephalogram (EEG) based on common spatial model (CSP) and support vector machine (SVM). In this method, the CSP algorithm for extracting spatial characteristics was applied to the detection of epileptic EEG signals. However, the algorithm did not consider the nonlinear dynamic characteristics of the signals and ignored the time-frequency information, so the complementary characteristics of standard deviation, entropy and wavelet packet energy were selected for the combination in the feature extraction stage. The classification process adopted a new double classification model based on SVM. First, the normal, interictal and ictal periods were divided into normal and paroxysmal periods (including interictal and ictal periods), and then the samples belonging to the paroxysmal periods were classified into interictal and ictal periods. Finally, three categories of recognition were realized. The experimental data came from the epilepsy study at the University of Bonn in Germany. The average recognition rate was 98.73% in the first category and 99.90% in the second category. The experimental results show that the introduction of spatial characteristics and double classification model can effectively solve the problem of low recognition rate between interictal and ictal periods in many literatures, and improve the identification efficiency of each period, so it provides an effective detecting means for the prediction of epilepsy.
Entropy model is widely used in epileptic electroencephalogram (EEG) analysis, but there are few reports on how to objectively select the parameters to compute the entropy model in the analysis of resting-state functional magnetic resonance imaging (rfMRI). Therefore, an optimization algorithm to confirm the parameters in multi-scale entropy (MSE) model was proposed, and the location of epileptogenic hemisphere was taken as an example to test the optimization effect by supervised machine learning. The rfMRI data of 20 temporal lobe epilepsy (TLE) patients with hippocampal sclerosis, positive on structural magnetic resonance imaging, were divided into left and right groups. Then, the parameters in MSE model were optimized by the receiver operating characteristic curves (ROC) and area under ROC curve (AUC) values in sensitivity analysis, and the entropy value of the brain regions with statistically significant difference between the groups were taken as sensitive features to epileptogenic hemisphere lateral. The optimized entropy values of these bio-marker brain areas were considered as feature vectors input into the support vector machine (SVM). Finally, combining optimized MSE model with SVM could accurately distinguish epileptogenic hemisphere in TLE at an average accuracy rate of 95%, which was higher than the current level. The results show that the MSE model parameter optimization algorithm can accurately extract the functional imaging markers sensitive to the epileptogenic hemisphere, and achieve the purpose of objectively selecting the parameters for MSE in rfMRI, which provides the basis for the application of entropy in advanced technology detection.
Lung cancer is one of the tumors with the highest incidence rate and mortality rate in the world. It is also the malignant tumor with the fastest growing number of patients, which seriously threatens human life. How to improve the accuracy of diagnosis and treatment of lung cancer and the survival prognosis is particularly important. Machine learning is a multi-disciplinary interdisciplinary specialty, covering the knowledge of probability theory, statistics, approximate theory and complex algorithm. It uses computer as a tool and is committed to simulating human learning methods, and divides the existing content into knowledge structures to effectively improve learning efficiency and being able to integrate computer science and statistics into medical problems. Through the introduction of algorithm to absorb the input data, and the application of computer analysis to predict the output value within the acceptable accuracy range, identify the patterns and trends in the data, and finally learn from previous experience, the development of this technology brings a new direction for the diagnosis and treatment of lung cancer. This article will review the performance and application prospects of different types of machine learning algorithms in the clinical diagnosis and survival prognosis analysis of lung cancer.
This study is aimed to investigate objective indicators of mental fatigue evaluation to improve the accuracy of mental fatigue evaluation. Mental fatigue was induced by a sustained cognitive task. The brain functional networks in two states (normal state and mental fatigue state) were constructed based on electroencephalogram (EEG) data. This study used complex network theory to calculate and analyze nodal characteristics parameters (degree, betweenness centrality, clustering coefficient and average path length of node), and served them as the classification features of support vector machine (SVM). Parameters of the SVM model were optimized by gird search based on 6-fold cross validation. Then, the subjects were classified. The results show that characteristic parameters of node of brain function networks can be divided into normal state and mental fatigue state, which can be used in the objective evaluation of mental fatigue state.
Autoimmune pancreatitis (AIP) is a unique subtype of chronic pancreatitis, which shares many clinical presentations with pancreatic ductal adenocarcinoma (PDA). The misdiagnosis of AIP often leads to unnecessary pancreatic resection. 18F-FDG positron emission tomography/ computed tomography (PET/CT) could provide comprehensive information on the morphology, density, and functional metabolism of the pancreas at the same time. It has been proved to be a promising modality for noninvasive differentiation between AIP and PDA. However, there is a lack of clinical analysis of PET/CT image texture features. Difficulty still remains in differentiating AIP and PDA based on commonly used diagnostic methods. Therefore, this paper studied the differentiation of AIP and PDA based on multi-modality texture features. We utilized multiple feature extraction algorithms to extract the texture features from CT and PET images at first. Then, the Fisher criterion and sequence forward floating selection algorithm (SFFS) combined with support vector machine (SVM) was employed to select the optimal multi-modality feature subset. Finally, the SVM classifier was used to differentiate AIP from PDA. The results prove that texture analysis of lesions helps to achieve accurate differentiation of AIP and PDA.
The automatic detection of arrhythmia is of great significance for the early prevention and diagnosis of cardiovascular diseases. Traditional arrhythmia diagnosis is limited by expert knowledge and complex algorithms, and lacks multi-dimensional feature representation capabilities, which is not suitable for wearable electrocardiogram (ECG) monitoring equipment. This study proposed a feature extraction method based on autoregressive moving average (ARMA) model fitting. Different types of heartbeats were used as model inputs, and the characteristic of fast and smooth signal was used to select the appropriate order for the arrhythmia signal to perform coefficient fitting, and complete the ECG feature extraction. The feature vectors were input to the support vector machine (SVM) classifier and K-nearest neighbor classifier (KNN) for automatic ECG classification. MIT-BIH arrhythmia database and MIT-BIH atrial fibrillation database were used to verify in the experiment. The experimental results showed that the feature engineering composed of the fitting coefficients of the ARMA model combined with the SVM classifier obtained a recall rate of 98.2% and a precision rate of 98.4%, and the F1 index was 98.3%. The algorithm has high performance, meets the needs of clinical diagnosis, and has low algorithm complexity. It can use low-power embedded processors for real-time calculations, and it’s suitable for real-time warning of wearable ECG monitoring equipment.
Objective To explore the white matter microstructural abnormalities in patients with different subtypes of attention-deficit/hyperactivity disorder (ADHD) and establish a diagnostic classification model. Methods Patients with ADHD admitted to West China Hospital of Sichuan University between January 2019 and September 2021 and healthy controls recruited through advertisement were prospectively selected. All participants underwent diffusion tensor imaging scanning. The whole brain voxel-based analysis was used to compare the diffusion parameter maps of fractional anisotropy (FA) among patients with combined subtype of ADHD (ADHD-C), patients with inattentive subtype of ADHD (ADHD-I) and healthy controls. The support vector machine classifier and feature selection method were used to construct the individual ADHD diagnostic classification model and efficiency was evaluated between each two groups of the ADHD patients and healthy controls. Results A total of 26 ADHD-C patients, 24 ADHD-I patients and 26 healthy controls were included. The three groups showed significant differences in FA values in the bilateral sagittal stratum of temporal lobe (ADHD-C<ADHD-I<healthy controls) and the isthmus of corpus callosum (ADHD-C>ADHD-I>healthy controls) (P<0.005). The direct comparison between the two subtypes of ADHD showed that ADHD-C had higher FA than ADHD-I in the right middle frontal gyrus. The classification model differentiating ADHD-C and ADHD-I showed the highest efficiency, with a total accuracy of 76.0%, sensitivity of 88.5%, and specificity of 70.8%. Conclusions There is both commonality and heterogeneity in white matter microstructural alterations in the two subtypes of patients with ADHD. The white matter damage of the sagittal stratum of temporal lobe and the corpus callosum may be the intrinsic pathophysiological basis of ADHD, while the anomalies of frontal brain region may be the differential point between different subtypes of patients.