Page 92 - The-5th-MCAIT2021-eProceeding
P. 92

3.1. Machine Learning

           According to the literature, SVM and CNN are widely used in CVDs diagnosis using ECGs data. Both SVM
        and CNN are considered powerful algorithms, and they achieved very high accuracy rates compared with other
        methods (Ahmad et al., 2014). The main reason for SVM significant performance is the feature dimensionality
        independency which makes it immune from the “curse of dimensionality”.  SVM works well with a clear margin
        of separation between classes. The hyperplane is affected by only the support vectors. Thus, outliers have less
        impact, which makes it particularly efficient for classifying complex but small or medium-sized datasets (Li,
        Bhanu, & Krawiec).
           Furthermore, different SVM classifiers can be constructed using different kernels (polynomial, RBF, linear)
        to solve leaner or non-leaner problems. However, selecting an appropriate kernel for a. given problem is still
        under research. Besides, SVM does not work perfectly in a noisy or large dataset, and it takes a long time in
        training the model (Bisong, 2019). In contrast, CNN can work perfectly in a noisy and large dataset because of
        its capability of finding optimal temporal features better than other machine learning algorithms.
        CNN can learn complex features by preserving the spatial relationship between the feature (Bisong, 2019).
        Reducing the number of weights needed for training the network is another significant advantage of CNN.
        However, both SVM and CNN are High computational costs and time and memory consuming (Alizadehsani,
        Roshanzamir, et al., 2019).
           A small number of databases are available in the public domain, which explicitly studied in most previous
        works.  Further,  most  of  these  public  databases  have  a  small  sample  size  and  unbalanced  data.  MIT-BIH,
        ZAlizadeh  Sani,  Cleveland,  and  Hungarian  datasets  are  the  most  popular  public  dataset.  The  main  reason
        traditional machine learning algorithms are significantly well performed is that most of the sample size of these
        public  domain  datasets,  accept  MIT-BIH,  is  between  303  and  below  with  few  features.  As  seen  from  the
        literature, a feature selection method has been used in most structured data works, and it has a significant impact
        on model performance.
           In signal data, a combination of the Fantasia dataset and St.-Petersburg dataset are widely used. The ECG
        signals were retrieved from Fantasia (for Normal) and St.-Petersburg Institute (for CAD).  However, most
        datasets in this field are either very small in size or inaccessible to the public. One reason traditional ML has
        worked sufficiently well in previous years in using ECGs to diagnose cardiology is that experts are carefully
        designed feature extraction methods. Also, handcrafted features such as statistical measures from the ECG beats
        and the RR interval positively influence traditional ML’s performance. Recently, using DL to diagnose CVDs
        from  ECG  signals  has  gained  much  interest  due  to  its  simplicity  and  reduced  dimensionality  compared  to
        imaging data (Strodthoff, Wagner, Schaeffter, & Samek, 2020).

        4. Summary

           Signal data and especially ECG signals have gained a lot of interest in the research community with respect
        to the other data type. ECG is a widely available and great tool used to diagnose CVDs with high efficiency
        from a medical aspect. However, it is noisy sensitive and can be misleadingly interpreted by doctors. ML and
        DL  incredibly  proved  their  ability  to  insight  hiding  information  and  provide  simplicity  in  handling  such
        problems.  CNN  algorithms  particularly  have  achieved  high  efficiency  and  outstanding  performance  in
        diagnosing  CVDs  using  ECG  signals.  This  survey  introduced  the  ECG  structure  and  components,  and  the
        diseases that can aid to diagnose. The related state-of-art works used to detect CVDs using ECG signals is
        discussed as well as the widely used ML algorithms, feature selections, and datasets.

        Acknowledgment

             The authors acknowledge the support of this research by the Yayasan Universiti Teknologi PETRONAS
        Fundamental Research Grant (YUTP-FRG) under Grant 015LC0-244





        E- Proceedings of The 5th International Multi-Conference on Artificial Intelligence Technology (MCAIT 2021)   [79]
        Artificial Intelligence in the 4th Industrial Revolution
   87   88   89   90   91   92   93   94   95   96   97