Page 75 - Read Online
P. 75

Yang et al. Intell Robot 2024;4(4):406-21  I http://dx.doi.org/10.20517/ir.2024.24  Page 418

                                                    Table 3. Confusion matrix
                                                      Actual value (positive)  Actual value (negative)
                                   Predicted value (positive)  TP             FP
                                   Predicted value (negative)  FN             TN
                                    TP: True positive; FP: false positive; FN: false negative; TN: true
                                    negative.


                                                   Table 4. Model rating index
                                           Model        Accuracy  Precision  Recall  F1-score
                                    1-layer convolutional model  0.5493  0.5591  0.4667  0.5087
                                       Model from ref.  [20]  0.8747  0.8482  0.9127  0.8793
                                          VNN-DM         0.8950  0.8741  0.9207  0.8968
                                            MLP          0.8153  0.7986  0.8433  0.8204
                                           SVM           0.7740  0.7553  0.8107  0.7820
                                           KNN           0.6487  0.6345  0.7013  0.6662
                                     VNN-DM: Vector neural network-based detection model;
                                      MLP: multi-layer perceptron; SVM: support vector machine;
                                      KNN: K-nearest neighbor.


               several evaluation metrics can be calculated, including accuracy, precision, recall, and F1-score, as well as the
               receiver operating characteristic (ROC) curve and the area under the curve (AUC), to quantify the model’s
               performance. These metrics are defined as follows:



                                                                   +     
                                                               =                                       (34)
                                                              +      +      +     
                                                                     
                                                                      =                                (35)
                                                                   +     
                                                                   
                                                                 =                                     (36)
                                                                  +     
                                                        2 ×                    ×             
                                                1−               =                                     (37)
                                                                             +             

               Accuracy reflects the overall correctness of the model’s predictions. Precision evaluates the proportion of
               correctly predicted positive samples among all samples predicted as positive. Recall measures the model’s
               ability to correctly identify positive samples among all actual positive samples. The F1-score is used to assess
               the model’s performance on imbalanced datasets and is the harmonic mean of Precision and Recall.


               To validate the effectiveness of the VNN-DM detection model in TSA detection tasks, two sets of experiments
               were designed to compare the performance of different models on the same test dataset. The first set of experi-
               ments analyzed the detection capability differences between a single-layer convolutional model, the reference
               model, and the proposed VNN-DM model, aiming to verify the performance improvements in feature extrac-
               tion and classification achieved by combining a multi-layer convolutional network structure with a capsule
               network. The second set of experiments compared traditional machine learning models, including support
               vector machine (SVM), K-nearest neighbor (KNN), and MLP, to further assess the performance improve-
               ments of the VNN-DM model over traditional models. The evaluation metrics for all models in both sets of
               experiments are shown in Table 4.


               Additionally, to further highlight the classification performance of the models, ROC curves were plotted for
               both sets of experiments. The ROC curve illustrates the model’s discriminative ability by displaying the rela-
               tionship between the true positive rate (TPR) and the false positive rate (FPR). To quantify the overall perfor-
   70   71   72   73   74   75   76   77   78   79   80