Page 19 - Read Online
P. 19

Ji et al. Intell Robot 2021;1(2):151-75  https://dx.doi.org/10.20517/ir.2021.14     Page 163






















                                           Figure 7. Single Shot multibox Detector architecture.

               • PSPNet utilizes a pyramid parsing module to exploit global context information by aggregating different
                                  [59]
               region-based contexts . A pre-trained CNN with the dilated network strategy is used to extract the feature
               map, on top of which the pyramid pooling module gathers context information. The final feature map size
               is one-eighth of the input image. Using a four-level pyramid, the pooling kernels cover the whole, half of,
               and small portions of the image. They are fused as the global prior, which is then concatenated with the
               original feature map in the final part. It is followed by a convolution layer to generate the final prediction
               map. The local and global clues together make the final prediction more reliable.


               3. REVIEW OF RAIL TRACK CONDITION MONITORING WITH DEEP LEARNING
               The authors systematically searched published peer-reviewed journal articles and papers found in Google
               Scholar. Combinations of keywords such as “rail”, “surface”, “rail track”, “defect”, and “deep learning” were
               used as search keys to find research works published in the application of deep learning techniques to rail
               track condition monitoring and anomaly detection and classification. The review covers work from 2013 to
               2021. In total, we identified 62 relevant research publications to review.


               The trend over time: a clear increasing trend can be observed of the popularity of deep learning approaches
               in rail track condition monitoring applications. Table 1 summarizes the findings. The number of papers
               surged in 2018. Before 2018, machine learning techniques other than deep learning approaches were more
               widely adopted. The rail industries are adopting deep learning methods with growing interests. An upwards
               trend of publication number is observed. There is also a gap of a few years from the invention of a deep
               learning model to its adoption by the rail industry.


               Regions of study: fourteen regions are represented by the papers identified. Among them, China has the
               highest number of papers, which indicates the popularity of rail-related research work corresponding to the
               expanding rail networks across the country. Papers from China surged in 2018 and kept a high number in
               the following years. Table 2 summarizes the distribution of papers over regions.

               Raw data type: it is observed that 70% of studies used image-type raw data for the deep learning models.
               Nevertheless, acoustic emission signals [65,71,100,103,108] , defectogram [96,109] , speed accelerations , concatenated
                                                                                           [98]
                                        [101]
               vector of curve and numbers , current signal , maintenance records [80,99] , synthetic data from generative
                                                       [89]
                                                                   [60]
                                                                                                       [119]
                                                                                   [87]
               model , time-frequency measurement data , time-series , geometry data , and vibration signal
                    [63]
                                                       [82]
               could all be possible input data sources as well.
   14   15   16   17   18   19   20   21   22   23   24