Page 47 - Read Online
P. 47

Qi et al. Intell Robot 2021;1(1):18-57  I http://dx.doi.org/10.20517/ir.2021.02      Page 42


               framework is presented for edge system [88] , named as “In-Edge AI”, to address optimization of mobile edge
               computing, caching, and communication problems. The authors also propose some ideas and paradigms for
               solving these problems by using DRL and Distributed DRL. To carry out dynamic system-level optimization
               and reduce the unnecessary transmission load, “In-Edge AI” framework takes advantage of the collaboration
               among edge nodes to exchange learning parameters for better training and inference of models. It has been
               evaluated that the framework has high performance and relatively low learning overhead, while the mobile
               communication system is cognitive and adaptive to the environment. The paper provides good prospects
               for the application of FRL to edge computing, but there are still many challenges to overcome, including the
               adaptive improvement of the algorithm, and the training time of the model from scratch etc.

               Edge caching has been considered a promising technique for edge computing to meet the growing demands
               for next-generation mobile networks and beyond. Addressing the adaptability and collaboration challenges of
               the dynamic network environment, Wang et al. [89]  proposes a device-to-device (D2D)-assisted heterogeneous
               collaborative edge caching framework. User equipment (UE) in a mobile network uses the local DQN model
               to make node selection and cache replacement decisions based on network status and historical information.
               In other words, UE decides where to fetch content and which content should be replaced in its cache list. The
               BS calculates aggregation weights based on the training evaluation indicators from UE. To solve the long-term
               mixed-integer linear programming problem, the attention-weighted federated deep reinforcement learning
               (AWFDRL) is presented, which optimizes the aggregation weights to avoid the imbalance of the local model
               quality and improve the learning efficiency of the DQN. The convergence of the proposed algorithm is verified
               and simulation results show that the AWFDRL framework can perform well on average delay, hit rate, and
               offload traffic.

               A federated solution for cooperative edge caching management in fog radio access networks (F-RANs) is pro-
               posed [90] . Both edge computing and fog computing involve bringing intelligence and processing to the origins
               of data. The key difference between the two architectures is where the computing node is positioned. A du-
               eling deep Q-network based cooperative edge caching method is proposed to overcome the dimensionality
               curse of RL problem and improve caching performance. Agents are developed in fog access points (F-APs)
               and allowed to build a local caching model for optimal caching decisions based on the user content request
               and the popularity of content. HFRL is applied to aggregate the local models into a global model in the cloud
               server. The proposed method outperforms three classical content caching methods and two RL algorithms in
               terms of reducing content request delays and increasing cache hit rates.

               For edge-enabled IoT, Majidi et al. [91]  proposes a dynamic cooperative caching method based on hierarchical
               federated deep reinforcement learning (HFDRL), which is used to determine which content should be cached
               or evicted by predicting future user requests. Edge devices that have a strong relationship are grouped into
               a cluster and one head is selected for this cluster. The BS trains the Q-value based local model by using BS
               states, content states, and request states. The head has enough processing and caching capabilities to deal with
               model aggregation in the cluster. By categorizing edge devices hierarchically, HFDRL improves the response
               time delay to keeps both small and large clusters from experiencing the disadvantages they could encounter.
               Storage partitioning allows content to be stored in clusters at different levels using the storage space of each
               device. The simulation results show the proposed method using MovieLens datasets improves the average
               content access delay and the hit rate.

               Considering the low latency requirements and privacy protection issue of IoV, the study of efficient and secure
               caching methods has attracted many researchers. An FRL-empowered task caching problem with IoV has
               been analyzed by Zhao et al. [92] . The work proposes a novel cooperative caching algorithm (CoCaRL) for
               vehicular networks with multi-level FRL to dynamically determine which contents should be replaced and
               where the content requests should be served. This paper develops a two-level aggregation mechanism for
   42   43   44   45   46   47   48   49   50   51   52