Page 44 - Read Online
P. 44

Harib et al. Intell Robot 2022;2(1):37-71  https://dx.doi.org/10.20517/ir.2021.19                                                              Page 39


























                                         Figure 1. Simple schematic of a two-link robot manipulator.

               is the PILCO approach .
                                  [11]

               Artificial Neural Networks (ANNs) have been explored for a long time in the hopes of obtaining human-
               like performance in speech and image processing. Several types of Neural Networks (NNs) appear to be
               promising candidates for control system applications. Multilayer NNs (MLNs), recurrent NNs (RNNs), and
               the cerebellar model articulation controller (CMAC) are examples of these. The decision of the NN to
               employ and which training technique to utilize is crucial, and it changes according to the application. The
               type of NNs most commonly used in control systems is the feedforward MLNs, where no information is fed
               back during operation. There is, however, feedback information available during training. Typically,
               supervised learning methods, where the neural network is trained to learn input-output patterns presented
               to it, are used. Most often, versions of the backpropagation (BP) algorithm are used to adjust the NN’s
               weights during training. The feedforward MLNs are the most often employed NNs in control systems since
               no information is fed back during operation. During training, however, there is feedback information
               accessible. In most cases, supervised learning methods are utilized, in which the NN is taught to learn input-
               output patterns that are provided to it. During training, variants of the BP algorithm are frequently
               employed to change the NN weights. More details about NNs, for dynamical systems in general and for
               robotics in particular, are discussed in section 3 of this work.


               In robotics, it all boils down to making the actuator perform the desired action. The basics of control
               systems tell us that the transfer function decides the relationship between the output and the input given the
               system or plant. While purely control-based robots use the system model to define their input-output
               relations, AI-based robots may or may not use the system model and rather manipulate the robot based on
               the experience they have with the system while training or possibly enhance it in real-time as well.

               Reinforcement learning (RL) is a type of experience-based learning that may be used in robotics when on-
               line learning without knowledge of the environment is necessary. The controller may learn which of its
               possible actions will result in the greatest performance for a particular job for each of its states. If the mobile
               robot collides with an obstacle, for example, it will learn that this is a poor action, but if it achieves the
               objective, it will learn that this is a positive activity. Reinforcement or reward is the term for such contextual
               feedback. The goal of the controller is to maximize its predicted future rewards for state-action pairings,
   39   40   41   42   43   44   45   46   47   48   49