Page 71 - Read Online
P. 71

Harib et al. Intell Robot 2022;2(1):37-71  https://dx.doi.org/10.20517/ir.2021.19       Page 65

               As RL/DRL researchers, we should take a step back and concentrate on the basics. By concentrating on the
               basics, we imply concentrating on simple, analyzable domains from which we may draw useful conclusions
               about the algorithms. Above all, areas in which we know what the best possible reward is. We hope that our
               survey helps the nonlinear dynamic control community in general, and the robotics community in
               particular, to quickly learn about this topic and become closely familiar with the current work being done
               and what work remains to be done. We also hope to assist researchers in deriving some conclusions from
               work carried out so far and provide them with new avenues for future research.


               DECLARATIONS
               Authors’ contributions
               Made substantial contributions to the conception and design of the article and interpreting the relevant
               literature: Harib M
               Performed oversight and leadership responsibility for the activity planning and execution, as well as
               developed ideas and evolution of overarching aims: Chaoui H
               Performed critical review, commentary and revision, as well as provided administrative, technical, and
               material support: Chaoui H, Miah S


               Availability of data and materials
               Not applicable.


               Financial support and sponsorship
               None.


               Conflicts of interest
               All authors declared that there are no conflicts of interest.


               Ethical approval and consent to participate
               Not applicable.


               Consent for publication
               Not applicable.

               Copyright
               © The Author(s) 2022.

               REFERENCES
               1.       Aseltine J, Mancini A, Sarture C. A survey of adaptive control systems. IRE Trans Automat Contr 1958;6:102-8.  DOI
               2.       Stromer PR. Adaptive or self-optimizing control systems - a bibliography. IRE Trans Automat Contr 1959;AC-4:65-8.  DOI
               3.       Mishkin E, Ludwig BJ. Adaptive control systems. 1st ed. New York: McGraw-Hill; 1961.
               4.       Truxal JG. Adaptive control. IFAC Proceedings Volumes 1963;1:386-92.  DOI
               5.       Eveleigh VW. Adaptive control and optimization technique. 1st ed. New York: McGraw-Hill; 1967.
               6.       Wittenmark B. Stochastic adaptive control methods: a survey. Int J Control 1975;21:705-30.  DOI
               7.       Åström K, Borisson U, Ljung L, Wittenmark B. Theory and applications of self-tuning regulators. Automatica 1977;13:457-76.  DOI
               8.       Åström K. Theory and applications of adaptive control - a survey. Automatica 1983;19:471-86.  DOI
               9.       Jamali H. Adaptive control methods for mechanical manipulators: a comparative study. Monterey, CA: Naval Postgraduate School;
                    1989.
               10.       Mathelin MD, Lozano R. Robust adaptive identification of slowly time-varying parameters with bounded disturbances. Automatica
                    1999;35:1291-305.  DOI
               11.       Deisenroth MP, Rasmussen CE. PILCO: a model-based and data-efficient approach to policy search. Proceedings of the 28th
                    International Conference on International Conference on Machine Learning; 2011 Jun; Madison, WI, USA. 2011. p. 465-72.
               12.       Wang LY, Zhang JF. Fundamental limitations and differences of robust and adaptive control. Proceedings of the 2001 American
   66   67   68   69   70   71   72   73   74   75   76