Reinforcement Learning State-of-the-Art /

Reinforcement learning encompasses both a science of adaptive behavior of rational beings in uncertain environments and a computational methodology for finding optimal behaviors for challenging problems in control, optimization and adaptive behavior of intelligent agents. As a field, reinforcement l...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Συγγραφή απο Οργανισμό/Αρχή: SpringerLink (Online service)
Άλλοι συγγραφείς: Wiering, Marco (Επιμελητής έκδοσης), Otterlo, Martijn van (Επιμελητής έκδοσης)
Μορφή: Ηλεκτρονική πηγή Ηλ. βιβλίο
Γλώσσα:English
Έκδοση: Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2012.
Σειρά:Adaptation, Learning, and Optimization, 12
Θέματα:
Διαθέσιμο Online:Full Text via HEAL-Link
LEADER 03873nam a22004695i 4500
001 978-3-642-27645-3
003 DE-He213
005 20151125201532.0
007 cr nn 008mamaa
008 120305s2012 gw | s |||| 0|eng d
020 |a 9783642276453  |9 978-3-642-27645-3 
024 7 |a 10.1007/978-3-642-27645-3  |2 doi 
040 |d GrThAP 
050 4 |a Q342 
072 7 |a UYQ  |2 bicssc 
072 7 |a COM004000  |2 bisacsh 
082 0 4 |a 006.3  |2 23 
245 1 0 |a Reinforcement Learning  |h [electronic resource] :  |b State-of-the-Art /  |c edited by Marco Wiering, Martijn van Otterlo. 
264 1 |a Berlin, Heidelberg :  |b Springer Berlin Heidelberg :  |b Imprint: Springer,  |c 2012. 
300 |a XXXIV, 638 p.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
490 1 |a Adaptation, Learning, and Optimization,  |x 1867-4534 ;  |v 12 
505 0 |a Continous State and Action Spaces -- Relational and First-Order Knowledge Representation -- Hierarchical Approaches -- Predictive Approaches -- Multi-Agent Reinforcement Learning -- Partially Observable Markov Decision Processes (POMDPs) -- Decentralized POMDPs (DEC-POMDPs) -- Features and Function Approximation -- RL as Supervised Learning (or batch learning) -- Bounds and complexity -- RL for Games -- RL in Robotics -- Policy Gradient Techniques -- Least Squares Value Iteration -- Models and Model Induction -- Model-based RL -- Transfer Learning in RL -- Using of and extracting Knowledge in RL -- Biological or Psychological Background -- Evolutionary Approaches -- Closing chapter, prospects, future issues. 
520 |a Reinforcement learning encompasses both a science of adaptive behavior of rational beings in uncertain environments and a computational methodology for finding optimal behaviors for challenging problems in control, optimization and adaptive behavior of intelligent agents. As a field, reinforcement learning has progressed tremendously in the past decade. The main goal of this book is to present an up-to-date series of survey articles on the main contemporary sub-fields of reinforcement learning. This includes surveys on partially observable environments, hierarchical task decompositions, relational knowledge representation and predictive state representations. Furthermore, topics such as transfer, evolutionary methods and continuous spaces in reinforcement learning are surveyed. In addition, several chapters review reinforcement learning methods in robotics, in games, and in computational neuroscience. In total seventeen different subfields are presented by mostly young experts in those areas, and together they truly represent a state-of-the-art of current reinforcement learning research. Marco Wiering works at the artificial intelligence department of the University of Groningen in the Netherlands. He has published extensively on various reinforcement learning topics. Martijn van Otterlo works in the cognitive artificial intelligence group at the Radboud University Nijmegen in The Netherlands. He has mainly focused on expressive knowledge representation in reinforcement learning settings. 
650 0 |a Engineering. 
650 0 |a Artificial intelligence. 
650 0 |a Computational intelligence. 
650 1 4 |a Engineering. 
650 2 4 |a Computational Intelligence. 
650 2 4 |a Artificial Intelligence (incl. Robotics). 
700 1 |a Wiering, Marco.  |e editor. 
700 1 |a Otterlo, Martijn van.  |e editor. 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer eBooks 
776 0 8 |i Printed edition:  |z 9783642276446 
830 0 |a Adaptation, Learning, and Optimization,  |x 1867-4534 ;  |v 12 
856 4 0 |u http://dx.doi.org/10.1007/978-3-642-27645-3  |z Full Text via HEAL-Link 
912 |a ZDB-2-ENG 
950 |a Engineering (Springer-11647)