|
|
|
|
LEADER |
04017nam a22006135i 4500 |
001 |
978-3-540-89722-4 |
003 |
DE-He213 |
005 |
20151204150445.0 |
007 |
cr nn 008mamaa |
008 |
100301s2008 gw | s |||| 0|eng d |
020 |
|
|
|a 9783540897224
|9 978-3-540-89722-4
|
024 |
7 |
|
|a 10.1007/978-3-540-89722-4
|2 doi
|
040 |
|
|
|d GrThAP
|
050 |
|
4 |
|a Q334-342
|
050 |
|
4 |
|a TJ210.2-211.495
|
072 |
|
7 |
|a UYQ
|2 bicssc
|
072 |
|
7 |
|a TJFM1
|2 bicssc
|
072 |
|
7 |
|a COM004000
|2 bisacsh
|
082 |
0 |
4 |
|a 006.3
|2 23
|
245 |
1 |
0 |
|a Recent Advances in Reinforcement Learning
|h [electronic resource] :
|b 8th European Workshop, EWRL 2008, Villeneuve d’Ascq, France, June 30-July 3, 2008, Revised and Selected Papers /
|c edited by Sertan Girgin, Manuel Loth, Rémi Munos, Philippe Preux, Daniil Ryabko.
|
264 |
|
1 |
|a Berlin, Heidelberg :
|b Springer Berlin Heidelberg,
|c 2008.
|
300 |
|
|
|a XII, 283 p.
|b online resource.
|
336 |
|
|
|a text
|b txt
|2 rdacontent
|
337 |
|
|
|a computer
|b c
|2 rdamedia
|
338 |
|
|
|a online resource
|b cr
|2 rdacarrier
|
347 |
|
|
|a text file
|b PDF
|2 rda
|
490 |
1 |
|
|a Lecture Notes in Computer Science,
|x 0302-9743 ;
|v 5323
|
505 |
0 |
|
|a Lazy Planning under Uncertainty by Optimizing Decisions on an Ensemble of Incomplete Disturbance Trees -- Exploiting Additive Structure in Factored MDPs for Reinforcement Learning -- Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration -- Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case -- Regularized Fitted Q-Iteration: Application to Planning -- A Near Optimal Policy for Channel Allocation in Cognitive Radio -- Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets -- Bayesian Reward Filtering -- Basis Expansion in Natural Actor Critic Methods -- Reinforcement Learning with the Use of Costly Features -- Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car Problem -- Optimistic Planning of Deterministic Systems -- Policy Iteration for Learning an Exercise Policy for American Options -- Tile Coding Based on Hyperplane Tiles -- Use of Reinforcement Learning in Two Real Applications -- Applications of Reinforcement Learning to Structured Prediction -- Policy Learning – A Unified Perspective with Applications in Robotics -- Probabilistic Inference for Fast Learning in Control -- United We Stand: Population Based Methods for Solving Unknown POMDPs -- New Error Bounds for Approximations from Projected Linear Equations -- Markov Decision Processes with Arbitrary Reward Processes.
|
520 |
|
|
|a This book constitutes revised and selected papers of the 8th European Workshop on Reinforcement Learning, EWRL 2008, which took place in Villeneuve d'Ascq, France, during June 30 - July 3, 2008. The 21 papers presented were carefully reviewed and selected from 61 submissions. They are dedicated to the field of and current researches in reinforcement learning.
|
650 |
|
0 |
|a Computer science.
|
650 |
|
0 |
|a Computer programming.
|
650 |
|
0 |
|a Computers.
|
650 |
|
0 |
|a Database management.
|
650 |
|
0 |
|a Artificial intelligence.
|
650 |
1 |
4 |
|a Computer Science.
|
650 |
2 |
4 |
|a Artificial Intelligence (incl. Robotics).
|
650 |
2 |
4 |
|a Programming Techniques.
|
650 |
2 |
4 |
|a Theory of Computation.
|
650 |
2 |
4 |
|a Computation by Abstract Devices.
|
650 |
2 |
4 |
|a Information Systems Applications (incl. Internet).
|
650 |
2 |
4 |
|a Database Management.
|
700 |
1 |
|
|a Girgin, Sertan.
|e editor.
|
700 |
1 |
|
|a Loth, Manuel.
|e editor.
|
700 |
1 |
|
|a Munos, Rémi.
|e editor.
|
700 |
1 |
|
|a Preux, Philippe.
|e editor.
|
700 |
1 |
|
|a Ryabko, Daniil.
|e editor.
|
710 |
2 |
|
|a SpringerLink (Online service)
|
773 |
0 |
|
|t Springer eBooks
|
776 |
0 |
8 |
|i Printed edition:
|z 9783540897217
|
830 |
|
0 |
|a Lecture Notes in Computer Science,
|x 0302-9743 ;
|v 5323
|
856 |
4 |
0 |
|u http://dx.doi.org/10.1007/978-3-540-89722-4
|z Full Text via HEAL-Link
|
912 |
|
|
|a ZDB-2-SCS
|
912 |
|
|
|a ZDB-2-LNC
|
950 |
|
|
|a Computer Science (Springer-11645)
|