HOME > 상세정보

상세정보

Reinforcement learning

Reinforcement learning (13회 대출)

자료유형
단행본
개인저자
Sutton, Richard S.
서명 / 저자사항
Reinforcement learning / edited by Richard S. Sutton.
발행사항
Boston :   Kluwer Academic Publishers ,   c1992.  
형태사항
171 p. : ill. ; 25 cm.
총서사항
The Kluwer international series in engineering and computer science ; Knowledge representation, learning, and expert systems.SECS 173.
ISBN
0792392345
일반주기
"A special issue of Machine learning ... reprinted from ... vol. 8, nos. 3-4 (1992)."  
서지주기
Includes bibliographical references and index.
일반주제명
Reinforcement learning (Machine learning).
000 00948camuuu200253 a 4500
001 000000022846
005 19980601110615.0
008 920130s1992 maua b 001 0 eng
010 ▼a 92007567
020 ▼a 0792392345
040 ▼a DLC ▼c DLC ▼d DLC
049 1 ▼l 111023408
050 0 0 ▼a Q325.6 ▼b .R45 1992
082 0 0 ▼a 006.3/1 ▼2 20
090 ▼a 006.3 ▼b R367
245 0 0 ▼a Reinforcement learning / ▼c edited by Richard S. Sutton.
260 ▼a Boston : ▼b Kluwer Academic Publishers , ▼c c1992.
300 ▼a 171 p. : ▼b ill. ; ▼c 25 cm.
440 4 ▼a The Kluwer international series in engineering and computer science ; ▼v SECS 173. ▼a Knowledge representation, learning, and expert systems.
500 ▼a "A special issue of Machine learning ... reprinted from ... vol. 8, nos. 3-4 (1992)."
504 ▼a Includes bibliographical references and index.
650 0 ▼a Reinforcement learning (Machine learning).
700 1 0 ▼a Sutton, Richard S.

No. 소장처 청구기호 등록번호 도서상태 반납예정일 예약 서비스
No. 1 소장처 중앙도서관/서고6층/ 청구기호 006.3 R367 등록번호 111023408 (11회 대출) 도서상태 대출가능 반납예정일 예약 서비스 B M
No. 2 소장처 세종학술정보원/과학기술실(5층)/ 청구기호 006.3 R367 등록번호 151001746 (2회 대출) 도서상태 대출가능 반납예정일 예약 서비스 B M ?
No. 소장처 청구기호 등록번호 도서상태 반납예정일 예약 서비스
No. 1 소장처 중앙도서관/서고6층/ 청구기호 006.3 R367 등록번호 111023408 (11회 대출) 도서상태 대출가능 반납예정일 예약 서비스 B M
No. 소장처 청구기호 등록번호 도서상태 반납예정일 예약 서비스
No. 1 소장처 세종학술정보원/과학기술실(5층)/ 청구기호 006.3 R367 등록번호 151001746 (2회 대출) 도서상태 대출가능 반납예정일 예약 서비스 B M ?

컨텐츠정보

책소개

Reinforcement learning is the learning of a mapping from situations to actions so as to maximize a scalar reward or reinforcement signal. The learner is not told which action to take, as in most forms of machine learning, but instead must discover which actions yield the highest reward by trying them. In the most interesting and challenging cases, actions may affect not only the immediate reward, but also the next situation, and through that all subsequent rewards. These two characteristics -- trial-and-error search and delayed reward -- are the most important distinguishing features of reinforcement learning.
Reinforcement learning is both a new and a very old topic in AI. The term appears to have been coined by Minsk (1961), and independently in control theory by Walz and Fu (1965). The earliest machine learning research now viewed as directly relevant was Samuel's (1959) checker player, which used temporal-difference learning to manage delayed reward much as it is used today. Of course learning and reinforcement have been studied in psychology for almost a century, and that work has had a very strong impact on the AI/engineering work. One could in fact consider all of reinforcement learning to be simply the reverse engineering of certain psychological learning processes (e.g. operant conditioning and secondary reinforcement).
Reinforcement Learning is an edited volume of original research, comprising seven invited contributions by leading researchers.


Reinforcement learning is the learning of a mapping from situations to actions so as to maximize a scalar reward or reinforcement signal. The learner is not told which action to take, as in most forms of machine learning, but instead must discover which actions yield the highest reward by trying them. In the most interesting and challenging cases, actions may affect not only the immediate reward, but also the next situation, and through that all subsequent rewards. These two characteristics -- trial-and-error search and delayed reward -- are the most important distinguishing features of reinforcement learning.
Reinforcement learning is both a new and a very old topic in AI. The term appears to have been coined by Minsk (1961), and independently in control theory by Walz and Fu (1965). The earliest machine learning research now viewed as directly relevant was Samuel's (1959) checker player, which used temporal-difference learning to manage delayed reward much as it is used today. Of course learning and reinforcement have been studied in psychology for almost a century, and that work has had a very strong impact on the AI/engineering work. One could in fact consider all of reinforcement learning to be simply the reverse engineering of certain psychological learning processes (e.g. operant conditioning and secondary reinforcement).
Reinforcement Learning is an edited volume of original research, comprising seven invited contributions by leading researchers.



정보제공 : Aladin

목차

Introduction; R.Sutton. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning; R.J. Wiiliams. Practical Issues in Temporal Difference Learning; G. Teasauro. Technical Note: Q-Learning; C.J.C.H. Watkins, P. Dayan. Self Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching; L.-J. Lin. Transfer of Learning by Composing Solutions of Elemental Sequential Tasks; S.P. Singh. The Convergence of TD (lambda) for general lambda; P. Dayan. A Reinforcement Connctionist Approach to Robot Path Finding in Non-Maze-Like Environments; J. del R. Millan, C. Torras.


정보제공 : Aladin

관련분야 신착자료

Negro, Alessandro (2026)
Dyer-Witheford, Nick (2026)