Playing Atari with Deep Reinforc

Playing Atari with Deep Reinforc

作者: 海街diary | 来源:发表于2018-03-10 00:07 被阅读18次

Playing Atari with Deep Reinforc
8/10/2019 PaperReading: Playing
【5分钟 Paper】Playing Atari with De
深度学习背后的人工智能：深度学习原理初探
《Playing Atari with Deep Reinfor
[DQN] Playing Atari with Deep Re
强化学习是什么
Reinforcement Learning（RL）的事
目录
客座文章(一): 揭开强化学习神秘的面纱

1. 简介

使用CNN从raw pixel学习Q(s, a)，利用experience memory学习Q(s, a)，在atari2600 games中的7款游戏上进行了测试，全部超越之前算法，并且在3款游戏上超过了人类。

2. 算法

1. DQN算法

DQN Algorithm

2. 算法细节

DQN Architecture

RMSProp
Minibatch with 32
linearly from 1 to 0.1 over the first million frames, and fixed 0.1 thereafter
time steps = 10 million frames, memory size = 1 million
skip frames = 4

3.实验

same network architecture, same learning algorithm, same hyper parameter across all seven games.
Raw pixel cropped to 84x84x4.
在固定时间步下，比较不同算法(其他算法的输入是handcraft-feature)在7款游戏上的、所有episode的reward sum的average；同时，比较在这些episode中reward sum的最大值。此外，包括人类选手的score。
为了适应不同游戏的reward, 在train的时候positive reward=1, negative reward=-1, zero reward=0。
评价的时候使用 for a fixed number of steps(具体数字未提)。

4.收获

2 dense layers for output。
不同游戏时reward 归一，便于generalization。
memory size 约为 total steps 的 1/10。
RMSProp优化算法
参考文献值的读的：
《Prioritized Sweeping- Reinforcement Learning with Less Data and Less Real Time》、
《Deep Auto-Encoder Neural Networks in Reinforcement Learning》

相关文章

Playing Atari with Deep Reinforc
1. 简介使用CNN从raw pixel学习Q(s, a)，利用experience memory学习Q(s, ...
8/10/2019 PaperReading: Playing
Playing Atari with Deep Reinforcement Learning Abstract 使...
【5分钟 Paper】Playing Atari with De
论文题目：Playing Atari with Deep Reinforcement Learning 所解决的问...
深度学习背后的人工智能：深度学习原理初探
去年11月，一篇名为《Playing Atari with Deep Reinforcement Learning...
《Playing Atari with Deep Reinfor
领域：强化学习强化学习很久以来的一个重要挑战就是学习control agents能够直接从高维度的场景输入，例如...
[DQN] Playing Atari with Deep Re
论文链接：https://arxiv.org/abs/1312.5602[https://arxiv.org/ab...
强化学习是什么
参考 2013年伦敦的一家人工智能公司 Deep Mind 发表了一篇论文 “Playing Atari with...
Reinforcement Learning（RL）的事
两年前，位于伦敦的一家小公司，在arxiv上提交了一篇论文——Playing Atari with Deep Re...
目录
Machine Learning Deep Learning Transfer Learning Reinforc...
客座文章(一): 揭开强化学习神秘的面纱
本文禁止转载原文:Guest Post (Part I): Demystifying Deep Reinforc...

网友评论

本文标题：Playing Atari with Deep Reinforc

本文链接：https://www.haomeiwen.com/subject/ifqvfftx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

栏目导航

热点阅读

关于我们|服务条款|联系我们|Playing Atari with Deep Reinforc|投稿指南|网站地图|RSS订阅|排版工具|手机版

提供经典美文摘抄,优美散文欣赏,现代诗歌精选,短篇小说,心情随笔,表白情书范文,故事会在线阅读欣赏

Copyright © 2014-2023 Haomeiwen.com All Rights Reserved. 好美文阅读网版权所有

备案信息：桂公网安备 45052102000051号 · 桂ICP备13007215号-3

本站所收录作品、热点评论等信息部分来源互联网，目的只是为了系统归纳学习和传递资讯

所有作品版权归原创作者所有，与本站立场无关，如不慎侵犯了你的权益，请联系我们告知，我们将做删除处理！