Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more (English Edition)Packt Publishing2018/06169Maxim Lapan読んだ読みたいamazonで見る