[ 1 ] Sallab A E L,Abdou M,Perot E,et al.Deep reinforcement learning framework for autonomous driving[J].Electronic Imaging,
2017(19):70-76.
[ 2 ] Li Y.Deep reinforcement learning:An Overview[J].arXiv,
2017,arXiv:1701.07274.
[ 3 ] Silver D,Huang A,Maddison C J,et al.Mastering the game of
Go with deep neural networks and tree search[J].Nature, 2016,
529(7587):484-489.
[ 4 ] Xiong X,Wang J,Zhang F,et al.Combining deep reinforcement
learning and safety based control for autonomous driving[J].
arXiv,2016,arXiv:1612.00147.
[ 5 ] Yang F,Wang P,Wang X H.Continuous control in car simulator with deep reinforcement learning[C]//Proceedings of
the 2018 2nd International Conference on Computer Science
and Artificial Intelligence. 2018: 566-570.
[ 6 ] Liu Y,Zhang W,Chen F,et al.Path planning based on improved deep deterministic policy gradient algorithm[C]//2019
IEEE 3rd Information Technology, Networking, Electronic
and Automation Control Conference (ITNEC).IEEE,2019:
295-299.
[ 7 ] Zong X,Xu G,Yu G,et al.Obstacle avoidance for self-driving
vehicle with reinforcement learning[J]. SAE International
Journal of Passenger Cars-Electronic and Electrical Systems,
2017, 11(07-11-01-0003): 30-39.
[ 8 ] 刘全,翟建伟,章宗长,等.深度强化学习综述[J].计算机学报,
2018,41(1): 1-27.
[ 9 ] Wang Z, Bapst V, Heess N, et al. Sample efficient actor-critic
with experience replay[J]. arXiv,2016 arXiv:1611.01224.
[10] Silver D,Lever G,Heess N,et al.Deterministic policy gradient
algorithms[C]//Proceedings of the 31st International Conference on Machine Learning(ICML-14).New York,USA:ACM
Press,2014:387-395.
[11] Mnih V,Kavukcuoglu K,Silver D,et al.Human-level control
through deep reinforcement learning[J].nature,2015,518(7540):
529-533.
[12] Wymann B,Espié E,Guionneau C,et al.Torcs,the open racing
car simulator[J].Software available at http://torcs. sourceforge.net,2000,4(6):2-6. |