1 |
孙清. 基于强化学习的多智能体协同机制研究[D]. 杭州: 浙江工业大学, 2015.
|
|
SUN Qing. Research on Multi-Agent Collaboration Mechanism Based on Reinforcement Learning[D]. Hangzhou:Zhejiang University of Technology, 2015.
|
2 |
柏晓祉. 强化学习在多智能体协同中的研究与应用[D].成都: 电子科技大学, 2020.
|
|
BAI Xiao-zhi. Research and Application of Reinforcement Learning in Multi-Agent Collaboration[D]. Chengdu: University of Electronic Science and Technology of China, 2020.
|
3 |
李天旭. 基于深度强化学习的多智能体协同算法研究[D]. 北京: 中国矿业大学, 2020.
|
|
LI Tian-xu. Research on Multi-Agent Cooperative Algorithm Based on Deep Reinforcement Learning[D].Beijing: China University of Mining and Technology,2020.
|
4 |
谭晓阳, 文超, 姚兴虎. 一种基于λ-回报的异策略多智能体强化学习协作方法: CN111079305A[P]. 2020-04-28.
|
|
TAN Xiao-yang, WEN Chao, YAO Xing-hu. Multi-Agent Reinforcement Learning Cooperation Method Based on λ-Return: CN111079305A[P].2020-04-28.
|
5 |
陈亮, 梁宸, 张景异, 等. Actor-Critic框架下一种基于改进DDPG的多智能体强化学习算法[J]. 控制与决策, 2021, 36(1): 75-82.
|
|
CHEN Liang, LIANG Chen, ZHANG Jing-yi,et al. A Multi-Agent Reinforcement Learning Algorithm Basedon Improved DDPG Under Actor-Critic Framework[J].Control and Decision, 2021, 36(1): 75-82.
|
6 |
郑健, 陈建, 朱琨. 基于多智能体强化学习的无人集群协同设计[J]. 指挥信息系统与技术,2020,11(6): 26-31.
|
|
ZHENG Jian, CHEN Jian, ZHU Kun. Collaborative Design of Unmanned Cluster Based on Multi-Agent Reinforcement Learning[J]. Command Information System and Technology, 2020, 11(6): 26-31.
|
7 |
曹雷. 基于深度强化学习的智能博弈对抗关键技术[J]. 指挥信息系统与技术, 2019, 10(5): 1-7.
|
|
CAO Lei. Key Technologies of Intelligent Game Confrontation Based on Deep Reinforcement Learning[J]. Command Information System and Technology, 2019, 10(5): 1-7.
|
8 |
HAUSKNECHT M, STONE P. Deep Recurrent Q-Learning for Partially Observable MDP[C]∥2015 AAAI Spring Symposium Series, Palo Alto,CA,2015.
|
9 |
MNIH V, BADIA A P, MIRZA M, et al. Asynchronous Methods for Deep Reinforcement Learning[C]∥International Conference on Machine Learning (ICLR). New York,2016: 1928-1937.
|
10 |
LOWE R, WU Y I, TAMAR A, et al. Multi-Agent Actor-Critic for Mixed Cooperative Competitive Environments[C]//Advances in Neural Information Processing Systems.San Francisco, 2017: 6379-6390.
|
11 |
PENG P, WEN Y, YANG Y, et al. Multiagent Bidirectionally-Coordinated Nets for Learning to Play StarCraft Combat Games[J]. arXiv: Learning, 2017.
|
12 |
WEI E, WICKE D, FREELAN D, et al. Multiagent Soft Q-Learning[C]∥2018 AAAI Spring Symposium Series, 2018.
|
13 |
SUNEHAG P, LEVER G, GRUSLYS A, et al. Value-Decomposition Networks For Cooperative Multi-Agent Learning Based on Team Reward[C]∥Adaptive Agents and Multi-Agents Systems (AAMAS). Stockholm,Sweden,2018: 2085-2087.
|
14 |
RASHID T, SAMVELYAN M, SCHROEDER C, et al. QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning[C]∥ International Conference on Machine Learning (ICML). Stockholm, Sweden,2018: 4292-4301.
|
15 |
FOERSTER J N, FARQUHAR G, AFOURAS T, et al. Counterfactual Multi-Agent policy gradients[C]∥ Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans,2018.
|
16 |
IQBAL S, SHA F. Actor-Attention-Critic for Multi-Agent Reinforcement Learning[C]∥International Conference on Machine Learning.San Francisco,2019: 2961-2970.
|