Hangyu Mao（毛航宇）

Cited by

	All	Since 2019
Citations	657	647
h-index	13	13
i10-index	15	15

Citations per year: 2017: 2, 2018: 4, 2019: 9, 2020: 48, 2021: 74, 2022: 147, 2023: 234, 2024: 135

Public access

13 articles available, 1 article not available (based on funding mandates)

Co-authors

Zhen Xiao; Peking University; Verified email at pku.edu.cn
Rui Zhao; SenseTime Group Limited; Verified email at sensetime.com
Wulong Liu; Huawei Noah's Ark Lab; Verified email at huawei.com
Bin Zhang; Institute of Automation, Chinese Academy of Sciences; Verified email at ia.ac.cn
Jingqing Ruan; Institute of Automation, Chinese Academy of Sciences; Verified email at ia.ac.cn
Ziyue LI; University of Cologne; Verified email at wiso.uni-koeln.de
Jun Wang; Professor, Computer Science, University College London; Verified email at cs.ucl.ac.uk
Jianye Hao; Tianjin University

Hangyu Mao（毛航宇）

SenseTime SCG/Research

Verified email at pku.edu.cn

AI Agent, Reinforcement Learning, Multi-Agent Reinforcement Learning, Large Language Model


Title	Cited by	Year
Modelling the dynamic joint policy of teammates with attention multi-agent DDPG; H Mao, Z Zhang, Z Xiao, Z Gong; Proceedings of the 18th International Conference on Autonomous Agents and …, 2019	102	2019
Learning agent communication under limited bandwidth by message pruning; H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni; Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5142-5149, 2020	76	2020
Neighborhood cognition consistent multi-agent reinforcement learning; H Mao, W Liu, J Hao, J Luo, D Li, Z Zhang, J Wang, Z Xiao; Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7219-7226, 2020	67	2020
TPTU: Task planning and tool usage of large language model-based AI agents; J Ruan, Y Chen, B Zhang, Z Xu, T Bao, G Du, S Shi, H Mao, X Zeng, ...; arXiv preprint arXiv:2308.03427, 2023	57	2023
ACCNet: Actor-coordinator-critic net for "Learning-to-communicate" with deep multi-agent reinforcement learning; H Mao, Z Gong, Y Ni, Z Xiao; arXiv preprint arXiv:1706.03235, 2017	46	2017
Learning multi-agent communication with double attentional deep reinforcement learning; H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni; Autonomous Agents and Multi-Agent Systems 34, 1-34, 2020	41	2020
Learning multi-agent communication under limited-bandwidth restriction for internet packet routing; H Mao, Z Gong, Z Zhang, Z Xiao, Y Ni; arXiv preprint arXiv:1903.05561, 2019	28	2019
Reward design in cooperative multi-agent reinforcement learning for packet routing; H Mao, Z Gong, Z Xiao; arXiv preprint arXiv:2003.03433, 2020	23	2020
An efficient transfer learning framework for multiagent reinforcement learning; T Yang, W Wang, H Tang, J Hao, Z Meng, H Mao, D Li, W Liu, Y Chen, ...; Advances in Neural Information Processing Systems 34, 17037-17048, 2021	21	2021
SEIHAI: A sample-efficient hierarchical AI for the MineRL competition; H Mao, C Wang, X Hao, Y Mao, Y Lu, C Wu, J Hao, D Li, P Tang; Distributed Artificial Intelligence: Third International Conference, DAI …, 2022	20	2022
Cooperative multi-agent transfer learning with level-adaptive credit assignment; T Zhou, F Zhang, K Shao, K Li, W Huang, J Luo, W Wang, Y Yang, H Mao, ...; arXiv preprint arXiv:2106.00517, 2021	20	2021
What about inputting policy in value function: Policy representation and policy-extended value function approximator; H Tang, Z Meng, J Hao, C Chen, D Graves, D Li, C Yu, H Mao, W Liu, ...; Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 8441-8449, 2022	17	2022
Structural relational inference actor-critic for multi-agent reinforcement learning; X Zhang, Y Liu, X Xu, Q Huang, H Mao, A Carie; Neurocomputing 459, 383-394, 2021	17	2021
TPTU-v2: Boosting task planning and tool usage of large language model-based agents in real-world systems; Y Kong, J Ruan, Y Chen, B Zhang, T Bao, S Shi, G Du, X Hu, H Mao, Z Li, ...; arXiv preprint arXiv:2311.11315, 2023	11	2023
Towards robust and domain agnostic reinforcement learning competitions: MineRL 2020; WH Guss, S Milani, N Topin, B Houghton, S Mohanty, A Melnik, A Harter, ...; NeurIPS 2020 Competition and Demonstration Track, 233-252, 2021	10	2021
Controlling large language model-based agents for large-scale decision-making: An actor-critic approach; B Zhang, H Mao, J Ruan, Y Wen, Y Li, S Zhang, Z Xu, D Li, Z Li, R Zhao, ...; arXiv preprint arXiv:2311.13884, 2023	9	2023
Boosting multiagent reinforcement learning via permutation invariant and permutation equivariant networks; J Hao, X Hao, H Mao, W Wang, Y Yang, D Li, Y Zheng, Z Wang; The Eleventh International Conference on Learning Representations, 2022	9	2022
API: Boosting multi-agent reinforcement learning via agent-permutation-invariant networks; X Hao, W Wang, H Mao, Y Yang, D Li, Y Zheng, Z Wang, J Hao; arXiv preprint arXiv:2203.05285, 2022	8	2022
Transformer in transformer as backbone for deep reinforcement learning; H Mao, R Zhao, H Chen, J Hao, Y Chen, D Li, J Zhang, Z Xiao; arXiv preprint arXiv:2212.14538, 2022	7	2022
Multiagent Q-learning with sub-team coordination; W Huang, K Li, K Shao, T Zhou, M Taylor, J Luo, D Wang, H Mao, J Hao, ...; Advances in Neural Information Processing Systems 35, 29427-29439, 2022	7	2022
