Modelling the dynamic joint policy of teammates with attention multi-agent DDPG H Mao, Z Zhang, Z Xiao, Z Gong Proceedings of the 18th International Conference on Autonomous Agents and …, 2019 | 102 | 2019 |
Learning agent communication under limited bandwidth by message pruning H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5142-5149, 2020 | 76 | 2020 |
Neighborhood cognition consistent multi-agent reinforcement learning H Mao, W Liu, J Hao, J Luo, D Li, Z Zhang, J Wang, Z Xiao Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7219-7226, 2020 | 67 | 2020 |
TPTU: Task planning and tool usage of large language model-based AI agents J Ruan, Y Chen, B Zhang, Z Xu, T Bao, G Du, S Shi, H Mao, X Zeng, ... arXiv preprint arXiv:2308.03427, 2023 | 57 | 2023 |
ACCNet: Actor-coordinator-critic net for "Learning-to-communicate" with deep multi-agent reinforcement learning H Mao, Z Gong, Y Ni, Z Xiao arXiv preprint arXiv:1706.03235, 2017 | 46 | 2017 |
Learning multi-agent communication with double attentional deep reinforcement learning H Mao, Z Zhang, Z Xiao, Z Gong, Y Ni Autonomous Agents and Multi-Agent Systems 34, 1-34, 2020 | 41 | 2020 |
Learning multi-agent communication under limited-bandwidth restriction for internet packet routing H Mao, Z Gong, Z Zhang, Z Xiao, Y Ni arXiv preprint arXiv:1903.05561, 2019 | 28 | 2019 |
Reward design in cooperative multi-agent reinforcement learning for packet routing H Mao, Z Gong, Z Xiao arXiv preprint arXiv:2003.03433, 2020 | 23 | 2020 |
An efficient transfer learning framework for multiagent reinforcement learning T Yang, W Wang, H Tang, J Hao, Z Meng, H Mao, D Li, W Liu, Y Chen, ... Advances in Neural Information Processing Systems 34, 17037-17048, 2021 | 21 | 2021 |
SEIHAI: A sample-efficient hierarchical AI for the MineRL competition H Mao, C Wang, X Hao, Y Mao, Y Lu, C Wu, J Hao, D Li, P Tang Distributed Artificial Intelligence: Third International Conference, DAI …, 2022 | 20 | 2022 |
Cooperative multi-agent transfer learning with level-adaptive credit assignment T Zhou, F Zhang, K Shao, K Li, W Huang, J Luo, W Wang, Y Yang, H Mao, ... arXiv preprint arXiv:2106.00517, 2021 | 20 | 2021 |
What about inputting policy in value function: Policy representation and policy-extended value function approximator H Tang, Z Meng, J Hao, C Chen, D Graves, D Li, C Yu, H Mao, W Liu, ... Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 8441-8449, 2022 | 17 | 2022 |
Structural relational inference actor-critic for multi-agent reinforcement learning X Zhang, Y Liu, X Xu, Q Huang, H Mao, A Carie Neurocomputing 459, 383-394, 2021 | 17 | 2021 |
TPTU-v2: Boosting task planning and tool usage of large language model-based agents in real-world systems Y Kong, J Ruan, Y Chen, B Zhang, T Bao, S Shi, G Du, X Hu, H Mao, Z Li, ... arXiv preprint arXiv:2311.11315, 2023 | 11 | 2023 |
Towards robust and domain agnostic reinforcement learning competitions: MineRL 2020 WH Guss, S Milani, N Topin, B Houghton, S Mohanty, A Melnik, A Harter, ... NeurIPS 2020 Competition and Demonstration Track, 233-252, 2021 | 10 | 2021 |
Controlling large language model-based agents for large-scale decision-making: An actor-critic approach B Zhang, H Mao, J Ruan, Y Wen, Y Li, S Zhang, Z Xu, D Li, Z Li, R Zhao, ... arXiv preprint arXiv:2311.13884, 2023 | 9 | 2023 |
Boosting multiagent reinforcement learning via permutation invariant and permutation equivariant networks J Hao, X Hao, H Mao, W Wang, Y Yang, D Li, Y Zheng, Z Wang The Eleventh International Conference on Learning Representations, 2022 | 9 | 2022 |
API: Boosting multi-agent reinforcement learning via agent-permutation-invariant networks X Hao, W Wang, H Mao, Y Yang, D Li, Y Zheng, Z Wang, J Hao arXiv preprint arXiv:2203.05285, 2022 | 8 | 2022 |
Transformer in transformer as backbone for deep reinforcement learning H Mao, R Zhao, H Chen, J Hao, Y Chen, D Li, J Zhang, Z Xiao arXiv preprint arXiv:2212.14538, 2022 | 7 | 2022 |
Multiagent Q-learning with sub-team coordination W Huang, K Li, K Shao, T Zhou, M Taylor, J Luo, D Wang, H Mao, J Hao, ... Advances in Neural Information Processing Systems 35, 29427-29439, 2022 | 7 | 2022 |