Preprints:
-
Game-Theoretic Co-Evolution for LLM-Based Heuristic Discovery.
Xinyi Ke, Kai Li, Junliang Xing, Yifan Zhang, Jian Cheng.
arXiv:2601.22896. (Corresponding Author)
Paper -
LLM-Based Scientific Equation Discovery via Physics-Informed Token-Regularized Policy Optimization
Boxiao Wang, Kai Li, Tianyi Liu, Chen Li, Junzhe Wang, Yifan Zhang, Jian Cheng.
arXiv:2602.10576. (Corresponding Author)
Paper -
DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience
Runxiang Wang, Boxiao Wang, Kai Li, Yifan Zhang, Jian Cheng.
arXiv:2506.04282. (Corresponding Author)
Paper
Recent Papers:
-
Evolutionary Augmented Reinforcement Learning for Neural Combinatorial Optimization.
Shengda Gu, Kai Li, Junliang Xing, Yifan Zhang, Jian Cheng.
IEEE Transactions on Evolutionary Computation, 2026. (Corresponding Author)
Paper -
Towards Foresighted AI Cooperators with LLM-driven Decision-Time Planning.
Yuheng Jing, Kai Li, Bingyun Liu, Ziwen Zhang, Zhe Wu, Yifan Zhang, Junliang Xing, Jian Cheng.
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2026. Oral. (Corresponding Author)
Paper -
K²-Agent: Co-Evolving Know-What and Know-How for Hierarchical Mobile Device Control.
Zhe Wu, Donglin Mo, Hongjin Lu, Junliang Xing, Jianheng Liu, Yuheng Jing, Kai Li, Kun Shao, Jianye Hao, Yuanchun Shi.
International Conference on Learning Representations (ICLR), 2026.
Paper -
Deep (Predictive) Discounted Counterfactual Regret Minimization.
Hang Xu, Kai Li, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng.
AAAI Conference on Artificial Intelligence (AAAI), 2026, Oral. (Corresponding Author)
Paper ArXiv -
Offline Opponent Modeling with Truncated Q-driven Instant Policy Refinement.
Yuheng Jing, Kai Li, Bingyun Liu, Ziwen Zhang, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng.
International Conference on Machine Learning (ICML), 2025. (Corresponding Author)
Paper -
Goal-Oriented Skill Abstraction for Offline Multi-Task Reinforcement Learning.
Jinmin He, Kai Li, Yifan Zang, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng.
International Conference on Machine Learning (ICML), 2025. (Corresponding Author)
Paper -
An Open-Ended Learning Framework for Opponent Modeling.
Yuheng Jing, Kai Li, Bingyun Liu, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng.
AAAI Conference on Artificial Intelligence (AAAI), 2025, Oral, Top 5%. (Corresponding Author)
Paper -
Enhanced Equilibria-Solving via Private Information Pre-Branch Structure in Adversarial Team Games.
Chen Qiu, Haobo Fu, Kai Li, Jiajia Zhang, Xuan Wang.
The Conference on Uncertainty in Artificial Intelligence (UAI), 2025.
Paper -
Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning.
Hanlin Yang, Jian Yao, Weiming Liu, Qing Wang, Hanmin Qin, Kong hansheng, Kirk Tang, Jiechao Xiong, Chao Yu, Kai Li, Junliang Xing, Hongwu Chen, Juchao Zhuo, Qiang Fu, Yang Wei, Haobo Fu.
International Conference on Learning Representations (ICLR), 2025.
Paper -
Automatically Designing Counterfactual Regret Minimization Algorithms for Solving Imperfect-Information Games.
Kai Li, Hang Xu, Haobo Fu, Qiang Fu, Junliang Xing.
Artificial Intelligence, 2024.
Paper PDF Code -
Efficient Multi-task Reinforcement Learning with Cross-Task Policy Guidance.
Jinmin He, Kai Li, Yifan Zang, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng.
Neural Information Processing Systems (NeurIPS), 2024. (Corresponding Author)
Paper Code -
Opponent Modeling with In-context Search.
Yuheng Jing, Bingyun Liu, Kai Li, Yifan Zang, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng.
Neural Information Processing Systems (NeurIPS), 2024. (Corresponding Author)
Paper Code -
Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent.
Hang Xu, Kai Li, Bingyun Liu, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng.
International Joint Conference on Artificial Intelligence (IJCAI), 2024. (Corresponding Author)
Paper ArXiv Code -
Towards Offline Opponent Modeling with In-context Learning.
Yuheng Jing, Kai Li, Bingyun Liu, Yifan Zang, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng.
International Conference on Learning Representations (ICLR), 2024. (Corresponding Author)
Paper Code -
Dynamic Discounted Counterfactual Regret Minimization.
Hang Xu, Kai Li, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng.
International Conference on Learning Representations (ICLR), 2024, Spotlight, Top 5%. (Corresponding Author)
Paper Code -
Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing.
Jinmin He, Kai Li, Yifan Zang, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng.
AAAI Conference on Artificial Intelligence (AAAI), 2024. (Corresponding Author)
Paper ArXiv -
Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning.
Yifan Zang, Jinmin He, Kai Li, Haobo Fu, Qiang Fu, Junliang Xing, Jian Cheng.
Neural Information Processing Systems (NeurIPS), 2023. (Corresponding Author)
Paper Code -
OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research.
Kai Li, Hang Xu, Enmin Zhao, Zhe Wu, Junliang Xing.
IEEE Transactions on Neural Networks and Learning Systems, 2023.
Paper PDF News -
Sample Efficient Reinforcement Learning Using Graph-Based Memory Reconstruction.
Yongxin Kang, Enmin Zhao, Yifan Zang, Lijuan Li, Kai Li, Pin Tao, Junliang Xing.
IEEE Transactions on Artificial Intelligence, 2023.
Paper PDF -
Sequential Cooperative Multi-Agent Reinforcement Learning.
Yifan Zang, Jinmin He, Kai Li, Haobo Fu, Qiang Fu, Junliang Xing.
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2023.
Paper PDF Code -
Greedy when Sure and Conservative when Uncertain about the Opponents.
Haobo Fu, Ye Tian, Hongxiang Yu, Weiming Liu, Shuang Wu, Jiechao Xiong, Ying Wen, Kai Li, Junliang Xing, Qiang Fu, Wei Yang.
International Conference on Machine Learning (ICML), 2022, Spotlight.
Paper Code -
Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game.
Haobo Fu, Weiming Liu, Shuang Wu, Yijia Wang, Tao Yang, Kai Li, Junliang Xing, Bin Li, Bo Ma, Qiang Fu, Yang Wei.
International Conference on Learning Representations (ICLR), 2022.
Paper Code -
AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms.
Hang Xu, Kai Li, Haobo Fu, Qiang Fu, Junliang Xing.
AAAI Conference on Artificial Intelligence (AAAI), 2022, Oral.
Paper Code -
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning.
Enmin Zhao, Renye Yan, Jinqiu Li, Kai Li, Junliang Xing.
AAAI Conference on Artificial Intelligence (AAAI), 2022. Distinguished Paper Award!
Paper News -
Exploration via State Influence Modeling.
Yongxin Kang, Enmin Zhao, Kai Li, Junliang Xing.
AAAI Conference on Artificial Intelligence (AAAI), 2021.
Paper -
Potential Driven Reinforcement Learning for Hard Exploration Tasks.
Enmin Zhao, Shihong Deng, Yifan Zang, Yongxin Kang, Kai Li, Junliang Xing.
International Joint Conference on Artificial Intelligence (IJCAI), 2020.
Paper Code