Publications

Preprint Papers

6
  • LLM Advertisement based on Neuron Auctions

    LLM Advertising2026

    Peiran Yun, Wenxin Xu, Jiayuan Liu, Yihang Zhang, Liang Zeng, Lingkai Kong, Tonghan Wang

  • NaiAD: Initiate Data-Driven Research for LLM Advertising

    LLM Advertising2026

    Yihang Zhang, Zimeng Huang, Ren Zhai, Yipeng Kang, Tonghan Wang

  • How LLMs Are Persuaded: A Few Attention Heads, Rerouted

    LLM2026

    Xiangkun Sun, Lingkai Kong, Aoqi Zhang, Liang Zeng, Tonghan Wang

  • The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents

    LLM Agents2026

    Jiayuan Liu, Tianqin Li, Shiyi Du, Xin Luo, Haoxuan Zeng, Emanuel Tewolde, Tai Sing Lee, Tonghan Wang, Carl Kingsford, Vincent Conitzer

  • Incentive-Aware Multi-Fidelity Optimization for Generative Advertising in Large Language Models

    LLM Advertising2026

    Jiayuan Liu, Barry Wang, Jiarui Gan, Tonghan Wang, Leon Xie, Mingyu Guo, Vincent Conitzer

  • LLM Active Alignment: A Nash Equilibrium Perspective

    LLM Agents2026

    Tonghan Wang*, Yuqi Pan*, Xinyi Yang*, Yanchen Jiang, Milind Tambe, David C. Parkes

Conference Papers

25
  • Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards

    2026

    Xinyi Yang, Liang Zeng, Heng Dong, Chao Yu, Xiaoran Wu, Huazhong Yang, Yu Wang, Milind Tambe, Tonghan Wang

  • Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data

    2025

    Lingkai Kong*, Haichuan Wang*, Tonghan Wang*, Guojun Xiong, Milind Tambe

  • BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization

    2025

    Tonghan Wang, Yanchen Jiang, David C. Parkes

  • Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing

    2025

    Davin Choo*, Yuqi Pan*, Tonghan Wang, Milind Tambe, Alastair van Heerden, Cheryl Johnson

  • Robust Optimization with Diffusion Models for Green Security

    2025

    Lingkai Kong, Haichuan Wang, Yuqi Pan, Cheol Woo Kim, Mingxiao Song, Alayna Nguyen, Tonghan Wang, Haifeng Xu, Milind Tambe

  • On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow

    2025

    Tonghan Wang*, Heng Dong*, Yanchen Jiang, David C. Parkes, Milind Tambe

  • The Bandit Whisperer: Communication Learning for Restless Bandits

    2025

    Tonghan Wang*, Yunfan Zhao*, Dheeraj Mysore Nagaraj, Aparna Taneja, Milind Tambe

  • GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning

    2024

    Tonghan Wang*, Yanchen Jiang*, David C. Parkes

  • Multi-Sender Persuasion: A Computational Perspective

    2024

    Tonghan Wang*, Safwan Hossain*, Tao Lin*, Yiling Chen, David C. Parkes, Haifeng Xu

  • Position: Social Environment Design Should be Further Developed for AI-based Policy-Making

    2024

    Edwin Zhang, Sadie Zhao, Tonghan Wang, Safwan Hossain, Henry Gasztowtt, Stephan Zheng, David C. Parkes, Milind Tambe, Yiling Chen

  • Deep Contract Design via Discontinuous Neural Networks

    2023

    Tonghan Wang, Paul Dütting, Dmitry Ivanov, Inbal Talgam-Cohen, David C. Parkes

  • Symmetry-Aware Robot Design with Structured Subgroups

    2023

    Heng Dong, Junyu Zhang, Tonghan Wang, Chongjie Zhang

  • Low-Rank Modular Reinforcement Learning via Muscle Synergy

    2022

    Tonghan Wang*, Heng Dong*, Jiayuan Liu, Chongjie Zhang

  • Non-Linear Coordination Graphs

    2022

    Tonghan Wang*, Yipeng Kang*, Qianlan Yang, Xiaoran Wu, Chongjie Zhang

  • Self-Organized Polynomial-Time Coordination Graphs

    2022

    Qianlan Yang, Weijun Dong, Zhizhou Ren, Jianhao Wang, Tonghan Wang, Chongjie Zhang

  • Context-Aware Sparse Deep Coordination Graphs

    2022

    Tonghan Wang*, Liang Zeng*, Weijun Dong, Qianlan Yang, Chongjie Zhang

  • Celebrating Diversity in Shared Multi-Agent Reinforcement Learning

    2021

    Chenghao Li*, Tonghan Wang*, Chengjie Wu, Qianchuan Zhao, Jun Yang, Chongjie Zhang

  • RODE: Learning Roles to Decompose Multi-Agent Tasks

    2021

    Tonghan Wang, Tarun Gupta, Anuj Mahajan, Bei Peng, Shimon Whiteson, Chongjie Zhang

  • DOP: Off-Policy Multi-Agent Decomposed Policy Gradients

    2021

    Tonghan Wang*, Yihan Wang*, Beining Han*, Heng Dong, Chongjie Zhang

  • Incorporating Pragmatic Reasoning Communication into Emergent Language

    2020

    Yipeng Kang, Tonghan Wang, Gerard de Melo

  • ROMA: Multi-Agent Reinforcement Learning with Emergent Roles

    2020

    Tonghan Wang, Heng Dong, Victor Lesser, Chongjie Zhang

  • Influence-Based Multi-Agent Exploration

    2020

    Tonghan Wang*, Jianhao Wang*, Yi Wu, Chongjie Zhang

  • Learning Nearly Decomposable Value Functions with Communication Minimization

    2020

    Tonghan Wang*, Jianhao Wang*, Chongyi Zheng, Chongjie Zhang

  • Convergence of Multi-Agent Learning with a Finite Step Size in General-Sum Games

    2019

    Xinliang Song, Tonghan Wang, Chongjie Zhang

  • Compact Object Representation of a Non-Rigid Object for Real-Time Tracking in AR Systems

    2018

    Tonghan Wang, Xueying Qin, Fan Zhong, Baoquan Chen, Ming C. Lin

Journal Papers

3
  • Automated Mechanism Design: A Survey

    2025

    Michael J. Curry, Zhou Fan, Yanchen Jiang, Sai Srivatsa Ravindranath, Tonghan Wang, David C. Parkes

  • Multi-Agent Policy Transfer via Task Relationship Modeling

    2024

    Tonghan Wang*, Rongjun Qin*, Feng Chen*, Lei Yuan, Xiaoran Wu, Zongzhang Zhang, Chongjie Zhang, Yang Yu

  • Celebrating Diversity With Subtask Specialization in Shared Multiagent Reinforcement Learning

    2023

    Chenghao Li, Tonghan Wang, Chengjie Wu, Qianchuan Zhao, Jun Yang, Chongjie Zhang

Workshop Papers

2
  • Multi-Agent Policy Transfer via Task Relationship Modeling

    2022

    Tonghan Wang*, Rongjun Qin*, Feng Chen*, Lei Yuan, Xiaoran Wu, Zongzhang Zhang, Chongjie Zhang, Yang Yu

  • Model and Method: Training-Time Attack for Cooperative Multi-Agent Reinforcement Learning

    2022

    Tonghan Wang*, Siyang Wu*, Xiaoran Wu, Jingfeng Zhang, Yujing Hu, Changjie Fan, Chongjie Zhang

© 2026. Tonghan Wang. All rights reserved.