Multi-Agent Reinforcement Learning with Serverless Computing
Rui Wei, Hanfei Yu, Xikang Song, Jian Li, Devesh Tiwari, Ying Mao, and Hao Wang
In Proceedings of the 2025 ACM Symposium on Cloud Computing, 2026
Multi-agent reinforcement learning (MARL) has emerged as a promising approach for tasks that require multiple agents to cooperate or compete, such as scientific simulation, multi-robot collaboration, and traffic control. Serverless computing, with its dynamic and flexible resource allocation, has demonstrated potential for improving training efficiency and cost-efficiency in RL workloads. However, existing serverless RL training systems focus primarily on single-agent scenarios and overlook the unique characteristics and inherent complexities of MARL, such as dynamic inter-agent relationships and heterogeneous policy requirements across agents, providing inefficient or even infeasible support for diverse and complex MARL algorithms.

This paper introduces MARLess, the first serverless MARL framework designed to support general MARL algorithms. MARLess decomposes MARL algorithms into serverless functions. It further integrates a dynamic learner-sharing mechanism that exploits agent similarities to reduce model-update costs, and employs actor scaling tailored to MARL tasks that minimizes unnecessary sampling costs based on the data requirements of agents' models. This design improves both training efficiency and cost without harming training quality. Experiments on AWS EC2 testbeds show that MARLess outperforms state-of-the-art MARL baselines with up to 1.27× faster training and 68% lower cost. Large-scale evaluations on a 15-node cluster with 1,920 vCPUs in total demonstrate MARLess's scalability and consistent performance under increasing workloads. For a real-world scientific application, turbulent flow simulation, MARLess achieves a 34% cost reduction and a 1.1× speedup.