active

RL Queue Controller

Learning dynamic service-rate policies in stochastic queues

The environment evaluates how reinforcement learning agents adjust service capacity as queues evolve. Policies are compared against analytical and dynamic-programming benchmarks and validated through repeated simulation.

Methods

PPO
A2C
REINFORCE
Simulation

RL Queue Controller

Methods

Links