active
RL Queue Controller
Learning dynamic service-rate policies in stochastic queues
The environment evaluates how reinforcement learning agents adjust service capacity as queues evolve. Policies are compared against analytical and dynamic-programming benchmarks and validated through repeated simulation.
Methods
- PPO
- A2C
- REINFORCE
- Simulation
