Joseph A. Walton crest

active

RL Queue Controller

Learning dynamic service-rate policies in stochastic queues

This environment tests how reinforcement-learning agents adjust service capacity as queues evolve. The learned policies are compared against analytical and dynamic-programming benchmarks and validated through repeated simulation.
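The setup described above can be sketched as a small simulation. This is a minimal illustration rather than the project's actual environment: it assumes a discrete-time single-server queue with two service rates, and every name and number in it (ARRIVAL_PROB, SERVICE_PROBS, the cost weights, threshold_policy) is hypothetical. A fixed threshold rule stands in for a learned policy, and its average cost is estimated by repeated simulation.

```python
import random

# All values below are illustrative assumptions, not the project's settings.
ARRIVAL_PROB = 0.3            # chance of one arrival per time step
SERVICE_PROBS = (0.2, 0.5)    # action 0 = slow service, action 1 = fast service
SERVICE_COST = (0.0, 1.0)     # per-step cost of running each service rate
HOLDING_COST = 1.0            # per-step cost for each job in the system
MAX_QUEUE = 20                # arrivals to a full queue are dropped


def step(queue_len, action, rng):
    """Advance the queue one time step; return (next_queue_len, cost)."""
    cost = HOLDING_COST * queue_len + SERVICE_COST[action]
    # At most one service completion, then at most one arrival, per step.
    if queue_len > 0 and rng.random() < SERVICE_PROBS[action]:
        queue_len -= 1
    if queue_len < MAX_QUEUE and rng.random() < ARRIVAL_PROB:
        queue_len += 1
    return queue_len, cost


def threshold_policy(threshold):
    """Use the fast rate whenever the queue holds at least `threshold` jobs."""
    return lambda queue_len: 1 if queue_len >= threshold else 0


def average_cost(policy, episodes=200, horizon=500, seed=0):
    """Estimate mean per-step cost by repeated simulation from an empty queue."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(episodes):
        queue_len = 0
        for _ in range(horizon):
            queue_len, cost = step(queue_len, policy(queue_len), rng)
            total += cost
    return total / (episodes * horizon)


if __name__ == "__main__":
    # Compare a few hand-picked thresholds under the same random seed.
    for threshold in (1, 3, 6):
        print(threshold, round(average_cost(threshold_policy(threshold)), 3))
```

In a sketch like this, a trained agent would replace threshold_policy with a mapping from queue length to action, and average_cost would be the common yardstick for comparing it against a benchmark policy.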

Methods

  • PPO
  • A2C
  • REINFORCE
  • Simulation

Links