https://hgpu.org/?p=5792
Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations