https://hgpu.org/?p=15848
Training Neural Networks Without Gradients: A Scalable ADMM Approach