Optimize Wider, Not Deeper: Consensus Aggregation for Policy Optimization (2026)
Zelal Su Mustafaoglu, Sungyoung Lee, Eshan Balachandar, Risto Miikkulainen, Keshav Pingali
View:
PDF
Risto Miikkulainen Faculty risto [at] cs utexas edu