Treat Simulation Playground
Train 1x
Train 10x
Train 100x
Reset
View: Radar
Use arrow keys
Learning Parameters
Learning Rate (α)
0.30
Discount Factor (γ)
0.90
Training Speed
5ms