---
Reinforcement learning, mostly. The parts I keep circling back to are multi-agent learning, environment design, and the question of when an agent should stop gathering information and commit.
rifaki.me
[email protected]
submissions | comments