Tags exploration-exploitation1 machine-learning2 multi-armed-bandit1 probabilistic-models2 reinforcement-learning1 statistics2