In Reinforcement Learning, the process of balancing exploration and exploitation is referred to as the:
Policy optimization
State-action value function approximation
Exploration-exploitation dilemma
Value iteration

Artificial Intelligence Exercises are loading ...