Schlaukopf
.ai
In Reinforcement Learning, the process of balancing exploration and exploitation is referred to as the:
Policy optimization
State-action value function approximation
Exploration-exploitation dilemma
Value iteration
Artificial Intelligence Exercises are loading ...