Reinforcement Learning — An AI learns by trial and error, receiving rewards for good actions and penalties for bad ones. How AlphaGo mastered Go, how robots learn to walk, and how RLHF trains chatbots. The agent explores, exploits what works, and gradually develops optimal strategies.
Part of the XLUXX AI Encyclopedia — the most comprehensive AI reference on the web.

Leave a Reply