Cross-references to other patterns:

- Gradient descent (Ch. 7): Exploration solves the local optima problem that pure gradient descent cannot.
- Power laws (Ch. 4): Power-law distributions of outcomes make exploration more valuable because the best option may be vastly better than the second-best.
- Signal detection (Ch. 6): Exploration