**How it works:** An RL agent learns optimal pricing strategies through trial-and-error interaction with bettor behavior. The agent observes bet flow (volume, source, timing) and adjusts lines to maximize expected profit, learning to shade prices based on observed bettor patterns (e.g., adding extra