Glossary

1. **The training objective**: what the system is instructed to maximize
2. **The feature set**: what information the system has access to
3. **The exploration-exploitation balance**: how much the system diversifies vs. reinforces established preferences
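The exploration-exploitation balance can be made concrete with a small sketch. The example below uses an epsilon-greedy rule, which is one common way to trade off the two and is an assumption here, not something the glossary specifies. The function name `recommend`, the `epsilon` parameter, and the score values are all hypothetical and chosen only for illustration.

```python
import random

def recommend(scores, epsilon, rng):
    """Pick an item index: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        # Explore: pick any item uniformly at random (diversifies).
        return rng.randrange(len(scores))
    # Exploit: pick the highest-scored item (reinforces established preferences).
    return max(range(len(scores)), key=scores.__getitem__)

# Hypothetical predicted-engagement scores for four items.
scores = [0.2, 0.9, 0.4, 0.1]
rng = random.Random(0)

# epsilon = 0.0 never explores; epsilon = 0.3 explores about 30% of the time.
picks_greedy = [recommend(scores, 0.0, rng) for _ in range(1000)]
picks_mixed = [recommend(scores, 0.3, rng) for _ in range(1000)]
```

With `epsilon` at zero the sketch always returns the top-scored item, while a larger `epsilon` spreads some recommendations across the other items. Real systems may use quite different mechanisms to set this balance.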
