- Does the system perform differently for different demographic groups (defined by race, gender, age, disability, national origin, or other protected characteristics)?
- What fairness metric or metrics does the system's developer use, and are those metrics appropriate given the deployment context?
- A