If the validation sample showed systematic errors in specific categories, revise the classification prompt and rerun those items.