Parquet for the main play-by-play data (will have many rows) - Reasoning: Smaller file size, faster reads, preserves data types - Could use CSV for smaller reference tables