- Built an ETL pipeline using Apache Spark to normalize data from multiple hospital systems into a common FHIR-based format.
- Implemented a data quality monitoring system that flagged anomalies and missing data patterns.
- Established a de-identification pipeline that removed protected health informa