This is **data engineering**. Building pipelines is essential infrastructure, but it is not itself asking or answering a question. (It supports stage 2, data collection, but it is not *doing* data science.) 2. **Not data science** — This is **mathematical statistics / statistical theory**. There is