1
Front Matter
5 chapters2
Part I: Mathematical and Computational Foundations
6 chapters- Part I: Mathematical and Computational Foundations
- Chapter 1: Linear Algebra for Machine Learning
- Chapter 2: Multivariate Calculus and Optimization
- Chapter 3: Probability Theory and Statistical Inference
- Chapter 4: Information Theory for Data Science — Entropy, KL Divergence, and Why Your Loss Function Works
- Chapter 5: Computational Complexity and Scalability — Knowing What's Possible Before You Start Coding
3
Part II: Deep Learning
10 chapters- Part II: Deep Learning
- Chapter 6: Neural Networks from Scratch
- Chapter 7: Training Deep Networks — Initialization, Batch Normalization, Dropout, Learning Rate Schedules, and the Dark Art of Making It Converge
- Chapter 8: Convolutional Neural Networks — Architecture, Intuition, and Computer Vision Applications
- Chapter 9: Recurrent Networks and Sequence Modeling — RNNs, LSTMs, GRUs, and Their Limitations
- Chapter 10: The Transformer Architecture — Attention Is All You Need (and Why It Changed Everything)
- Chapter 11: Large Language Models — Architecture, Training, Fine-Tuning, RAG, and Practical Applications
- Chapter 12: Generative Models — VAEs, GANs, Diffusion Models, and the Frontier of Data Generation
- Chapter 13: Transfer Learning, Foundation Models, and the Modern Deep Learning Workflow
- Chapter 14: Graph Neural Networks and Geometric Deep Learning — When Your Data Has Structure Beyond Grids and Sequences
4
Part III: Causal Inference
6 chapters- Part III: Causal Inference
- Chapter 15: Beyond Prediction — Why Correlation Isn't Enough and What Causal Inference Offers
- Chapter 16: The Potential Outcomes Framework — Counterfactuals, ATEs, and the Fundamental Problem of Causal Inference
- Chapter 17: Graphical Causal Models — DAGs, d-Separation, and Structural Causal Models
- Chapter 18: Causal Estimation Methods — Matching, Propensity Scores, Instrumental Variables, Difference-in-Differences, and Regression Discontinuity
- Chapter 19: Causal Machine Learning — Heterogeneous Treatment Effects, Uplift Modeling, and Double Machine Learning
5
Part IV: Bayesian and Temporal Data Science
5 chapters- Part IV: Bayesian and Temporal Data Science
- Chapter 20: Bayesian Thinking — Priors, Posteriors, and Why Frequentist vs. Bayesian Is the Wrong Debate
- Chapter 21: Bayesian Modeling in Practice — PyMC, Hierarchical Models, and When Bayesian Methods Earn Their Complexity
- Chapter 22: Bayesian Optimization and Sequential Decision-Making — From Hyperparameter Tuning to Bandits
- Chapter 23: Advanced Time Series and Temporal Models — State-Space Models, Temporal Fusion Transformers, and Probabilistic Forecasting
6
Part V: Production ML Systems
8 chapters- Part V: Production ML Systems
- Chapter 24: ML System Design — Architecture Patterns for Real-World Machine Learning
- Chapter 25: Data Infrastructure — Feature Stores, Data Warehouses, Lakehouses, and the Plumbing Nobody Teaches
- Chapter 26: Training at Scale — Distributed Training, GPU Optimization, and Managing Compute Costs
- Chapter 27: ML Pipeline Orchestration — Airflow, Dagster, Prefect, and Designing Robust Data Workflows
- Chapter 28: ML Testing and Validation Infrastructure — Data Contracts, Behavioral Testing, and Great Expectations
- Chapter 29: Continuous Training and Deployment — CI/CD for ML, Canary Deployments, Shadow Mode, and Progressive Rollout
- Chapter 30: Monitoring, Observability, and Incident Response — Keeping ML Systems Healthy in Production
7
Part VI: Responsible and Rigorous Data Science
6 chapters- Part VI: Responsible and Rigorous Data Science
- Chapter 31: Fairness in Machine Learning — Definitions, Impossibility Results, Mitigation Strategies, and Organizational Practice
- Chapter 32: Privacy-Preserving Data Science — Differential Privacy, Federated Learning, and Synthetic Data
- Chapter 33: Rigorous Experimentation at Scale — Multi-Armed Bandits, Interference Effects, and Experimentation Platforms
- Chapter 34: Uncertainty Quantification — Calibration, Conformal Prediction, and Knowing What Your Model Doesn't Know
- Chapter 35: Interpretability and Explainability at Scale — From SHAP to Concept-Based Explanations in Production
8
Part VII: Leadership and Synthesis
5 chapters- Part VII: Leadership and Synthesis
- Chapter 36: Capstone — Designing, Building, and Governing a Production Recommendation System
- Chapter 37: Reading Research Papers — How to Stay Current, Evaluate Claims, and Separate Signal from Hype
- Chapter 38: The Staff Data Scientist — Technical Leadership, Mentoring, Strategy, and Shaping the Roadmap
- Chapter 39: Building and Leading a Data Science Organization — Hiring, Team Structure, Culture, and Scaling Impact
9
Appendices
13 chapters- Glossary
- Answers to Selected Exercises
- Bibliography
- Appendix A: Mathematical Notation Reference
- Appendix B: PyTorch API Reference
- Appendix C: Python ML Ecosystem Reference
- Appendix D: Environment Setup
- Appendix E: Dataset Catalog
- Appendix F: SQL and Infrastructure Reference
- Appendix G: Evaluation Metrics Reference
- Appendix H: ML System Design Patterns
- Appendix I: Key Papers and Reading Lists
- Appendix J: Causal Inference Identification Guide
Explore Related Books
More open-access textbooks from our library
Advanced COBOL 305 pages AI Ethics 304 pages AI Literacy 40 pages AI & ML for Business 304 pages AI Engineering 307 pages Algorithmic Addiction 303 pages Applied Psychology 303 pages Learning Assembly Language 306 pages College Football Analytics 213 pages Creator Economy 318 pages Pattern Recognition 322 pages Data & Society 305 pages Ethical Hacking 318 pages Fandom 332 pages History of Appalachia 324 pages How Humans Get Stuck 285 pages Handling Confrontation 306 pages How Your House Works 306 pages IBM DB2 282 pages Intermediate COBOL 334 pages Intermediate Data Science 278 pages Intro CS Python 44 pages Intro to Data Science 266 pages Introductory Statistics 216 pages Learning COBOL 322 pages Prediction Markets 316 pages Metacognition 222 pages Media Literacy 314 pages NFL Analytics 182 pages Physics of Music 316 pages Political Analytics 324 pages Basketball Analytics 214 pages Soccer Analytics 230 pages Propaganda 304 pages Python for Business 298 pages Quantum Mechanics 303 pages RegTech 307 pages Science of Seduction 320 pages Sports Betting 322 pages Architecture of Surveillance 299 pages Science of Luck 306 pages Vibe Coding 316 pages Why They Watch 308 pages Working with AI 316 pages