Chapter 24 Further Reading: Audience Analytics with Python
Python and Data Analysis Foundations
1. "Python for Data Analysis" by Wes McKinney (3rd edition, 2022) McKinney created pandas. This book is the authoritative reference for pandas data manipulation, written by the person who knows it best. It's more reference than tutorial — dense but comprehensive. The chapters on time-series data are particularly relevant to growth analysis. Start with Chapters 5 (pandas basics) and 11 (time series). Search for the current edition, as pandas has evolved significantly from earlier versions. Best for: Creators who want to deeply understand pandas beyond copy-pasting scripts.
2. "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow" by Aurélien Géron (3rd edition, 2022) The best practical ML book available. For Chapter 24 purposes, Chapters 8–10 cover clustering algorithms including K-means in depth — explaining why it works, its limitations, and how to choose k. You don't need the deep learning chapters, but the clustering coverage is genuinely excellent and goes well beyond this chapter's introduction. Best for: Creators who want to understand why K-means works and what its alternatives are.
3. "Storytelling with Data" by Cole Nussbaumer Knaflic (2015) This is a data visualization book, not a Python book — but it's the most useful companion to matplotlib and seaborn work because it teaches you what makes a chart effective at communicating insight. The creator analytics charts you generate with matplotlib mean nothing if they're confusing to read. Knaflic's principles (choose the right chart type, eliminate clutter, highlight what matters) directly improve the quality of your Python analytics outputs. Best for: Creators who have learned to generate charts but want to make them clearer and more actionable.
Free Online Learning Resources
4. Kaggle Learn — Python and Pandas Micro-Courses (kaggle.com/learn) Kaggle offers free, browser-based micro-courses in Python, pandas, data visualization, and machine learning. The Python and pandas courses take approximately 5 hours each and are interactive — you write code in the browser and get immediate feedback. This is exactly the path Priya (Meridian Collective) used to learn pandas quickly for a specific business problem. Highly recommended for any creator who wants to use the scripts in this chapter with their own data. Best for: Creators with zero Python experience who want to be functional with pandas in under 15 hours total.
5. freeCodeCamp — Data Analysis with Python Certification (freecodecamp.org) FreeCodeCamp's free certification covers NumPy, pandas, matplotlib, and data analysis with Python. The curriculum is more structured than Kaggle and includes project-based assessments. The full certification takes 40–50 hours, but the pandas sections can be completed in under 15 hours independently. Videos are available on freeCodeCamp's YouTube channel for those who prefer video over text. Best for: Creators who want a more structured, comprehensive free Python education with a certificate.
6. "Python Crash Course" by Eric Matthes (3rd edition, 2023) The most recommended beginner Python book for non-programmers. Clear, friendly writing and project-based chapters that build confidence. The final project section includes a data visualization chapter using matplotlib. This book is available in most public libraries. If you have zero coding experience, this is the cleanest starting point before attempting the scripts in Chapter 24. Best for: Complete Python beginners who want a structured, reassuring book-format introduction.
Creator Analytics and Data Applications
7. "Marketing Data Science" by Thomas W. Miller (2015) Miller covers customer segmentation, including K-means clustering, in the context of marketing data — which maps directly to creator audience analytics. The book explains the theoretical basis for clustering while staying applied and business-focused. More accessible than pure statistics textbooks. Best for: Creators who want to understand the statistical foundations of audience segmentation.
8. "The Data Warehouse Toolkit" by Ralph Kimball and Margy Ross (3rd edition, 2013) This might seem like an extreme recommendation for a creator analytics chapter, but Kimball's dimensional data modeling concepts — specifically "fact tables" and "dimension tables" — are the conceptual framework behind every revenue attribution system, from simple UTM tracking to enterprise marketing data warehouses. Understanding these concepts helps you design better tracking systems from the beginning. Best for: Creators building more sophisticated attribution tracking systems who want to understand the data architecture behind it.
Specific Python Libraries — Documentation and Tutorials
9. pandas Documentation — Time Series / Date Functionality (pandas.pydata.org)
The official pandas documentation section on time series functionality is comprehensive and well-organized. Specifically relevant to growth_analysis.py: the section on DatetimeIndex, resampling (aggregating data by different time periods), and rolling windows. Bookmark and use as a reference when adapting the growth analysis script to your own data.
Best for: Reference documentation when adapting the growth analysis script to real platform data.
10. scikit-learn User Guide — Clustering (scikit-learn.org/stable/modules/clustering.html) The scikit-learn documentation's clustering section covers K-means, hierarchical clustering, DBSCAN, and more in plain language with code examples. The section on "Choosing the right clustering algorithm" is particularly useful for creators wondering whether K-means is always the best choice for audience segmentation (it usually is, but not always). Best for: Reference documentation when extending or troubleshooting the segmentation script.
Attribution and Analytics for Small Businesses
11. "Hacking Growth" by Sean Ellis and Morgan Brown (2017) Ellis and Brown popularized "growth hacking" and the "growth loop" concept. The book's analytical framework — identifying the key metric that predicts retention, then using data to systematically optimize that metric — is directly applicable to creator business analytics. The attribution chapters discuss UTM tracking and revenue attribution in accessible language. Best for: Creators interested in using data to systematically improve retention and conversion, not just measure them.
12. Google Analytics 4 Documentation — UTM Parameters (support.google.com)
If your sales platform or website uses Google Analytics 4, the GA4 UTM parameter documentation explains exactly how to set up tracking that integrates with the attribution models in revenue_attribution.py. The section on "Campaign parameters" and "Traffic-source dimensions" is most relevant. Google's UTM standard has become the industry default — even if you're not using GA4, understanding its conventions helps you set up compatible tracking across all platforms.
Best for: Creators who use a personal website or landing page for sales and want to integrate UTM tracking with web analytics.