Chapter 4: Further Reading
Data Strategy and Data Literacy
Data Strategy
1. Leandro DalleMule and Thomas H. Davenport, "What's Your Data Strategy?" Harvard Business Review, May–June 2017. The foundational HBR article that distinguishes between "defensive" data strategy (governance, compliance, security) and "offensive" data strategy (analytics, AI, competitive advantage). Introduces a framework for balancing the two that directly parallels the CDO mandates discussed in this chapter. Essential reading for any executive involved in data strategy decisions.
2. Bernard Marr, Data Strategy: How to Profit from a World of Big Data, Analytics and Artificial Intelligence, 2nd ed. (Kogan Page, 2022). A practitioner-oriented guide that connects data strategy to business value across industries. Marr's strength is translating technical concepts for business audiences. Particularly useful for MBA students who want a comprehensive but accessible overview of data strategy components.
3. Edd Wilder-James, "Breaking Down Data Silos," Harvard Business Review, December 2016. A concise analysis of why data silos persist and practical approaches to integration. Covers organizational, cultural, and technical dimensions. The article's emphasis on incentive alignment resonates with the Athena story — silos persist not because people want fragmented data, but because organizational structures reward local optimization.
Data Governance
4. John Ladley, Data Governance: How to Design, Deploy, and Sustain an Effective Data Governance Program, 3rd ed. (Academic Press, 2024). The standard reference for data governance practitioners. Comprehensive coverage of governance frameworks, organizational models, and implementation strategies. The third edition includes updated guidance on governance for AI and cloud environments. Dense but authoritative.
5. DAMA International, DAMA-DMBOK: Data Management Body of Knowledge, 2nd ed. (Technics Publications, 2017). The industry-standard reference for data management, covering governance, quality, metadata, architecture, security, and more. The DMBOK is to data management what the PMBOK is to project management — a comprehensive body of knowledge that defines the profession. Best used as a reference rather than read cover to cover.
6. Laura Madsen, Disrupting Data Governance (Technics Publications, 2020). A provocative critique of traditional governance approaches that argues most governance programs are too bureaucratic, too slow, and too disconnected from business value. Madsen advocates for agile governance principles that prioritize speed and pragmatism over comprehensive coverage. A useful counterpoint to the more structured approaches in Ladley and DAMA-DMBOK.
Data Quality
7. Thomas C. Redman, Getting in Front on Data: Who Does What (Technics Publications, 2016). Redman — known as "the Data Doc" — has been writing about data quality for three decades. This book focuses on the organizational dynamics of data quality: who is responsible, how to measure quality, and how to build quality improvement into daily operations. More practical and readable than most data quality texts.
8. Thomas C. Redman, "If Your Data Is Bad, Your Machine Learning Tools Are Useless," Harvard Business Review, April 2018. A concise article that makes the case — with specific examples — that data quality is the binding constraint on AI adoption. Directly relevant to the chapter's argument that data quality is a prerequisite for AI success. Assign this as a quick read alongside the chapter.
The Chief Data Officer
9. Caroline Carruthers and Peter Jackson, The Chief Data Officer's Playbook, 2nd ed. (Facet Publishing, 2022). Written by two former CDOs, this book offers practical guidance on the first 100 days, stakeholder management, governance design, and building a data team. The second edition includes insights from CDOs across industries. Particularly valuable for students interested in data leadership careers.
10. Randy Bean and Thomas H. Davenport, "Are You 'Data Driven'? Most Companies Say Yes but Few Back It Up," MIT Sloan Management Review, Winter 2025. An analysis of the annual NewVantage Partners (Wavestone) survey of Fortune 1000 data executives, including findings on CDO tenure, organizational barriers, and the persistent gap between data-driven aspirations and reality. The longitudinal perspective — tracking the same survey over more than a decade — reveals how slowly organizational culture changes.
Master Data Management
11. Dalton Cervo and Mark Allen, Master Data Management in Practice: Achieving True Customer MDM, 2nd ed. (Wiley, 2024). A practical guide to MDM with a focus on customer data — the domain most relevant to the Athena scenario. Covers entity resolution, golden records, survivorship rules, and implementation patterns (registry, consolidation, coexistence). Includes case studies from retail, financial services, and healthcare.
12. Alex Berson and Larry Dubov, Master Data Management and Data Governance, 2nd ed. (McGraw-Hill, 2010). An older but still relevant comprehensive treatment of MDM and its relationship to data governance. The technical details of specific tools are dated, but the conceptual frameworks — particularly the discussion of MDM implementation styles — remain sound.
Data Architecture
13. Zhamak Dehghani, Data Mesh: Delivering Data-Driven Value at Scale (O'Reilly, 2022). The book-length treatment of data mesh by its originator. Dehghani argues for treating data as a product, organized by domain, served through standardized interfaces, and governed through federated computational policies. Whether or not your organization adopts data mesh, the book's critique of centralized data architecture and its emphasis on organizational incentives are valuable.
14. Bill Inmon, Building the Data Warehouse, 4th ed. (Wiley, 2005). The classic text by the "father of the data warehouse." While the technology landscape has changed dramatically since this edition was published, Inmon's principles — particularly his emphasis on subject-oriented, integrated, time-variant, and nonvolatile data — remain foundational to modern data architecture. Read for the concepts, not the technology recommendations.
15. James Serra, "Data Lakehouse — A Practical Introduction," Microsoft Tech Community Blog, 2021 (and subsequent updates). A clear, practitioner-level explanation of the data lakehouse pattern, including comparisons to traditional warehouse and lake architectures. Serra's blog posts are consistently among the best explanations of modern data architecture concepts for non-engineers.
Data Literacy
16. Jordan Morrow, Be Data Literate: The Data Literacy Skills Everyone Needs to Succeed (Kogan Page, 2021). Morrow led Qlik's global data literacy initiative and writes from deep experience designing data literacy programs for non-technical audiences. The book covers reading data, working with data, analyzing data, and communicating with data — the four pillars of data literacy. Accessible and practical.
17. Rahul Bhargava and Catherine D'Ignazio, "Designing Tools and Activities for Data Literacy Learners," Journal of Community Informatics, 2015. An academic paper that explores how to design effective data literacy learning experiences. Particularly relevant for readers interested in the pedagogical dimension — how do you teach data literacy to people who do not self-identify as "data people"? The paper's emphasis on using data relevant to learners' lives and communities is echoed in this chapter's recommendation for role-specific training.
18. Ben Jones, Avoiding Data Pitfalls: How to Steer Clear of Common Blunders When Working with Data and Presenting Analysis and Visualizations (Wiley, 2020). A practical catalogue of the mistakes that data-literate professionals should learn to recognize: misleading visualizations, statistical fallacies, biased samples, and communication failures. Useful as a reference for managers who need to evaluate data-driven arguments critically.
Privacy and Data Ethics
19. Ann Cavoukian, "Privacy by Design: The 7 Foundational Principles," Information and Privacy Commissioner of Ontario, 2009 (updated 2011). The original white paper that defined the privacy-by-design framework now embedded in GDPR. Cavoukian's seven principles — proactive not reactive, privacy as the default, privacy embedded into design, full functionality (positive-sum not zero-sum), end-to-end security, visibility and transparency, and respect for user privacy — are essential reading for anyone designing systems that handle personal data. Available free online.
20. Daniel J. Solove, Understanding Privacy (Harvard University Press, 2008). A foundational academic treatment of privacy that goes beyond legal compliance to explore what privacy means, why it matters, and how it should be protected. Solove's taxonomy of privacy harms (information collection, processing, dissemination, and invasion) provides a conceptual framework for evaluating data strategy decisions. More philosophical than practical, but deepens understanding of the stakes involved in data governance.
Case Study References
21. United Kingdom National Audit Office, "The National Programme for IT in the NHS: An Update on the Delivery of Detailed Care Records Systems," HC 888, Session 2010–12, 2011. The official government audit of the NPfIT programme referenced in Case Study 2. Details the programme's cost overruns, scope reductions, and governance failures. A sobering document for anyone involved in large-scale data infrastructure projects.
22. Paige Leskin, "The Capital One Data Breach Compromised Data of Over 100 Million People — Here's What Happened and How to Protect Yourself," Business Insider, July 2019 (and subsequent follow-up reporting). Contemporaneous reporting on the Capital One data breach referenced in Case Study 1. Useful for understanding the timeline, the technical details of the breach, and the public and regulatory response.
Industry Reports and Surveys
23. Wavestone (formerly NewVantage Partners), Data and AI Leadership Executive Survey, Annual (2012–present). The longest-running annual survey of Fortune 1000 data and AI executives. Tracks trends in CDO appointment, data culture maturity, AI adoption, and the persistent gap between data-driven aspirations and organizational reality. The survey's year-over-year comparisons are particularly valuable for understanding how slowly — and how consistently — organizational culture changes.
24. Gartner, "How to Measure, Monitor, and Improve Data Quality" (Research Note, updated annually). Gartner's practical guidance on implementing data quality measurement programs. Covers the six dimensions of data quality, scoring methodologies, and organizational best practices. Available to Gartner subscribers; many university libraries provide access.
25. MIT Center for Information Systems Research (CISR), "Data Monetization: Why, Where, and How" and related working papers. MIT CISR publishes research on how organizations create value from data — bridging the gap between data strategy theory and measurable business outcomes. Their research on "data-driven" versus "data-informed" decision-making cultures is particularly relevant to the data literacy discussion in this chapter.