Further Reading: Data Stewardship and the Chief Data Officer

The sources below provide deeper engagement with the themes introduced in Chapter 27. They include practitioner guides, academic research, industry reports, and critical perspectives on data governance. Annotations describe what each source covers and why it is relevant.


The CDO Role and Data Governance Leadership

NewVantage Partners. "Data and AI Leadership Executive Survey 2024." NewVantage Partners, 2024. The annual survey of Fortune 1000 companies that the chapter cites for CDO tenure and organizational adoption statistics. The survey provides longitudinal data on how organizations perceive and invest in data leadership. Essential for understanding the gap between aspiration ("data-driven culture") and reality (only 24% of surveyed organizations report having achieved one).

Aiken, Peter, and Juanita Billings. Monetizing Your Data: A Guide to Turning Data into Profit-Driving Strategies and Solutions. Hoboken, NJ: John Wiley & Sons, 2014. A practitioner-oriented guide that argues for treating data as a strategic asset requiring executive-level oversight. Aiken and Billings provide frameworks for data valuation, governance design, and organizational change management. Their emphasis on the CDO's strategic role anticipates the "Phase 4: Strategic" evolution described in the chapter.

Thomas, Gabrielle (ed.). "The Chief Data Officer's Playbook." Data Governance Professionals Organization, 2023. A collaborative guide from practicing CDOs that addresses the practical challenges of the role: building a team, securing executive buy-in, navigating organizational politics, and demonstrating value. Particularly useful for understanding why CDO tenure is short and what organizational conditions enable success.


Data Stewardship Models and Frameworks

DAMA International. DAMA-DMBOK: Data Management Body of Knowledge. 2nd ed. Bradley Beach, NJ: Technics Publications, 2017. The comprehensive reference for data management professionals. DAMA-DMBOK covers data governance, data quality, data architecture, metadata management, and data stewardship in detail. Its data governance framework -- distinguishing strategic, tactical, and operational governance -- provides the theoretical foundation for the stewardship models discussed in Section 27.2.

Plotkin, David. Data Stewardship: An Actionable Guide to Effective Data Management and Data Governance. Amsterdam: Academic Press, 2020. A detailed practitioner guide focused specifically on the data steward role. Plotkin covers steward selection, training, tools, and organizational integration. Particularly strong on the difference between data stewardship as a role and data stewardship as an organizational practice -- a distinction the chapter draws upon.

Ladley, John. Data Governance: How to Design, Deploy, and Sustain an Effective Data Governance Program. 2nd ed. Amsterdam: Academic Press, 2019. A comprehensive guide to data governance program design, from organizational assessment through implementation to sustainability. Ladley's emphasis on governance as a business capability (not a compliance exercise) aligns with the chapter's argument that governance enables rather than constrains organizational objectives.


Data Catalogs and Data Lineage

Seiner, Robert S. Non-Invasive Data Governance: The Path of Least Resistance and Greatest Success. Basking Ridge, NJ: Technics Publications, 2014. Seiner's "non-invasive" approach argues that data governance should leverage existing organizational roles and processes rather than imposing new bureaucratic structures. His methodology for data catalog development -- starting with existing knowledge and building incrementally -- is practical and addresses the organizational resistance that the chapter's case study describes.

Alation, Inc. "The State of Data Governance and Empowerment Report." Alation, 2023. An industry report (produced by a data catalog vendor, so read with appropriate skepticism) that surveys how organizations use data catalogs and the challenges they encounter. The report provides useful statistics on catalog adoption, common implementation challenges, and the organizational benefits of comprehensive data documentation.

Halevy, Alon, Flip Korn, Natalya Fridman Noy, Christopher Olston, Neoklis Polyzotis, Sudip Roy, and Steven Euijong Whang. "Goods: Organizing Google's Datasets." In Proceedings of the 2016 ACM SIGMOD International Conference on Management of Data, 795-806. A technical paper describing Google's internal data catalog system. While the scale is atypical (Google manages billions of datasets), the paper illustrates the challenges of data cataloging at scale: automated discovery, metadata extraction, lineage tracking, and classification. Provides technical context for the catalog concepts discussed in Section 27.3.


Data Quality and Ethics

Batini, Carlo, and Monica Scannapieco. Data and Information Quality: Dimensions, Principles and Techniques. Cham: Springer, 2016. The standard academic text on data quality. Batini and Scannapieco define quality dimensions (accuracy, completeness, timeliness, consistency) and provide measurement and improvement methodologies. Their framework for quality assessment connects directly to the chapter's argument that data quality is an ethical concern.

Redman, Thomas C. "If Your Data Is Bad, Your Machine Learning Tools Are Useless." Harvard Business Review, April 2, 2018. An accessible article arguing that organizations investing in AI and machine learning while neglecting data quality are building on a flawed foundation. Redman's central point -- that poor data quality propagates through analytical pipelines and amplifies in model outputs -- reinforces the chapter's connection between data quality and ethical outcomes.


Critical Perspectives on Data Governance

Milan, Stefania, and Lonneke Van der Velden. "The Algorithmist: Data Ownership and Governance in the Digital Age." European Journal of Social Theory 19, no. 2 (2016): 195-212. A critical examination of the governance frameworks that determine who controls data and whose interests data governance serves. Milan and Van der Velden argue that existing governance models disproportionately serve organizational interests over the interests of data subjects -- a critique that connects to the chapter's discussion of the stewardship metaphor's limitations.

Zuboff, Shoshana. The Age of Surveillance Capitalism: The Fight for a Human Future at the New Frontier of Power. New York: PublicAffairs, 2019. Zuboff's influential critique of data extraction practices provides essential context for understanding why data stewardship matters. Her concept of "behavioral surplus" -- data collected beyond what is needed for service delivery, extracted for commercial purposes -- challenges the stewardship model to address not just how data is managed but whether it should have been collected in the first place.

Couldry, Nick, and Ulises A. Mejias. The Costs of Connection: How Data Is Colonizing Human Life and Appropriating It for Capitalism. Stanford: Stanford University Press, 2019. Couldry and Mejias argue that the datafication of human life represents a new form of colonialism -- the appropriation of human experience as raw material. Their framework challenges data stewardship to move beyond responsible management of existing data practices to questioning whether those practices are justified. A provocative counterpoint to the practitioner-oriented governance literature.

Taylor, Linnet. "What Is Data Justice? The Case for Connecting Digital Rights and Freedoms Globally." Big Data & Society 4, no. 2 (2017): 1-14. Taylor's concept of data justice provides a framework for evaluating data governance from the perspective of affected communities -- particularly marginalized communities in the Global South. Her work challenges data stewardship models to incorporate questions of justice, equity, and community self-determination alongside traditional governance concerns of compliance and efficiency.


These readings span from technical practitioner guides to critical social theory. Effective data stewardship requires both: the practical tools to manage data responsibly, and the critical perspective to question whether "responsible management" is sufficient when the data practices themselves are unjust.