Further Reading: Building Your Portfolio

This chapter focused on making your work visible and compelling. The resources below will help you refine your portfolio, prepare for interviews, and build the professional presence that gets you noticed.


Tier 1: Verified Sources

These are published books with full bibliographic details that we can confirm exist and recommend.

Austin Kleon, Show Your Work!: 10 Ways to Share Your Creativity and Get Discovered (Workman Publishing, 2014). This short, beautifully designed book argues that sharing your process — not just your finished products — is the key to building an audience and a career. Kleon's advice applies perfectly to data science portfolios: share what you're learning, show your thinking, and don't wait until you feel "ready." It's a two-hour read that might change how you think about putting your work out there.

Joel Grus, Data Science from Scratch: First Principles with Python (O'Reilly, 2nd edition, 2019). While not a portfolio-building book per se, Grus's approach — building things from scratch to understand how they work — is exactly the philosophy that produces strong portfolio projects. If you want to deepen the technical substance behind your portfolio pieces, this book is an excellent companion.

Hadley Wickham and Garrett Grolemund, R for Data Science (O'Reilly, 2nd edition, 2023). Even if you work primarily in Python, the communication and analytical thinking framework in this book is language-agnostic. The chapters on data transformation and visualization design are particularly relevant if you want your portfolio charts to look their best.

Gayle Laakmann McDowell, Cracking the Coding Interview (CareerCup, 6th edition, 2015). The classic technical interview preparation guide. While it's focused on software engineering interviews rather than data science specifically, the problem-solving framework and behavioral interview preparation sections are highly relevant. Many data science interviews include coding components, and this book's approach to thinking through problems systematically transfers well.

Edward Tufte, The Visual Display of Quantitative Information (Graphics Press, 2nd edition, 2001). If you want your portfolio visualizations to be genuinely exceptional, Tufte's foundational work on information design is essential reading. His principles — maximize data-ink ratio, avoid chartjunk, let the data speak — will make every chart in your portfolio better. Referenced in our Chapter 18, but worth revisiting with portfolio presentation in mind.

Jake VanderPlas, Python Data Science Handbook: Essential Tools for Working with Data (O'Reilly, 2nd edition, 2023). An excellent reference to keep nearby as you build portfolio projects. When you need to quickly look up how to do something in NumPy, pandas, matplotlib, or scikit-learn, VanderPlas has clear, well-organized coverage. Having this as a reference reduces the time between "I want to do X" and "I'm doing X."


Tier 2: Attributed Resources

These are talks, articles, and online resources that are well-known in the data science community. We attribute them to their creators and provide enough context for you to find them.

Rachel Thomas, "Making Peace with Personal Branding" (fast.ai blog). Thomas, co-founder of fast.ai, writes about why sharing your work publicly is important even when it feels uncomfortable, and offers practical advice for introverts and people who don't consider themselves "content creators." Her perspective is particularly valuable because it comes from someone who has hired many data scientists and knows what makes candidates stand out.

David Robinson, "Advice to Aspiring Data Scientists: Start a Blog" (2017). Robinson, a data scientist and author, argues that blogging is the single most effective thing aspiring data scientists can do to build visibility and demonstrate skills. His key point: a blog post about a data analysis you did is a portfolio piece, a writing sample, and a networking tool all in one.

Vicki Boykis, "Data Science is Different Now" (2019). Boykis's widely-shared essay describes how the data science job market has matured and what that means for aspiring practitioners. Her honest assessment of what employers actually need (hint: it's data wrangling and communication, not just modeling) informs the portfolio strategy we recommend in this chapter. Search for her name and the title.

Emily Robinson and Jacqueline Nolis, Build a Career in Data Science (Manning, 2020). This book covers the entire data science career arc — from getting your first job to advancing as a senior practitioner. The chapters on building a portfolio, preparing for interviews, and navigating the job search are directly relevant to the material in our chapter. Robinson and Nolis write from extensive experience on both sides of the hiring table.

Chip Huyen, Introduction to Machine Learning Interviews Book (2021). Huyen's freely available book covers ML interview preparation comprehensively, including system design, coding, and behavioral questions. While more advanced than what most readers of this chapter need right now, it's an excellent resource to revisit when you're preparing for interviews at ML-focused companies.

Ken Jee (YouTube and social media). Jee has built a significant following by documenting his data science journey publicly and offering practical advice on portfolio building, resume writing, and interview preparation. His "data science project from scratch" series shows the process of building a portfolio piece from idea to completion.


Different readers will want different things after this chapter:

  • If you need to build your portfolio right now: Start with Exercise 34.6 (writing your project introduction) and work through the Part B exercises. Set a deadline — give yourself two weeks to produce a polished vaccination analysis portfolio piece. Read Austin Kleon's Show Your Work! for motivation.

  • If you're preparing for interviews: Read the Emily Robinson and Jacqueline Nolis book for comprehensive career advice. Practice the STAR-D framework from this chapter. If you're targeting ML-heavy roles, add Chip Huyen's interview book.

  • If you want to start blogging: Read David Robinson's essay on starting a blog. Pick one project from your portfolio and write it up as a blog post this week. Don't wait until it's perfect — publish it, learn from the experience, and iterate.

  • If you want to improve your GitHub profile: Spend two focused hours this weekend on your profile: write a profile README, organize your repositories, write or improve your project READMEs, and pin your best work. The impact-per-hour ratio of this activity is extremely high.

  • If you're not sure what kind of data science role to pursue: Read Vicki Boykis's essay for a realistic view of the job market. Then skip ahead to Chapter 36, which maps career paths in detail.


Online Resources Worth Bookmarking

These are platforms and communities referenced in the chapter that are useful for portfolio building:

  • GitHub — Your portfolio home base. Learn GitHub Pages if you want a free personal website.
  • Kaggle Datasets — For finding interesting datasets (use the datasets section, not just competitions).
  • Google Dataset Search — A search engine specifically for datasets.
  • data.gov / data.gov.uk / EU Open Data Portal — Government open data from multiple countries.
  • Medium / Towards Data Science — Popular platforms for data science blog posts.
  • LinkedIn — Professional networking; update your profile as described in the chapter.
  • Meetup.com — Find local data science and Python meetups.

Remember: the best portfolio is one that exists. Perfectionism is the enemy of visibility. Get your work out there, iterate, and improve over time. The people who get hired are not the people with the most skills — they're the people who make their skills visible.