When you're learning data science, you usually practice with nice, clean, pre-packaged data sets and tidy case studies that lead you step-by-step from data collection to cool insights.
But when real life hits, many data scientists have to work with missing or sketchy information extracted from (multiple) sources in the organization. Data science that works is a messy, trial-and-error process of creating and testing hypotheses, gathering evidence, and drawing conclusions.
Going Pro in Data Science: What It Takes to Succeed as a Professional Data Scientist, by distinguished CSC engineer Jerry Overton, outlines practices for making good decisions in the complicated real world. These skills are far more useful for practicing data scientists than, say, mastering the details of a machine-learning algorithm.
It's an incredibly practical ebook. And it's free.
Enjoy!

Download the free ebook → http://www.oreilly.com/data/free/going-pro-in-data-science.csp?imm_...

Ben Lorica
Chief Data Scientist, O'Reilly Media
P.S. Jerry Overton is also presenting a half-day tutorial on the topic at Strata + Hadoop World in NY in September, providing in-depth education in data science, big data architecture, and analytics for business. As an O'Reilly customer, get 30% off Early Price with code DATA30 by registering by August 12.

Views: 141

Comment

You need to be a member of Codetown to add comments!

Join Codetown

Happy 10th year, JCertif!

Notes

Welcome to Codetown!

Codetown is a social network. It's got blogs, forums, groups, personal pages and more! You might think of Codetown as a funky camper van with lots of compartments for your stuff and a great multimedia system, too! Best of all, Codetown has room for all of your friends.

When you create a profile for yourself you get a personal page automatically. That's where you can be creative and do your own thing. People who want to get to know you will click on your name or picture and…
Continue

Created by Michael Levin Dec 18, 2008 at 6:56pm. Last updated by Michael Levin May 4, 2018.

Looking for Jobs or Staff?

Check out the Codetown Jobs group.

 

Enjoy the site? Support Codetown with your donation.



InfoQ Reading List

Presentation: Empathy Driven Platforms: You Build It, Let’s Run It Together

Erin Doyle explains the evolution from siloed IT Ops to the Platform Team model, revealing why the "You Build It, You Run It" principle created new cognitive load. She shares the Empathy-Driven Platforms strategy - the ultimate attack against engineering roadblocks. Discover ways platform teams can build empathy, foster psychological safety, and adopt a product mindset.

By Erin Doyle

Accessibility with Interactive Components at React Advanced Conf

Dynamic React speaker Aurora Scharff captivated attendees at React Advanced 2025 with her talk on "Building Interactive Async UI with React 19 and Ariakit." She showcased ARIAKit, an open-source accessibility library that empowers developers to create WCAG-compliant components effortlessly, blending modern React patterns with customizable, accessible UI primitives.

By Daniel Curtis

Podcast: Looking for Root Causes is a False Path: A Conversation with David Blank-Edelman

In this podcast, Michael Stiefel spoke with David Blank-Edelman about the relationship between software architecture and site reliability engineering. Site reliability engineering can give architecture vital feedback about how the system actually behaves in production. Architects and designers can then learn from their failures to improve their ability to build systems that can evolve.

By David Blank-Edelman

Java News Roundup: Spring Cloud, Quarkus, Hibernate ORM, JobRunr, LangChain4j, Java Operator SDK

This week's Java roundup for November 24th, 2025, features news highlighting: point releases of Spring Cloud, Quarkus, Hibernate ORM, JobRunr, LangChain4j and Java Operator SDK; first release candidates of Hibernate Reactive and Gradle; and a maintenance release of Keycloak.

By Michael Redlich

Helm Improves Kubernetes Package Management with Biggest Release in 6 Years

Helm, the Kubernetes application package manager, has officially reached version 4.0.0. Helm 4 is the first major upgrade in six years, and also marks Helm's 10th anniversary under the guidance of the Cloud Native Computing Foundation (CNCF). The update aims to address several challenges around scalability, security, and developer workflow.

By Matt Saunders

© 2025   Created by Michael Levin.   Powered by

Badges  |  Report an Issue  |  Terms of Service