When you're learning data science, you usually practice with nice, clean, pre-packaged data sets and tidy case studies that lead you step-by-step from data collection to cool insights.
But when real life hits, many data scientists have to work with missing or sketchy information extracted from (multiple) sources in the organization. Data science that works is a messy, trial-and-error process of creating and testing hypotheses, gathering evidence, and drawing conclusions.
Going Pro in Data Science: What It Takes to Succeed as a Professional Data Scientist, by distinguished CSC engineer Jerry Overton, outlines practices for making good decisions in the complicated real world. These skills are far more useful for practicing data scientists than, say, mastering the details of a machine-learning algorithm.
It's an incredibly practical ebook. And it's free.
Enjoy!

Download the free ebook → http://www.oreilly.com/data/free/going-pro-in-data-science.csp?imm_...

Ben Lorica
Chief Data Scientist, O'Reilly Media
P.S. Jerry Overton is also presenting a half-day tutorial on the topic at Strata + Hadoop World in NY in September, providing in-depth education in data science, big data architecture, and analytics for business. As an O'Reilly customer, get 30% off Early Price with code DATA30 by registering by August 12.

Views: 137

Comment

You need to be a member of Codetown to add comments!

Join Codetown

Happy 10th year, JCertif!

Notes

Welcome to Codetown!

Codetown is a social network. It's got blogs, forums, groups, personal pages and more! You might think of Codetown as a funky camper van with lots of compartments for your stuff and a great multimedia system, too! Best of all, Codetown has room for all of your friends.

When you create a profile for yourself you get a personal page automatically. That's where you can be creative and do your own thing. People who want to get to know you will click on your name or picture and…
Continue

Created by Michael Levin Dec 18, 2008 at 6:56pm. Last updated by Michael Levin May 4, 2018.

Looking for Jobs or Staff?

Check out the Codetown Jobs group.

 

Enjoy the site? Support Codetown with your donation.



InfoQ Reading List

Temporal and OpenAI Launch AI Agent Durability with Public Preview Integration

Temporal has unveiled a public preview integration with the OpenAI Agents SDK, introducing durable execution capabilities to AI agent workflows built using OpenAI's framework.

By Craig Risi

Green IT: How to Reduce IT’s Environmental Footprint

Green IT focuses on reducing IT’s environmental footprint, by rethinking how you build, deploy, and power IT systems. At QCon London, Ludi Akue presented how her team did a lifecycle assessment, set a 10% emissions reduction goal, simplified architecture, and optimized frontends, to align with climate goals.

By Ben Linders

Presentation: Myth Busters: Is Rust a Slam Dunk?

Ramya Krishnamoorthy shares a detailed case study on rewriting Momento's high-performance data platform from Kotlin to Rust. She covers the technical challenges, including garbage collection bottlenecks and multithreaded contention, and the business trade-offs involved in adopting a new language to achieve predictable low tail latencies and maximize cost efficiency for their serverless services.

By Ramya Krishnamoorthy

.NET 10 RC 1: Introduces Persistent State in Blazor, Enhanced Validation, and Production-Ready Tools

Last week, Microsoft announced the release of .NET 10 RC 1, the first of two release candidates ahead of the final version. As stated by the .NET team, this build comes with a go-live license, allowing developers to use it in production environments with official support. It is available alongside Visual Studio 2026 Insiders and is supported in Visual Studio Code through the C# Dev Kit.

By Almir Vuk

Open Practices for Architecture and AI Adoption

Andrea Magnorsky presented on Byte-Sized Architecture at Cloud Native Summit 2025, as a format for building shared understanding through small, recurrent workshops. Ahilan Ponnusamy and Andreas Grabner discussed the Technology Operating Model for AI adoption. Both approaches drew on the Open Practice Library for human-centred collaboration and driving architectural evolution.

By Rafiq Gemmail

© 2025   Created by Michael Levin.   Powered by

Badges  |  Report an Issue  |  Terms of Service