When you're learning data science, you usually practice with nice, clean, pre-packaged data sets and tidy case studies that lead you step-by-step from data collection to cool insights.
But when real life hits, many data scientists have to work with missing or sketchy information extracted from (multiple) sources in the organization. Data science that works is a messy, trial-and-error process of creating and testing hypotheses, gathering evidence, and drawing conclusions.
Going Pro in Data Science: What It Takes to Succeed as a Professional Data Scientist, by distinguished CSC engineer Jerry Overton, outlines practices for making good decisions in the complicated real world. These skills are far more useful for practicing data scientists than, say, mastering the details of a machine-learning algorithm.
It's an incredibly practical ebook. And it's free.
Enjoy!

Download the free ebook → http://www.oreilly.com/data/free/going-pro-in-data-science.csp?imm_...

Ben Lorica
Chief Data Scientist, O'Reilly Media
P.S. Jerry Overton is also presenting a half-day tutorial on the topic at Strata + Hadoop World in NY in September, providing in-depth education in data science, big data architecture, and analytics for business. As an O'Reilly customer, get 30% off Early Price with code DATA30 by registering by August 12.

Views: 137

Comment

You need to be a member of Codetown to add comments!

Join Codetown

Happy 10th year, JCertif!

Notes

Welcome to Codetown!

Codetown is a social network. It's got blogs, forums, groups, personal pages and more! You might think of Codetown as a funky camper van with lots of compartments for your stuff and a great multimedia system, too! Best of all, Codetown has room for all of your friends.

When you create a profile for yourself you get a personal page automatically. That's where you can be creative and do your own thing. People who want to get to know you will click on your name or picture and…
Continue

Created by Michael Levin Dec 18, 2008 at 6:56pm. Last updated by Michael Levin May 4, 2018.

Looking for Jobs or Staff?

Check out the Codetown Jobs group.

 

Enjoy the site? Support Codetown with your donation.



InfoQ Reading List

Article: Best Practices to Build Energy-Efficient AI/ML Systems

In this article, author Lakshmithejaswi Narasannagari discusses the sustainable innovations in AI/ML technologies, how to track carbon footprint in all stages of ML systems lifecycle and best practices for model development and deployment.

By Lakshmithejaswi Narasannagari

Temporal on AWS Aims to Ease Building Resilient Distributed Systems

Temporal's open-source microservices orchestration platform leverages AWS to enhance durable execution, simplifying the development of resilient, fault-tolerant applications. By ensuring seamless recovery from system failures, Temporal helps businesses navigate the challenges of distributed systems, enabling improved data integrity and operational efficiency during peak demands.

By Steef-Jan Wiggers

Article: InfoQ Culture and Methods Trends Report - 2025

This report summarizes how the InfoQ Culture and Methods editorial team sees the ongoing and emergent trends in the culture and methods space.

By Shane Hastie, Charity Majors, Ben Linders, Rafiq Gemmail, Craig Smith

Podcast: InfoQ Culture & Methods Trends in 2025

By Charity Majors, Ben Linders, Rafiq Gemmail, Craig Smith, Shane Hastie

Presentation: Stream All the Things — Patterns of Effective Data Stream Processing

Adi Polak discusses patterns for effective data stream processing, highlighting common pitfalls and the complexities of balancing data infrastructure. Learn about exactly-once semantics, the challenges of join operations in streaming (including the "Puppies shelter" concept), and crucial error handling strategies.

By Adi Polak

© 2025   Created by Michael Levin.   Powered by

Badges  |  Report an Issue  |  Terms of Service