Codetown ::: a software developer's community
I'm currently looking into big data because it's very interesting to me.
Clare Corthell published a data science curriculum that reads like an encyclopedia: http://datasciencemasters.org/
The US government publishes a site called http://www.data.gov/ with over 160,000 datasets. "The home of the U.S. Government’s open data. Here you will find data, tools, and resources to conduct research, develop web and mobile applications, design data visualizations, and more." As I browsed around, I found a link to the Census Bureau's TIGER database. It's a database I worked with when I first got out of college.
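If you'd rather search the catalog from code than browse it, here is a minimal sketch. It assumes the Data.gov catalog is served by CKAN at catalog.data.gov and exposes CKAN's standard `package_search` action; neither detail comes from this post, so check the current Data.gov developer docs before relying on it. The helper names and the sample response are hypothetical, and the sketch only builds the URL and parses a response, it doesn't make a network call.

```python
from urllib.parse import urlencode

# Assumption: the Data.gov catalog is a CKAN instance at this base URL.
CATALOG_API = "https://catalog.data.gov/api/3/action/package_search"


def build_search_url(query, rows=5):
    """Return a CKAN package_search URL for a free-text query."""
    return CATALOG_API + "?" + urlencode({"q": query, "rows": rows})


def dataset_titles(response):
    """Pull dataset titles out of a decoded package_search response."""
    return [pkg["title"] for pkg in response["result"]["results"]]


# Hand-written stand-in for a decoded JSON response, not real catalog data.
sample_response = {
    "success": True,
    "result": {"count": 1, "results": [{"title": "Example dataset"}]},
}

print(build_search_url("TIGER"))
print(dataset_titles(sample_response))
```

To run it for real, you would fetch the URL with your HTTP client of choice, decode the JSON body, and pass the result to `dataset_titles`.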
The image you see at the beginning of this article is from Citymapper, an app that uses public data you can get from Data.gov. I'm about to download it. Judging from the comments, it's a life changer for some people.
Are you working with big data? Tell me about it!
O'Reilly Media is offering unlimited 30-day access to the Safari library of over 30,000 books and videos from over 200 publishers and imprints. Keith Spurr called Safari the “best technical learning resource on the internet.” You’ll have instant access to hundreds of books and videos on all aspects of big data. Check out this recommended data reading list. Keep up to date on Safari-related news @safari or @safaribot.
Link: http://www.oreilly.com/pub/cpc/1551
O’Reilly Releases Its 2015 Data Science Salary Survey
For the third consecutive year, O’Reilly Media conducted an anonymous survey to expose the tools that successful data scientists and engineers use, and how those tool choices might relate to their salary.
Want to know what you need to know to earn the big bucks? Knowledge of certain tools can increase your salary more than getting a Ph.D. Curious what clusters of tools are most commonly used together? Or what job titles pay the best? It's all there.
Gain insight from these potentially career-changing findings, and plug your own variables into one of the linear models to predict your own salary.
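To make "plug your own variables into a linear model" concrete, here is a rough sketch of how such a prediction works: an intercept plus one coefficient per variable. Every number and variable name below is made up for illustration and is NOT taken from the O'Reilly survey; swap in the published coefficients if you want a real estimate.

```python
def predict_salary(intercept, coefficients, features):
    """Linear model: intercept + sum of coefficient * feature value.

    Features missing from `features` are treated as 0.
    """
    return intercept + sum(
        coef * features.get(name, 0) for name, coef in coefficients.items()
    )


# Hypothetical coefficients, invented for this example (not survey results).
INTERCEPT = 50_000
COEFFICIENTS = {
    "years_experience": 2_000,  # dollars per year of experience
    "uses_spark": 10_000,       # 1 if you use the tool, 0 otherwise
}

me = {"years_experience": 5, "uses_spark": 1}
print(predict_salary(INTERCEPT, COEFFICIENTS, me))  # 50000 + 5*2000 + 10000 = 70000
```

Tool usage goes in as a 0/1 flag, so its coefficient reads directly as the salary difference the model associates with using that tool.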
Codetown is a social network. It's got blogs, forums, groups, personal pages and more! You might think of Codetown as a funky camper van with lots of compartments for your stuff and a great multimedia system, too! Best of all, Codetown has room for all of your friends.
Created by Michael Levin Dec 18, 2008 at 6:56pm. Last updated by Michael Levin May 4, 2018.
© 2026 Created by Michael Levin.