Paul Black

Paul Black's Friends

  • Michael Levin

Gifts Received

Paul Black has not received any gifts yet

Paul Black's Page

Latest Activity

Camila Aug commented on Paul Black's blog post Octoparse vs. Import.io comparison: which is best for web scraping?
"Hi Paul, thanks for writing this up. I've been using Proxycrawl for the past year or so, and after looking at your post I decided to give Octoparse a try. Do you know if I can use Proxycrawl in the free Octoparse version? As you say it supports…"
Oct 2

Profile Information

How did you hear about Codetown?
Googling
What are your main interests in software development?
I have a passion for strategy games
Do you have a website?
http://www.octoparse.com
Anything else you'd like to add? Where do you live? (optional!)
Walnut, CA

Paul Black's Blog

How to Build a Web Crawler from Scratch – A Guide for Beginners

Posted on October 22, 2018 at 3:00am 0 Comments

Living in today's digital world has made our lives easier in many respects, as the internet has become the primary source for finding most of what we need. At the same time, this digital transformation has created new challenges for how data can be assessed, collected, stored, and analyzed.

The number of…

Web Scraping Spotlights: What is it and why you need to learn it?

Posted on August 30, 2018 at 7:00am 0 Comments

Originally posted at: https://www.octoparse.com/blog/what-is-web-scraping

What Is Web Scraping?

It is the process of extracting information and data from a website, transforming the information on a webpage into structured data for further analysis. Web scraping is also known as web harvesting or web data extraction. With the…

Web Scraping Service vs. Automatic Web Scraper: Which is the best option for web scraping?

Posted on September 1, 2017 at 3:43am 0 Comments

 

What is web scraping?

Web scraping, also known as web extraction or web crawling, refers to the process of obtaining unstructured information from websites and turning it into structured, clean data in formats such as XLS, CSV, or TXT, or loading the captured data directly into a database. Some common uses of web scraping include lead generation, data collection for academic research, price monitoring from…

Top 30 Big Data Tools for Data Analysis

Posted on August 27, 2017 at 10:42pm 0 Comments

There are thousands of big data tools available for data analysis today. Data analysis is the process of inspecting, cleaning, transforming, and modeling data with the goal of discovering useful information, suggesting conclusions, and supporting decision making. To save you time, in this post I will list 30 top big data tools for data analysis in the areas of open source data…

Comment Wall (2 comments)

At 10:28am on August 18, 2017, Michael Levin said…

Happy birthday, Paul!

At 7:37am on May 20, 2016, Michael Levin said…
Hi Paul. Welcome to Codetown. All the best, Michael
 
 
 

Happy 10th year, JCertif!

Notes

Welcome to Codetown!

Codetown is a social network. It's got blogs, forums, groups, personal pages and more! You might think of Codetown as a funky camper van with lots of compartments for your stuff and a great multimedia system, too! Best of all, Codetown has room for all of your friends.

When you create a profile for yourself you get a personal page automatically. That's where you can be creative and do your own thing. People who want to get to know you will click on your name or picture and…

Created by Michael Levin Dec 18, 2008 at 6:56pm. Last updated by Michael Levin May 4, 2018.

Looking for Jobs or Staff?

Check out the Codetown Jobs group.

There's also a free Java Jobs mailing list. It's a Yahoo group so you have to create a Yahoo account to use it.

 



InfoQ Reading List

Coming Next, JavaScript Private Class Fields & Methods

Private class fields are finally coming to JavaScript, with no fewer than three separate TC39 proposals that define the new capabilities.

By Guy Nesher

Presentation: High Performance Remote and Distributed Teams

Randy Shoup starts with the organization itself - how to form teams, give them scope, and manage their growth. He discusses communication strategies for getting the best out of far-flung teams, how to foster & maintain the human bonds and empathy critical to good work, and explores the human side. By looking beyond a single physical site, we can find better, more diverse, more motivated employees.

By Randy Shoup

GoDaddy Releases Automatic Canary Deployments Tool for Kubernetes

GoDaddy recently released an open-source tool to automate gated deployments in Kubernetes. Every time a deployment happens, the tool can run regression tests, and pull metrics from data backends like New Relic. After some time, the tool decides whether to roll back or continue with the deployment automatically. Users can run A/B tests and run experiments with a small portion of live traffic.

By Christian Melendez

Presentation: How to Evolve Kubernetes Resource Management Model

Jiaying Zhang talks about the current Kubernetes resource model and best-practice guidance on managing compute resources and specifying application resource requirements on Kubernetes. She discusses some work on extending the Kubernetes resource model to provide better resource isolation, support more diverse hardware, and promote consistent application performance across different compute platforms.

By Jiaying Zhang

Introducing Microsoft.Data.SqlClient

Continuing the effort to decouple Microsoft products from .NET Core itself, Microsoft is spinning off their SQL Server drivers into a separate deployment stream. This new package will be called Microsoft.Data.SqlClient and is intended to be a drop-in replacement for System.Data.SqlClient.

By Jonathan Allen

© 2019   Created by Michael Levin.