All Blog Posts (388)

O'Reilly Conference Diversity and Inclusion Scholarship Program


At O'Reilly, they believe that true innovation depends on hearing from, and listening to, people with a variety of perspectives. They want their conferences, and the technology communities and companies who participate in them, to include, encourage, and recognize people of all races, ethnicities,…

Continue

Added by Michael Levin on February 11, 2017 at 7:19pm — No Comments

Humble Bundle 2017

O'Reilly has another Humble Book Bundle, this time all about hacks.

Fill your library with practical and creative hacks with O'Reilly's latest Humble Bundle. Readers can pay any price they choose and support a charity at the same time! Bundle Ends …

Continue

Added by Michael Levin on February 11, 2017 at 7:17pm — No Comments

Jfokus!

Jfokus has been under way in Stockholm. Strictly speaking it ended yesterday, but the ripples from the great talks and conversations usually linger.

https://www.jfokus.se/jfokus/

I noticed Matt Raible is there, along with a bunch of people you probably know.

If you have news to report about Jfokus or just want to keep up with what's going on, stay tuned and please add comments.

Added by Michael Levin on February 9, 2017 at 8:30am — No Comments

Scraping News Articles from TheStreet.com

Octoparse enables you to scrape the latest news articles from news sources.

Getting real-time data in Octoparse takes two steps: make a scraping task, then schedule it to run in the Octoparse cloud.

 

In this web scraping tutorial we will scrape the latest news articles from TheStreet.com to get article information such as the title, body text, publication date and time, author, and article URL with…

Continue

Added by Paul Black on January 8, 2017 at 10:57pm — No Comments

Scraping Stock Data on Yahoo Finance

Octoparse enables you to scrape finance data from financial websites. Getting real-time data in Octoparse takes two steps: make a scraping task, then schedule it to run in the Octoparse cloud.

 

In this web scraping tutorial we will use Octoparse to scrape stock data from Yahoo Finance, such as the most active stocks, stock gainers, and stock losers.

The website URLs we will use are as follows.…

Continue

Added by Paul Black on January 5, 2017 at 11:05pm — No Comments

Scraping Stock Information from CNN Money

Octoparse enables you to scrape stock information from financial websites. Getting real-time data in Octoparse takes two steps: make a scraping task, then schedule it to run in the Octoparse cloud.

 

In this web scraping tutorial we will use Octoparse to scrape stock data from the CNN Money website and get detailed information such as the title, body, publication date and time, author, and article URL.

The website URL we will use is…

Continue

Added by Paul Black on January 5, 2017 at 11:01pm — No Comments

How to add a fixed value when scraping in Octoparse?

Q: How to add a fixed value when scraping in Octoparse?

 

Description:

How do you add a fixed value as one of the data fields when making a scraping task in Octoparse?

  

A:

The simplest method:

You can add a fixed value when you are in the "Extract Data" action:

1. Click "Add Pre-defined Fields".

 

2. Choose the “Add a fixed value…

Continue

Added by Paul Black on January 5, 2017 at 5:20am — No Comments
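
The idea of a fixed-value field can also be illustrated outside Octoparse. The following is a minimal Python sketch, not Octoparse's own mechanism: the function name `add_fixed_field`, the field name `source`, and the sample records are all hypothetical. It simply attaches the same constant to every scraped row.

```python
from typing import Dict, List


def add_fixed_field(rows: List[Dict[str, str]], name: str, value: str) -> List[Dict[str, str]]:
    """Return copies of the rows, each with one extra constant-valued field."""
    return [{**row, name: value} for row in rows]


# Hypothetical scraped records, for illustration only.
scraped = [
    {"title": "First item"},
    {"title": "Second item"},
]

# Every row receives the same "source" value; the input rows are left unchanged.
tagged = add_fixed_field(scraped, "source", "example-site")
```

A fixed field like this is handy for recording where or when a batch of rows was collected, so that data from several tasks can be merged later.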

Scraping Yelp Reviews

Octoparse enables you to scrape reviews from yelp.com.    

 

In this tutorial we will use Octoparse to scrape all reviews for car audio in Brooklyn, NY, United States from yelp.com.

The website URL we will use is …

Continue

Added by Paul Black on January 3, 2017 at 1:41am — No Comments

Scraping Hotel Reviews from Tripadvisor.com

In this tutorial we will scrape the phone numbers of all the hotels and their customer reviews in London from TripAdvisor.com with Octoparse.

The website URL we will use is https://www.tripadvisor.com/Hotels-g186338-London_England-Hotels.html.

The data fields include Hotel name, the number of reviews, address, ranking, PhoneNumber, customer…

Continue

Added by Paul Black on January 3, 2017 at 1:39am — No Comments

Scraping Restaurant Information from yell.com

Octoparse enables you to scrape the search results from Yell.com. After you enter the items you want to search for in a certain region, clicking the “Search” button redirects you to the search results page.

 

In this tutorial we will scrape data about all restaurants in London from yell.com with Octoparse.

Then we will use the URL of the…

Continue

Added by Paul Black on January 3, 2017 at 1:37am — No Comments

Scraping Product Detail Pages from eBay.com

Octoparse enables you to scrape data from eBay.com. To speed up the extraction, you can use our Cloud Extraction to split the scraping task into many sub-tasks. Our cloud servers then collect the data quickly and provide you with a structured dataset.

To scrape product details from eBay.com as fast as possible, you can make two scraping tasks: Task 1 and Task 2. Task 1 scrapes the URLs of the product detail pages, and Task 2 scrapes all the product details from …

Continue

Added by Paul Black on January 3, 2017 at 1:35am — No Comments
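
The two-task split described in this post can be sketched in plain Python. This is only an illustration under stated assumptions, not how Octoparse or its cloud servers work: the in-memory `LISTINGS` and `DETAILS` tables stand in for real pages, all names are hypothetical, and a thread pool is used as a stand-in for splitting work into sub-tasks.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List

# Hypothetical in-memory "site": listing pages map to product URLs,
# and product URLs map to detail records. No network access is involved.
LISTINGS: Dict[str, List[str]] = {
    "list/page1": ["item/1", "item/2"],
    "list/page2": ["item/3"],
}
DETAILS: Dict[str, Dict[str, str]] = {
    "item/1": {"name": "Widget"},
    "item/2": {"name": "Gadget"},
    "item/3": {"name": "Gizmo"},
}


def task1_collect_urls(listing_pages: List[str]) -> List[str]:
    """Task 1: gather every product URL from the listing pages."""
    urls: List[str] = []
    for page in listing_pages:
        urls.extend(LISTINGS[page])
    return urls


def task2_scrape_details(urls: List[str], workers: int = 2) -> List[Dict[str, str]]:
    """Task 2: look up details for each URL, spread across worker threads."""
    def fetch(url: str) -> Dict[str, str]:
        return {"url": url, **DETAILS[url]}

    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Executor.map() yields results in the same order as the input URLs.
        return list(pool.map(fetch, urls))


product_urls = task1_collect_urls(["list/page1", "list/page2"])
products = task2_scrape_details(product_urls)
```

Separating URL collection from detail extraction means the second stage receives a flat list of independent items, which is what makes it easy to divide among workers.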

Scrape Data from YellowPages.com

Octoparse enables you to scrape yellowpages.com (www.yp.com). You can capture names, addresses, cities, phone numbers, websites, etc. for a certain profession in a region listed on yellowpages.com.

  

In this tutorial we will use Octoparse to scrape all anesthesiologists in New York, NY, United States from yellowpages.com.

The website URL we will use is …

Continue

Added by Paul Black on January 3, 2017 at 1:33am — No Comments

Scraping Online Dictionary - Merriam-Webster.com

Octoparse enables you to scrape an online dictionary into an organized list by entering a list of words. It is easy to use: a Loop mode that takes a text list as input retrieves the definition and examples for each word you want.

 

In this tutorial, I will show you how to scrape the definitions of some words from merriam-webster.com.

The website URL we will use is …

Continue

Added by Paul Black on December 29, 2016 at 9:43pm — No Comments
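
Looping over a text list, as this post describes, amounts to running the same extraction once per input word. Here is a minimal Python sketch of that pattern, assuming a hypothetical `lookup` function and a made-up `FAKE_DICTIONARY` in place of a real dictionary page; it is not Octoparse's Loop mode itself, and the placeholder definitions are not taken from any real dictionary.

```python
from typing import Callable, Dict, List, Optional


def scrape_word_list(
    words: List[str],
    lookup: Callable[[str], Optional[str]],
) -> List[Dict[str, Optional[str]]]:
    """Loop over the words and collect one row per word, in input order."""
    rows: List[Dict[str, Optional[str]]] = []
    for word in words:
        rows.append({"word": word, "definition": lookup(word)})
    return rows


# Hypothetical stand-in for a dictionary site; a real task would fetch and parse a page.
FAKE_DICTIONARY: Dict[str, str] = {
    "scrape": "placeholder definition for scrape",
    "parse": "placeholder definition for parse",
}

# Words missing from the stand-in dictionary get a definition of None.
rows = scrape_word_list(["scrape", "parse", "octopus"], FAKE_DICTIONARY.get)
```

Keeping a row even when no definition is found makes it easy to see afterwards which words in the list failed.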

Web Scraping|Scrape Booking Reviews

 

(picture from www.luxurybackpacker.com)

 

Collecting online customer reviews, including star ratings, comments, likes, dislikes, images, videos, share channels, and so on, can help an online retailer better understand whether a product is a good purchase and popular among customers, and adjust its marketing strategy accordingly. There are many web scraping tools available online to live up to your expectations to scrape…

Continue

Added by Paul Black on December 29, 2016 at 9:00pm — No Comments

10 Essential Tutorials That Every Octoparse Newbie Should Know

Octoparse offers the most convenient way to scrape data from websites. Although little programming knowledge is required, some users still say they have no idea how to use Octoparse. This post aims to help new users settle into Octoparse smoothly.

 

Below you will find links to 10 of the most helpful tutorials to help you take your first steps in Octoparse. These guides will not only help you in scraping different kinds of website structures,…

Continue

Added by Paul Black on December 29, 2016 at 8:47pm — No Comments

Reasons and Solutions - Missing Data in Cloud Extraction

We all want a neat Excel spreadsheet of the scraped data before going on to further analysis.

With Octoparse, you can fetch the data you want from websites and have it ready for use. Our cloud services enable you to fetch large amounts of data by running your scraping task with Cloud Extraction. This assumes that you know how to handle the situations that arise when you use Cloud Extraction to scrape sites.

We summarize several problems encountered by our…

Continue

Added by Paul Black on December 27, 2016 at 4:14am — No Comments

Reasons and Solutions - Cloud Extraction Is Slower Than Local Extraction

Imagine that one day you open a web scraping tool and the screen displays all the data you want, neatly.

The Octoparse cloud servers have fetched all the data you want from any website for you. You're full of joy.

We love to see you smile.

We are dedicated to providing the best web scraping software and service for you.

So we have created some tutorials to solve all the problems you may have when using Cloud…

Continue

Added by Paul Black on December 27, 2016 at 4:10am — No Comments

Reasons and Solutions - Getting Data from Local Extraction but None from Cloud Extraction

We all want a neat Excel spreadsheet of the scraped data before going on to further analysis.

With Octoparse, you can fetch the data you want from websites and have it ready for use. Our cloud services enable you to fetch large amounts of data by running your scraping task with Cloud Extraction. This assumes that you know how to handle the situations that arise when you use Cloud Extraction to scrape sites.

We summarize several…

Continue

Added by Paul Black on December 27, 2016 at 4:04am — No Comments

5 Steps to Collect Big Data

 

(picture from databigandsmall.com)

We know most companies today collect big data to analyze and interpret daily transaction and traffic data in order to keep track of operations, forecast needs, or implement new programs. In this sense, we define big data as the capability that allows companies to extract value from large volumes of different kinds of data. But how can we acquire this big data capability directly?

There may be…

Continue

Added by Paul Black on December 22, 2016 at 4:36am — No Comments

Web Scraping|Scrape Data from Online Accommodation Booking Sites

For people who are actively looking for low-priced flights or hotels for their travels, or for businesses that want to track the prices of flights or other travel accommodations to maintain their competitive edge, Octoparse works well for collecting data based on different filters without manual searches.

A real-life example from one of our users who was trying to scrape data from …

Continue

Added by Paul Black on December 20, 2016 at 3:16am — No Comments

Notes

Welcome to Codetown!

Codetown is a social network. It's got blogs, forums, groups, personal pages and more! You might think of Codetown as a funky camper van with lots of compartments for your stuff and a great multimedia system, too! Best of all, Codetown has room for all of your friends.

When you create a profile for yourself you get a personal page automatically. That's where you can be creative and do your own thing. People who want to get to know you will click on your name or picture and…
Continue

Created by Michael Levin Dec 18, 2008 at 6:56pm. Last updated by Michael Levin May 4, 2018.

Looking for Jobs or Staff?

Check out the Codetown Jobs group.

There's also a free Java Jobs mailing list. It's a Yahoo group so you have to create a Yahoo account to use it.

 

Enjoy the site? Support Codetown with your donation.



© 2019 Created by Michael Levin.