January 2017 Blog Posts (9)

Scraping News Articles from TheStreet.com

Octoaprse enables you to scrape latest news articles from news source.

There're two parts for getting the real-time data in Octoparse - Make a scraping task and schedule a task to run it in Octoparse cloud.

 

In this web scraping tutorial we will scrape the latest news articles of TheStreet.com to get article information - such as the title of article, article body text, date/time article published, author and article URL with…

Continue

Added by Paul Black on January 8, 2017 at 10:57pm — No Comments

Scraping Stock Data on Yahoo Finance

Octoaprse enables you to scrape finance data from financial websites. There're two parts for getting the real-time data in Octoparse - Make a scraping task and schedule a task to run it in Octoparse cloud.

 

In this web scraping tutorial we will scrape the stock data - such as most active stocks, stock gainers and stock losers on Yahoo Finance with Octoparse.

The website URLs we will use are as follows.…

Continue

Added by Paul Black on January 5, 2017 at 11:05pm — No Comments

Scraping Stock Information from CNN Money

Octoaprse enables you to scrape stock information from financial website. There're two parts for getting the real-time data in Octoparse - Make a scraping task and schedule a task to run it in Octoparse cloud.

 

In this web scraping tutorial we will scrape the stock data from CNN Money website to get detail information - such as the title, body, published date/time, author and article URL with Octoparse.

The website URL we will use is…

Continue

Added by Paul Black on January 5, 2017 at 11:01pm — No Comments

How to add a fixed value when scraping in Octoparse?

Q: How to add a fixed value when scraping in Octoparse?

 

Description:

How to add a fixed value as one of the data fields when making a scraping task in Octoparse?

  

A:

The simplest method:

You can add a fixed value when you are in the "Extract Data" action:

1. Click the "Add Pre-defined Fields".

 

2. Choose the “Add a fixed value…

Continue

Added by Paul Black on January 5, 2017 at 5:20am — No Comments

Scraping Yelp Reviews

Octoparse enables you to scrape reviews from yelp.com.    

 

In this tutorial we will scrape all reviews about car audios in Brooklyn, NY, United States from yelp.com with Octoparse.

The website URL we will use is …

Continue

Added by Paul Black on January 3, 2017 at 1:41am — No Comments

Scraping Hotel Reviews from Tripadvisor.com

In this tutorial we will scrape the phone numbers of all the hotels and their customer reviews in London from TripAdvisor.com with Octoparse.

The website URL we will use is https://www.tripadvisor.com/Hotels-g186338-London_England-Hotels.html.

The data fields include Hotel name, the number of reviews, address, ranking, PhoneNumber, customer…

Continue

Added by Paul Black on January 3, 2017 at 1:39am — No Comments

Scraping Restaurants Infomation from yell.com

Octoparse enables you to scrape the search results from Yell.com. After you enter the items you want to search in a certain region, you will redirect to the search page by clicking the “Search” botton.

 

In this tutorial we will scrape data about all restaurants in London from yell.com with Octoparse.

Then we will use the URL of the…

Continue

Added by Paul Black on January 3, 2017 at 1:37am — No Comments

Scraping Product Detail Pages from eBay.com

Octoparse enables you to scrape data from eBay.com. To speed up the extraction, you can use our Cloud Extraction to split the scraping task into many sub-tasks. Then our cloud servers will collect the data shortly and provide you with a structured data-set.   

To scrape product details from eBay.com as fast as possible, you can make two scraping tasks -- Task 1 and Task 2. Task 1 is used to scrape the URLs of product details and Task 2 is used to scrape all the product details from …

Continue

Added by Paul Black on January 3, 2017 at 1:35am — No Comments

Scrape Data from YellowPages.com

Octoparse enables you to scrape yellowpages.com (www.yp.com). You can capture names, addresses, cities, phone numbers, websites, etc of a certain job positions in a region posted on yellowpages.com.

  

In this tutorial we will scrape all anesthesiologist in New York, NY, United States from yellowpages.com with Octoparse.

The website URL we will use is …

Continue

Added by Paul Black on January 3, 2017 at 1:33am — No Comments

Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

Happy 10th year, JCertif!

Notes

Welcome to Codetown!

Codetown is a social network. It's got blogs, forums, groups, personal pages and more! You might think of Codetown as a funky camper van with lots of compartments for your stuff and a great multimedia system, too! Best of all, Codetown has room for all of your friends.

When you create a profile for yourself you get a personal page automatically. That's where you can be creative and do your own thing. People who want to get to know you will click on your name or picture and…
Continue

Created by Michael Levin Dec 18, 2008 at 6:56pm. Last updated by Michael Levin May 4, 2018.

Looking for Jobs or Staff?

Check out the Codetown Jobs group.

There's also a free Java Jobs mailing list. It's a Yahoo group so you have to create a Yahoo account to use it.

 

Enjoy the site? Support Codetown with your donation.



InfoQ Reading List

WebExpo 2019: Make Healthcare Affordable and Accessible Using Tech and AI

Anna Zawilska, Lead User Researcher at Babylon Health, recently presented, at WebExpo 2019 in Prague, the lessons learnt from their experience delivering remote healthcare through a combination of technology and Artificial Intelligence (AI). Babylon Health came to adjust three key assumptions underpinning their product development.

By Bruno Couriol

Article: Q&A on the Book The Driver in the Driverless Car

The book The Driver in the Driverless Car by Vivek Wadhwa and Alex Salkever explores how technology is changing faster and faster, and what impact that can have on the future of our society. It aims to help frame decisions and thinking about rapidly developing technologies. Salkever and Wadhwa cover a wide variety of technologies, including robotics, AI, quantum computing, and driverless cars.

By Ben Linders, Vivek Wadhwa, Alex Salkever

Presentation: Business Agility – Increasing Your Organization’s Competitiveness

Dean Latchana addresses how organizations can handle market pressure and opportunity, covering closing the gap between vision and execution, determining strategic fit with the vision, and others.

By Dean Latchana

Presentation: Introduction to Stateful Property-based Testing

Tomasz Kowal presents a high-level overview that is both encouraging for beginners but also maps the road to mastering Property-based Testing.

By Tomasz Kowal

Introducing Maesh: A Service Mesh for Kubernetes

On September 4th, 2019, Containous, a cloud infrastructure software provider, released Maesh, an open-source service mesh written in Golang and built on top of the reverse proxy and load balancer Traefik. Maesh promises to provide a lightweight service mesh solution that is easy to get started with and to roll out across a microservice application.

By K Jonas

© 2019   Created by Michael Levin.   Powered by

Badges  |  Report an Issue  |  Terms of Service