Home » Freemium, Headline, News, Opinion, Product Review
80Legs, 50k Computers and a Web Crawler
By Senior Editor – Kris Smith (@croncast)
You need a pile-o-data fast and you got nowhere to get it other than surf, bookmark and beg for interns to copy and paste for you. Where do you turn? Your IT department? Your hackery skills and your shared GoDaddy hosting account for bandwidth? Nah.
80Legs is ready to run a couple miles with your pile of data on their shoulders. You get to pick it up and work with it as you see fit.
Did I mention that they are now offering this as a free service? Well, up to a certain point it is free but for the many is plenty of room to get what they’re looking for.
80Legs offers a unique service that will crawl the internet on your behalf and gather data from the links that you provide. They then take this unstructured data and make it available for further refinement to the customer.
Their value proposition lies in the ability to deliver this service efficiently and affordably. Like I said earlier, it would be difficult if not impossible for an individual run a service to crawl 100,000 pages quickly. 80Legs is offering this as a free service now and it’s all powered by a 50,000 computer network.
The ability to put the data collection into another companies hands allows developers to think about what to do with the data. By freeing up developers more can be done with the data that is returned to them as they have time to think about new algorithms to run across the dataset.
An example of this would be simple search. Developers with more time could work on creating new layers to search that make it more valuable to the end user. Whether it is integrating advanced search functionality or returning results contextually depending on the page that a user is currently searching from.
If you’re interested, the free Basic specs are below. Plus and Premium are listed on their blog.
80Legs Basic Plan:
- Free to use
- Normal crawling speed (up to 1 request/second/domain)
- Access to 80legs Web Portal
- 1 job running at a time
- Up to 100K crawled pages per job
- Low priority in 80legs job queue
- No recurring jobs allowed
[Via VentureBeat]
Related articles by Zemanta
- GoDaddy Referral URLs and ISC codes (startups.com)
- How to Use TipTop for Real Time Market Research (growmap.com)
- Sony Trademarks “Qrisoity,” Possibly New Premium PSN Service? (1up.com)
- Got Data? How Changing My Social Sharing Workflow Is Making Me Smarter (I Hope). (techstartups.com)
, 80legs spider
, 80Legs web crawler
, free web crawler
, free web spider
, unstructured data
, value proposition
, VentureBeat 

![Reblog this post [with Zemanta]](http://img.zemanta.com/reblog_e.png?x-id=903642c8-0bd2-4e2a-9a5c-051f1739069d)











And right now they don’t follow robots.txt and flood sites with their crawler, abusing different IPs – mainly from comcast, cox and verizon.
Webmasters should block this bad bot in htaccess!
[...] 80Legs, 50k Computers and a Web Crawler (techstartups.com) [...]
Leave your response!
Tech Cloud
Tech Categories
Archives