Vroman's is OPEN for in-store shopping and curbside pick-up

Perl & Lwp: Fetching Web Pages, Parsing Html, Writing Spiders & More (Paperback)

Perl & Lwp: Fetching Web Pages, Parsing Html, Writing Spiders & More By Sean M. Burke Cover Image
Add to Wish List
Usually arrives at our store within 4-7 days


Perl soared to popularity as a language for creating and managing web content, but with LWP (Library for WWW in Perl), Perl is equally adept at consuming information on the Web. LWP is a suite of modules for fetching and processing web pages.The Web is a vast data source that contains everything from stock prices to movie credits, and with LWP all that data is just a few lines of code away. Anything you do on the Web, whether it's buying or selling, reading or writing, uploading or downloading, news to e-commerce, can be controlled with Perl and LWP. You can automate Web-based purchase orders as easily as you can set up a program to download MP3 files from a web site.Perl & LWP covers:

  • Understanding LWP and its designFetching and analyzing URLsExtracting information from HTML using regular expressions and tokensWorking with the structure of HTML documents using treesSetting and inspecting HTTP headers and response codesManaging cookiesAccessing information that requires authenticationExtracting linksCooperating with proxy cachesWriting web spiders (also known as robots) in a safe fashion
Perl & LWP includes many step-by-step examples that show how to apply the various techniques. Programs to extract information from the web sites of BBC News, Altavista, ABEBooks.com, and the Weather Underground, to name just a few, are explained in detail, so that you understand how and why they work.Perl programmers who want to automate and mine the web can pick up this book and be immediately productive. Written by a contributor to LWP, and with a foreword by one of LWP's creators, Perl & LWP is the authoritative guide to this powerful and popular toolkit.
Product Details
ISBN: 9780596001780
ISBN-10: 0596001789
Publisher: O'Reilly Media
Publication Date: June 30th, 2002
Pages: 262
Language: English