<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>PHPDeveloper.org</title>
    <link>http://www.phpdeveloper.org</link>
    <description>Up-to-the-Minute PHP News, Views and Community</description>
    <language>en-us</language>
    <pubDate>Thu, 20 Nov 2008 08:33:40 -0600</pubDate>
    <ttl>30</ttl>
    <item>
      <title><![CDATA[Matthew Turland's Blog: How-To (and How-Not-To) on Web Scraping]]></title>
      <guid>http://www.phpdeveloper.org/news/9798</guid>
      <link>http://www.phpdeveloper.org/news/9798</link>
      <description><![CDATA[<p>
<i>Matthew Turland</i>, who previously wrote an article on web scraping for <a href="http://www.phparch.com">php|architect</a>, has a few things to say about the topic (and <a href="http://php.dzone.com/news/writing-website-scrapers-php">recent articles</a> covering it) on <a href="http://ishouldbecoding.com/2008/03/12/scraping-html-with-dom">his blog today</a>:
</p>
<blockquote>
A friend of mine who shall remain nameless pointed a <a href="http://php.dzone.com/news/writing-website-scrapers-php">post</a> out to me on the <a href="http://php.dzone.com/">PHP DZone</a> web site recently. Noting that the article's content was misinformed at best and downright ignorant at worst, even when examining it sheerly from the author's knowledge of PHP as a language, this friend asked that I set the author straight.
</blockquote>
<p>
He mentions his <a href="http://php.dzone.com/news/writing-website-scrapers-php#comment-1497">comments</a> on the post, which correct the author on some points, and points to a more "clued in" <a href="http://www.xml.lt/Blog/2008/03/11/Scraping+html+with+DOM">post</a> on the xml.lt website that covers using PHP's DOM functionality instead.
</p>]]></description>
      <pubDate>Fri, 14 Mar 2008 11:18:44 -0500</pubDate>
    </item>
  </channel>
</rss>
