<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>PHPDeveloper.org</title>
    <link>http://www.phpdeveloper.org</link>
    <description>Up-to-the Minute PHP News, views and community</description>
    <language>en-us</language>
    <pubDate>Wed, 22 May 2013 15:17:15 -0500</pubDate>
    <ttl>30</ttl>
    <item>
      <title><![CDATA[Juozas Kaziukenas' Blog: Web scraping with PHP and XPath]]></title>
      <guid>http://www.phpdeveloper.org/news/11972</guid>
      <link>http://www.phpdeveloper.org/news/11972</link>
      <description><![CDATA[<p>
In <a href="http://dev.juokaz.com/php/web-scraping-with-php-and-xpath">this new post</a> to his blog <i>Juozas Kaziukenas</i> takes a look at one method for getting the information out of a remote page - parsing it with PHP and XPath (assuming the page is correctly formatted).
</p>
<blockquote>
When I was writing about <a href="http://dev.juokaz.com/php/web-scraping-easy-way-to-monitor-market">how I use web scraping</a>, I was still hadn't tried using Xpath (shame on me). [...] It turned out, that using Xpath is extremely easy, really. When you master it, you can do everything in seconds. Yes, you need to know how XML works and how to write correct Xpath queries (brief explanation of Xpath syntax is available at <a href="http://www.w3schools.com/XPath/xpath_syntax.asp">W3Schools</a>), but hey - these topics are in 1st year of university. 
</blockquote>
<p>
He includes both some sample code (to fetch a titles and prices for cameras from bhphotovideo.com) and a link to a <a href="https://addons.mozilla.org/en-US/firefox/addon/1095">XPath checker</a> you can use to ensure that your query is correctly formatted. It's good that he also includes a quick reminder about the ethical issue with web scraping - it could be considered stealing depending on where the information comes from and who is providing it.
</p>]]></description>
      <pubDate>Wed, 18 Feb 2009 10:28:08 -0600</pubDate>
    </item>
  </channel>
</rss>
