James Morris' Blog:
Parsing HTML with DOMDocument and DOMXPath::query
June 27, 2012 @ 10:19:35

In the latest post to his blog, James Morris looks at using DOMXPath's query() method to locate pieces of data in an HTML document.

The other day I needed to do some html scraping to trim out some repeated data stuck inside nested divs and produce a simplified array of said data. My first port of call was SimpleXML which I have used many times. However this time, the son of a bitch just wouldn't work with me and kept on throwing up parsing errors. I lost my patience with it and decided to give DomDocument and DOMXpath a go which I'd heard of but never used.

He includes example code (and a sample document) showing how to extract content from an HTML structure: grabbing each of the images inside a div and associating it with its description.
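The post's own code is not reproduced here, so the following is only a minimal sketch of the general approach, not James Morris' implementation. The markup, the `gallery`/`item`/`description` names, and the `extractImages()` helper are all assumptions made for illustration. It loads an HTML fragment with DOMDocument, then uses DOMXPath::query() to pair each image with the description next to it.

```php
<?php
// Hypothetical markup: the HTML from the original post is not shown here,
// so this structure (ids, classes, paths) is assumed for illustration only.
$html = <<<'HTML'
<div id="gallery">
  <div class="item">
    <img src="/img/one.jpg" alt="One">
    <p class="description">First image</p>
  </div>
  <div class="item">
    <img src="/img/two.jpg" alt="Two">
    <p class="description">Second image</p>
  </div>
</div>
HTML;

// Hypothetical helper: returns a list of ['src' => ..., 'description' => ...].
function extractImages(string $html): array
{
    $doc = new DOMDocument();

    // Real-world HTML is rarely well formed; collect libxml's parse
    // warnings internally instead of letting them surface as PHP warnings.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    $results = [];

    // Select each item container, then run relative queries inside it by
    // passing the container as the context node (second argument).
    foreach ($xpath->query('//div[@id="gallery"]/div[@class="item"]') as $item) {
        $img  = $xpath->query('.//img', $item)->item(0);
        $desc = $xpath->query('.//p[@class="description"]', $item)->item(0);

        if ($img === null) {
            continue; // skip containers without an image
        }

        $results[] = [
            'src'         => $img->getAttribute('src'),
            'description' => $desc !== null ? trim($desc->textContent) : '',
        ];
    }

    return $results;
}

$images = extractImages($html);
print_r($images);
```

Passing the container as the context node keeps each image tied to the description inside the same div, which is simpler than running two document-wide queries and matching results by position.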




All content copyright, 2014 PHPDeveloper.org :: info@phpdeveloper.org - Powered by the Solar PHP Framework