<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>PHPDeveloper.org</title>
    <link>http://www.phpdeveloper.org</link>
    <description>Up-to-the Minute PHP News, views and community</description>
    <language>en-us</language>
    <pubDate>Wed, 22 May 2013 12:56:36 -0500</pubDate>
    <ttl>30</ttl>
    <item>
      <title><![CDATA[Danne Lundqvist's Blog: Detecting UTF BOM - byte order mark]]></title>
      <guid>http://www.phpdeveloper.org/news/14435</guid>
      <link>http://www.phpdeveloper.org/news/14435</link>
      <description><![CDATA[<p>
In a new post to his blog <i>Danne Lundqvist</i> looks at <a href="http://www.dotvoid.com/2010/04/detecting-utf-bom-byte-order-mark/">a common pitfall</a> that could trip you up if you're not careful with your UTF-8 data - not looking for the UTF byte order mark that tells the application if it needs to be handled as UTF content.
</p>
<p>
One such thing is the occurrence of the UTF byte order mark, or BOM. [...] For UTF-8, especially on Windows, it has become more and more common to use it to indicate that the file is indeed UTF. Most text editors handle this well and you won't ever see these bytes. As it should be.
</p>
<p>
He points out what could cause an issue - using <a href="http://php.net/strcmp">strcmp</a> or <a href="http://php.net/substr">substr</a> but it can be prevented by looking at and removing those first three bytes if needed. He includes a snippet of code that does just that.
</p>]]></description>
      <pubDate>Thu, 29 Apr 2010 11:47:03 -0500</pubDate>
    </item>
    <item>
      <title><![CDATA[Sean Coates' Blog: UTF: WTF?]]></title>
      <guid>http://www.phpdeveloper.org/news/11445</guid>
      <link>http://www.phpdeveloper.org/news/11445</link>
      <description><![CDATA[<p>
<i>Sean Coates</i> has <a href="http://seancoates.com/utf-wtf">reposted an article</a> that was originally published in <a href="http://www.phparch.com">php|architect</a> magazine covering UTF-8 and proper Unicode encoding.
</p>
<blockquote>
If I had to guess, I would estimate that I've spent somewhere in the range of 40 hours wrangling UTF-8 in the past 3 months, which is not only expensive for my employer, but also disheartening as a developer who's got real work to do. Admittedly, this number is inflated, due to the heavy development cycle we completed with the launch of our new site.
</blockquote>
<p>
<i>Sean</i> goes on to talk about Unicode issues in general (partially supported in some places, too many points of failure) and some of his other experiences with "the UTF-8 monster" that have given him trouble over time.
</p>]]></description>
      <pubDate>Mon, 24 Nov 2008 09:31:04 -0600</pubDate>
    </item>
  </channel>
</rss>
