
Danne Lundqvist's Blog:
Detecting UTF BOM - byte order mark
April 29, 2010 @ 11:47:03

In a new post to his blog, Danne Lundqvist looks at a common pitfall that can trip you up if you're not careful with your UTF-8 data: overlooking the UTF byte order mark (BOM), the bytes at the very start of a file that flag its contents as UTF.

One such thing is the occurrence of the UTF byte order mark, or BOM. [...] For UTF-8, especially on Windows, it has become more and more common to use it to indicate that the file is indeed UTF. Most text editors handle this well and you won't ever see these bytes. As it should be.

He points out where the BOM can cause problems, such as when using strcmp or substr on the data, and notes that they can be prevented by checking for those first three bytes and removing them if needed. He includes a snippet of code that does just that.
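As a rough illustration of the idea (this is not the snippet from Lundqvist's post), the check could be sketched as below. It assumes UTF-8 only, where the BOM is the three bytes EF BB BF; the constant and the helper names `hasUtf8Bom` and `stripUtf8Bom` are hypothetical and chosen just for this example.

```php
<?php
// The UTF-8 encoding of U+FEFF, the byte order mark (three bytes).
const UTF8_BOM = "\xEF\xBB\xBF";

/**
 * Hypothetical helper: report whether $text starts with the UTF-8 BOM.
 */
function hasUtf8Bom(string $text): bool
{
    // Compare only the first three bytes of $text against the BOM.
    return strncmp($text, UTF8_BOM, 3) === 0;
}

/**
 * Hypothetical helper: return $text without a leading UTF-8 BOM.
 * Text that has no BOM is returned unchanged.
 */
function stripUtf8Bom(string $text): string
{
    return hasUtf8Bom($text) ? substr($text, 3) : $text;
}

// The invisible BOM makes an otherwise identical string compare as different.
$raw = UTF8_BOM . "hello";
var_dump(strcmp($raw, "hello") === 0);               // bool(false)
var_dump(strcmp(stripUtf8Bom($raw), "hello") === 0); // bool(true)
```

Stripping the BOM once, right after the data is read, keeps later strcmp and substr calls from having to account for the three extra bytes.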





All content copyright, 2014 PHPDeveloper.org :: info@phpdeveloper.org - Powered by the Solar PHP Framework