<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Kurinchi Blogger Scribbles ... &#187; unicode</title>
	<atom:link href="http://kurinchilamp.kurinchilion.com/tag/unicode/feed" rel="self" type="application/rss+xml" />
	<link>http://kurinchilamp.kurinchilion.com</link>
	<description>On Open Source Technologies</description>
	<lastBuildDate>Mon, 02 Jan 2012 06:14:45 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
		<item>
		<title>UTF and Byte Order Mark (BOM)</title>
		<link>http://kurinchilamp.kurinchilion.com/2009/11/utf-and-byte-order-mark-bom.html</link>
		<comments>http://kurinchilamp.kurinchilion.com/2009/11/utf-and-byte-order-mark-bom.html#comments</comments>
		<pubDate>Thu, 12 Nov 2009 01:09:15 +0000</pubDate>
		<dc:creator>kurinchilamp</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[PHP]]></category>
		<category><![CDATA[BOM]]></category>
		<category><![CDATA[Byte Order Mark]]></category>
		<category><![CDATA[multi-lingual website]]></category>
		<category><![CDATA[unicode]]></category>
		<category><![CDATA[Unicode Transformation Format]]></category>
		<category><![CDATA[UTF]]></category>

		<guid isPermaLink="false">http://kurinchilamp.kurinchilion.com/?p=707</guid>
		<description><![CDATA[UTF (Unicode Transformation Format) is a family of encodings for Unicode, a character set that covers the characters of virtually all languages currently in use. BOM stands for Byte Order Mark, the Unicode character U+FEFF (also known as ZERO WIDTH NO-BREAK SPACE), which appears at the beginning of a data stream to define the byte order (whether it is little [...]]]></description>
			<content:encoded><![CDATA[<p><strong>UTF</strong> <em>abbr.</em> <strong>Unicode Transformation Format</strong> is a family of encodings (UTF-8, UTF-16, UTF-32) for Unicode, a character set that covers the characters of virtually all languages currently in use.</p>
<p><strong>BOM </strong>stands for <strong>Byte Order Mark</strong>, the Unicode character U+FEFF (also known as <strong>ZERO WIDTH NO-BREAK SPACE</strong>). It appears at the beginning of a data stream to define the byte order (whether it is <em>little endian </em>or <em>big endian</em>). A BOM also signals that the file is <strong>Unicode </strong>encoded. In UTF-8, where byte order is not an issue, the BOM serves only as such a signature.</p>
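<p>As a small illustration of how the same mark reveals the byte order, the Python sketch below encodes U+FEFF in a few encodings and prints the resulting bytes. The helper name <code>bom_bytes</code> is made up for this example.</p>

```python
BOM = "\ufeff"  # the code point U+FEFF used as the Byte Order Mark


def bom_bytes(encoding):
    """Encode U+FEFF in the given encoding and return its bytes as spaced hex."""
    return " ".join(format(byte, "02x") for byte in BOM.encode(encoding))


if __name__ == "__main__":
    # The explicit -be / -le codecs do not add a BOM of their own,
    # so the output is exactly the encoded form of U+FEFF.
    for name in ("utf-8", "utf-16-be", "utf-16-le"):
        print(name, bom_bytes(name))
    # utf-8 ef bb bf
    # utf-16-be fe ff
    # utf-16-le ff fe
```

<p>A reader that sees <code>fe ff</code> at the start of a UTF-16 stream can assume big endian, and <code>ff fe</code> little endian.</p>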
<p><strong>Possible problem caused by Byte Order Mark</strong><br />
In IE6, a BOM that appears before the &#8220;&lt;!DOCTYPE &#8230;&#8221; declaration sends the display into quirks mode.</p>
<p><strong>How to check the presence of BOM?</strong><br />
View the source code of the page that does not display correctly in a user agent. Or, open the file in an editor that does not support UTF encoding to see the BOM&#8217;s signature at the start of the file. A UTF-8 BOM read as Latin-1 or Windows-1252 shows up as &#8220;&#239;&#187;&#191;&#8221;.</p>
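<p>The check can also be done programmatically. The following is a minimal Python sketch, not a full encoding detector: it only compares the first bytes of the data against the BOM constants in the standard <code>codecs</code> module. The names <code>detect_bom</code> and <code>detect_bom_in_file</code> are made up for this example.</p>

```python
import codecs

# Known BOM signatures. The UTF-32 entries come first because the
# UTF-32 LE BOM (ff fe 00 00) starts with the UTF-16 LE BOM (ff fe).
BOM_SIGNATURES = [
    (codecs.BOM_UTF32_LE, "UTF-32 LE"),
    (codecs.BOM_UTF32_BE, "UTF-32 BE"),
    (codecs.BOM_UTF8, "UTF-8"),
    (codecs.BOM_UTF16_LE, "UTF-16 LE"),
    (codecs.BOM_UTF16_BE, "UTF-16 BE"),
]


def detect_bom(data):
    """Return the encoding implied by a leading BOM in data, or None."""
    for signature, name in BOM_SIGNATURES:
        if data.startswith(signature):
            return name
    return None


def detect_bom_in_file(path):
    """Read the first four bytes of the file and check them for a BOM."""
    with open(path, "rb") as handle:
        return detect_bom(handle.read(4))
```

<p>For example, <code>detect_bom_in_file("index.php")</code> (a hypothetical file name) returns <code>"UTF-8"</code> for a file saved with a UTF-8 BOM and <code>None</code> for a file saved without one.</p>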
<p><em>Suggestion</em><br />
If a user agent misbehaves while you are developing a multi-lingual website, check your editor settings to see which encoding is used when saving program files. Notepad++ has options to convert files to UTF-8 and to UTF-8 without BOM. Use the appropriate setting to convert your files.</p>
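<p>Where an editor is not at hand, the same conversion can be scripted. This is a rough Python sketch that handles only the UTF-8 BOM and rewrites the file in place, so try it on a copy first. The function names are made up for this example.</p>

```python
import codecs


def strip_utf8_bom(data):
    """Return data without a leading UTF-8 BOM; other input is returned unchanged."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


def rewrite_without_bom(path):
    """Rewrite the file in place without a UTF-8 BOM.

    Returns True if a BOM was found and removed, False otherwise.
    """
    with open(path, "rb") as handle:
        original = handle.read()
    cleaned = strip_utf8_bom(original)
    if cleaned == original:
        return False
    with open(path, "wb") as handle:
        handle.write(cleaned)
    return True
```

<p>Working on raw bytes keeps the rest of the file untouched, whatever it contains.</p>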
<p>A tutorial on character sets and encodings is available on the <a href="http://www.w3.org/International/tutorials/tutorial-char-enc/">W3C site</a>.</p>
<p>Byte Order Mark on <a href="http://en.wikipedia.org/wiki/Byte_order_mark">Wikipedia</a></p>
<p>Article on <a href="http://randomchaos.com/documents/?source=php_and_unicode">PHP and UTF</a></p>
]]></content:encoded>
			<wfw:commentRss>http://kurinchilamp.kurinchilion.com/2009/11/utf-and-byte-order-mark-bom.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

