<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Kurinchi Blogger Scribbles ... &#187; http</title>
	<atom:link href="http://kurinchilamp.kurinchilion.com/tag/http/feed" rel="self" type="application/rss+xml" />
	<link>http://kurinchilamp.kurinchilion.com</link>
	<description>On Open Source Technologies</description>
	<lastBuildDate>Fri, 03 Sep 2010 03:26:10 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>HTTP-Header: User-agent modification</title>
		<link>http://kurinchilamp.kurinchilion.com/2009/12/http-header-user-agent-modification.html</link>
		<comments>http://kurinchilamp.kurinchilion.com/2009/12/http-header-user-agent-modification.html#comments</comments>
		<pubDate>Sun, 06 Dec 2009 07:09:21 +0000</pubDate>
		<dc:creator>kurinchilamp</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[HTML]]></category>
		<category><![CDATA[CSS]]></category>
		<category><![CDATA[http]]></category>
		<category><![CDATA[http headers]]></category>
		<category><![CDATA[javascript]]></category>
		<category><![CDATA[modify user agent]]></category>
		<category><![CDATA[user-agent]]></category>

		<guid isPermaLink="false">http://kurinchilamp.kurinchilion.com/?p=740</guid>
		<description><![CDATA[When a browser makes a request to a server, it sends information such as the operating system name and version, along with the browser type, in the &#8220;User-Agent&#8221; request header. Depending on that value, the server may respond with browser-specific details, which can range from browser specific css [...]]]></description>
			<content:encoded><![CDATA[<p>When a browser makes a request to a server, it sends information such as the operating system name and version, along with the browser type, in the &#8220;User-Agent&#8221; request header.</p>
<p>Depending on that value, the server may respond with browser-specific details, which can range from a browser-specific <strong>CSS</strong> layout to loading certain <em>JavaScript</em> files for certain browsers.</p>
<p>Changing the <strong>user-agent</strong> also helps us understand how spiders or bots will see the pages that we build. It also lets us make a request look as if it originated from a mobile phone, so that we can view the behavior of the application on different mobile clients.</p>
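<p>As a minimal sketch of this idea, the Python snippet below builds a request that carries a custom User-Agent header, using only the standard library. The URL and the user-agent string are hypothetical placeholders, not values from this post. The request is only constructed here; passing it to <code>urllib.request.urlopen</code> would actually send it.</p>

```python
import urllib.request

# Hypothetical values, for illustration only.
MOBILE_UA = "Mozilla/5.0 (Linux; Android 10; Mobile) ExampleBrowser/1.0"
URL = "http://example.com/"


def build_request(url, user_agent):
    """Return a Request object that carries the given User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


req = build_request(URL, MOBILE_UA)
# urllib stores header names in capitalized form, hence "User-agent".
print(req.get_header("User-agent"))
```

<p>Swapping in the user-agent string of a crawler or of a particular phone browser is one way to see which variant of a page the server returns for that client.</p>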
<p>To read more about HTTP headers, see <a href="http://www.ietf.org/rfc/rfc2616.txt" target="_blank">RFC 2616 at ietf.org</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://kurinchilamp.kurinchilion.com/2009/12/http-header-user-agent-modification.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>World Wide Web Get &#8211; wget</title>
		<link>http://kurinchilamp.kurinchilion.com/2009/04/world-wide-web-get-wget.html</link>
		<comments>http://kurinchilamp.kurinchilion.com/2009/04/world-wide-web-get-wget.html#comments</comments>
		<pubDate>Thu, 30 Apr 2009 11:39:15 +0000</pubDate>
		<dc:creator>kurinchilamp</dc:creator>
				<category><![CDATA[LINUX]]></category>
		<category><![CDATA[copy website]]></category>
		<category><![CDATA[download website]]></category>
		<category><![CDATA[ftp]]></category>
		<category><![CDATA[gnu wget]]></category>
		<category><![CDATA[grab website]]></category>
		<category><![CDATA[http]]></category>
		<category><![CDATA[https]]></category>
		<category><![CDATA[wget]]></category>
		<category><![CDATA[world wide web get]]></category>

		<guid isPermaLink="false">http://kurinchilamp.kurinchilion.com/?p=280</guid>
		<description><![CDATA[Grab entire website content with &#8220;wget&#8221; The &#8220;wget&#8221; tool lets a user grab the entire content of a website. There may be situations where you have difficulty getting the source code from your software vendor, or do not have access details for a certain location, in which case &#8220;wget&#8221; comes in [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Grab entire website content with &#8220;wget&#8221;</strong></p>
<p>The &#8220;wget&#8221; tool lets a user grab the entire content of a website. There may be situations where you have difficulty getting the source code from your software vendor, or do not have access details for a certain location, in which case &#8220;wget&#8221; comes in handy. It is available for the win32 platform as well; one option there is to install Cygwin and run the &#8220;wget&#8221; command from it.<span id="more-280"></span></p>
<p><strong>$ cd /tmp<br />
$ mkdir sitename.com<br />
$ cd sitename.com<br />
$ wget -r -H -k -Dsitename.com,www.sitename.com sitename.com</strong></p>
<p><strong>Switches </strong>used &#8230;</p>
<p>-r switch to retrieve files recursively<br />
-H switch to &#8220;span hosts&#8221;, since some sites link to other domains<br />
-D switch to list the domains from which files will be gathered, as a comma-separated list without spaces. It is best used together with the -H switch<br />
-k switch to convert the links in the downloaded files so that they refer to the local copies and not to the original internet location</p>
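<p>The switches above can be sketched as an argument list in Python. The helper name <code>build_wget_command</code> is hypothetical and the domains are the placeholder names used in this post; the snippet only prints the command and does not run it.</p>

```python
import shlex


def build_wget_command(start_url, domains):
    """Build the argument list for a recursive, host-spanning wget run."""
    return [
        "wget",
        "-r",                      # retrieve recursively
        "-H",                      # allow spanning to other hosts
        "-k",                      # convert links to point at local copies
        "-D" + ",".join(domains),  # comma-separated domain list, no spaces
        start_url,
    ]


cmd = build_wget_command("sitename.com", ["sitename.com", "www.sitename.com"])
print(shlex.join(cmd))
# prints: wget -r -H -k -Dsitename.com,www.sitename.com sitename.com
```

<p>Building the domain list with <code>",".join</code> avoids a stray space after the comma, which would make the shell pass the second domain as a separate argument. To actually run the download, the list could be handed to <code>subprocess.run</code>.</p>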
<p>To get a recursive copy of a site (by default, -r follows links up to five levels deep)</p>
<p><strong>$ wget -r http://somesite.com</strong></p>
<p>Use it with --convert-links to make offline copies with local links</p>
<p><strong>$ wget --convert-links -r http://somesite.com</strong></p>
<p>To save the files with .html extension</p>
<p><strong>$ wget --html-extension -r http://somesite.com</strong></p>
<p>Now you will have all the files downloaded under /tmp/sitename.com.</p>
<p><strong>Where to get WGET?</strong></p>
<p>http://wget.addictivecode.org/FrequentlyAskedQuestions?action=show&#038;redirect=Faq#download</p>
<p>Some sites block the default user agent sent by Wget to keep it from consuming too much of their bandwidth. In that case you could send the user-agent string of a crawler from Yahoo, Google or MSN instead to grab a local copy.</p>
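<p>A minimal sketch of this, assuming wget's <code>--user-agent</code> option: the hypothetical helper <code>with_user_agent</code> below adds that option to an existing argument list. The user-agent string is a made-up placeholder, not the string of any real crawler.</p>

```python
def with_user_agent(wget_args, user_agent):
    """Return a copy of a wget argument list with --user-agent added
    right after the program name."""
    return [wget_args[0], "--user-agent=" + user_agent] + list(wget_args[1:])


# Hypothetical user-agent string, for illustration only.
UA = "ExampleCrawler/1.0 (+http://example.com/bot)"

cmd = with_user_agent(["wget", "-r", "http://somesite.com"], UA)
print(cmd)
```

<p>Keeping the value as a single list element means the spaces inside the user-agent string need no shell quoting when the list is passed to <code>subprocess.run</code>.</p>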
<p>&#8220;wget&#8221; supports many other settings for grabbing site content, including HTTP keep-alive (persistent connections), cookies, POST data, HTTPS settings and FTP settings. Check the manual for more detailed help on this topic.</p>
<p>An alternative to &#8220;wget&#8221; is a tool called &#8220;HTTrack&#8221;, which is also free and has a version for Windows users. Check <a href="http://www.httrack.com/">http://www.httrack.com/</a> to download a copy of this tool.</p>
]]></content:encoded>
			<wfw:commentRss>http://kurinchilamp.kurinchilion.com/2009/04/world-wide-web-get-wget.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
