<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Basically, Dan &#187; articles</title>
	<atom:link href="http://danielhough.co.uk/blog/tag/articles/feed/" rel="self" type="application/rss+xml" />
	<link>http://danielhough.co.uk/blog</link>
	<description>One long adventure.</description>
	<lastBuildDate>Sun, 01 Aug 2010 14:47:00 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Monitoring the Feeds</title>
		<link>http://danielhough.co.uk/blog/2010/01/monitoring-the-feeds/</link>
		<comments>http://danielhough.co.uk/blog/2010/01/monitoring-the-feeds/#comments</comments>
		<pubDate>Tue, 26 Jan 2010 12:37:23 +0000</pubDate>
		<dc:creator>Dan</dc:creator>
				<category><![CDATA[Dissertation]]></category>
		<category><![CDATA[articles]]></category>

		<guid isPermaLink="false">http://danielhough.co.uk/blog/?p=25</guid>
		<description><![CDATA[I've finally finished the RSS and HTML parsing section of the project. Since about midnight this morning (26/01/2010), the system has collected 180 unique articles from the Daily Mail, the Guardian, the Telegraph and the Express. I'm going to cluster these articles myself as they come in and soon enough will begin developing the modular [...]]]></description>
			<content:encoded><![CDATA[<p>I've finally finished the RSS and HTML parsing section of the project. Since about midnight this morning (26/01/2010), the system has collected 180 unique articles from the <a title="The Daily Mail" href="http://www.dailymail.co.uk">Daily Mail</a>, <a title="The Guardian" href="http://www.guardian.co.uk">the Guardian</a>, <a title="The Telegraph" href="http://www.telegraph.co.uk">the Telegraph</a> and <a title="The Express" href="http://www.express.co.uk">the Express</a>.</p>
<p>I'm going to cluster these articles myself as they come in, and soon enough I will begin developing the modular system that represents articles as vectors (just vectors, for the time being), along with the methods needed to compare and cluster them. Then accuracy can be measured, settings tweaked and algorithms debugged until I find the best configuration.</p>
<p>After that begins the mammoth task of just letting it run for ages. In the meantime, I will a) begin a report about this crazy adventure and b) work on some rad visualisations of the data collected.</p>
<p>That's the plan at least. Wish me luck!</p>
]]></content:encoded>
			<wfw:commentRss>http://danielhough.co.uk/blog/2010/01/monitoring-the-feeds/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
