<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Data Unbound &#187; Wikipedia</title>
	<atom:link href="http://blog.dataunbound.com/category/wikipedia/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.dataunbound.com</link>
	<description>Helping organizations access and share data effectively.  Special focus on web APIs for data integration.</description>
	<lastBuildDate>Sat, 12 Feb 2011 21:00:17 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<image>
  <link>http://blog.dataunbound.com</link>
  <url>http://blog.dataunbound.com/wp-content/plugins/favicon-manager/dataunbound.ico</url>
  <title>Data Unbound</title>
</image>
		<item>
		<title>pageid/curid as a unique id for Wikipedia pages</title>
		<link>http://blog.dataunbound.com/2009/04/10/pageidcurid-as-a-unique-id-for-wikipedia-pages/</link>
		<comments>http://blog.dataunbound.com/2009/04/10/pageidcurid-as-a-unique-id-for-wikipedia-pages/#comments</comments>
		<pubDate>Fri, 10 Apr 2009 18:00:55 +0000</pubDate>
		<dc:creator>Raymond Yee</dc:creator>
				<category><![CDATA[freebase]]></category>
		<category><![CDATA[Wikipedia]]></category>

		<guid isPermaLink="false">http://blog.dataunbound.com/?p=469</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=pageid%2Fcurid+as+a+unique+id+for+Wikipedia+pages&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=freebase&amp;rft.subject=Wikipedia&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-10&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/10/pageidcurid-as-a-unique-id-for-wikipedia-pages/&amp;rft.language=English"></span>
In my learning how to program Freebase, I&#039;ve come across links to the Wikipedia that make use of a curid parameter.  For example, http://en.wikipedia.org/wiki/index.html?curid=296716 is the same as http://en.wikipedia.org/wiki/Daniel_Akaka At least, the two pages seem to be the same thing as far as I can see. How to do a lookup btween curid and the [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=pageid%2Fcurid+as+a+unique+id+for+Wikipedia+pages&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=freebase&amp;rft.subject=Wikipedia&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-10&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/10/pageidcurid-as-a-unique-id-for-wikipedia-pages/&amp;rft.language=English"></span>
<p>In my learning how to program Freebase, I&#039;ve come across links to the Wikipedia that make use of a <em>curid </em>parameter.  For example,</p>
<p><a href="http://en.wikipedia.org/wiki/index.html?curid=296716">http://en.wikipedia.org/wiki/index.html?curid=296716</a></p>
<p>is the same as</p>
<p><a href="http://en.wikipedia.org/wiki/Daniel_Akaka">http://en.wikipedia.org/wiki/Daniel_Akaka</a></p>
<p>At least, the two pages seem to be the same thing as far as I can see.</p>
<p>How to do a lookup btween curid and the page title?  One way is ff we&#039;re screen-scraping, the page source of <a href="http://en.wikipedia.org/wiki/Daniel_Akaka">http://en.wikipedia.org/wiki/Daniel_Akaka</a> contains</p>
<pre id="line28">var wgArticleId = "296716";</pre>
<p>And if you go to <a href="http://en.wikipedia.org/wiki/index.html?curid=296716">http://en.wikipedia.org/wiki/index.html?curid=296716</a> lots of indication of what the title is, including the permanent link (e.g., <a href="http://en.wikipedia.org/w/index.php?title=Daniel_Akaka&amp;oldid=278490360">http://en.wikipedia.org/w/index.php?title=Daniel_Akaka&amp;oldid=278490360</a>)</p>
<p>To dig deeper, I might want to understand the <a href="http://www.mediawiki.org/wiki/Page_table">mediawiki data structure</a> and the <a href="http://www.mediawiki.org/wiki/API">mediawiki API</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.dataunbound.com/2009/04/10/pageidcurid-as-a-unique-id-for-wikipedia-pages/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

