<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Data Unbound</title>
	<atom:link href="http://blog.dataunbound.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.dataunbound.com</link>
	<description>Data Architect, Consultant, Trainer, and Author Raymond Yee on data and software in research and education</description>
	<lastBuildDate>Thu, 18 Jun 2009 17:56:15 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<image>
  <link>http://blog.dataunbound.com</link>
  <url>http://blog.dataunbound.com/wp-content/plugins/favicon-manager/dataunbound.ico</url>
  <title>Data Unbound</title>
</image>
		<item>
		<title>A first pass at an org chart for the US Federal government</title>
		<link>http://blog.dataunbound.com/2009/06/18/a-first-pass-at-an-org-chart-for-the-us-federal-government/</link>
		<comments>http://blog.dataunbound.com/2009/06/18/a-first-pass-at-an-org-chart-for-the-us-federal-government/#comments</comments>
		<pubDate>Thu, 18 Jun 2009 17:56:15 +0000</pubDate>
		<dc:creator>Raymond Yee</dc:creator>
				<category><![CDATA[government]]></category>
		<category><![CDATA[budget]]></category>
		<category><![CDATA[OMB]]></category>
		<category><![CDATA[OPML]]></category>
		<category><![CDATA[org chart]]></category>
		<category><![CDATA[US government]]></category>

		<guid isPermaLink="false">http://blog.dataunbound.com/?p=659</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=A+first+pass+at+an+org+chart+for+the+US+Federal+government&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=government&amp;rft.source=Data+Unbound&amp;rft.date=2009-06-18&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/06/18/a-first-pass-at-an-org-chart-for-the-us-federal-government/&amp;rft.language=English"></span>
When I started trying to understand how the US Government works, I&#039;ve been trying to find a chart that would list all the different department, agencies, and other organizational entities that comprise the government &#8212; and show how they are related to each other.  I can&#039;t believe that I&#039;d be the only person to [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=A+first+pass+at+an+org+chart+for+the+US+Federal+government&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=government&amp;rft.source=Data+Unbound&amp;rft.date=2009-06-18&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/06/18/a-first-pass-at-an-org-chart-for-the-us-federal-government/&amp;rft.language=English"></span>
<p>When I started trying to understand how the US Government works, I&#039;ve been trying to find a chart that would list all the different department, agencies, and other organizational entities that comprise the government &#8212; and show how they are related to each other.  I can&#039;t believe that I&#039;d be the only person to find such an org chart useful; indeed, this idea is echoed in a project idea listed on the Sunlight Labs wiki as <a href="http://wiki.sunlightlabs.com/index.php/OPML_the_Federal_Government">OPML the Federal Government</a>:  </p>
<blockquote><p>Project Idea: This is a quick win&#8211; just create an OPML file of the existing structure of the Federal Government agencies in all branches.
</p></blockquote>
<p>As a step to creating such a representation, I&#039;ve scraped the data in
<div style="line-height:1.1em;margin-left:0.5in;text-indent:-0.5in;">
<p style="margin:0">Appendix C of OMB Circular No. A-11 (Sept 2008).<a href="http://www.whitehouse.gov/omb/circulars/a11/current_year/app_c.pdf">http://www.whitehouse.gov/omb/circulars/a11/current_year/app_c.pdf</a>.</p>
</div>
<blockquote><p>Under the MAX system, OMB assigns agency and bureau codes that are used to identify and access data in the budget database. The following table lists these codes in budget order. It also provides the corresponding agency codes assigned by Treasury. In certain instances, a different Treasury agency code may be used for some accounts in an agency; a complete listing can be found in the Budget Accounts Title (BAT) file.</p></blockquote>
<p>I&#039;ve uploaded this PDF to scribd to make it easier for readers to see the data the pdf has:</p>
<p>Source<br />
<a style="margin: 12px auto 6px auto; font-family: Helvetica,Arial,Sans-serif; font-style: normal; font-variant: normal; font-weight: normal; font-size: 14px; line-height: normal; font-size-adjust: none; font-stretch: normal; -x-system-font: none; display: block; text-decoration: underline;" title="View OMB Circular a 11 Appendix c on Scribd" href="http://www.scribd.com/doc/16554820/OMB-Circular-a-11-Appendix-c">OMB Circular a 11 Appendix c</a> <object id="doc_107398057351895" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="100%" height="500" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="name" value="doc_107398057351895" /><param name="align" value="middle" /><param name="quality" value="high" /><param name="play" value="true" /><param name="loop" value="true" /><param name="scale" value="showall" /><param name="wmode" value="opaque" /><param name="devicefont" value="false" /><param name="bgcolor" value="#ffffff" /><param name="menu" value="true" /><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://d.scribd.com/ScribdViewer.swf?document_id=16554820&amp;access_key=key-21e6erbhy5kh3zigvgh9&amp;page=1&amp;version=1&amp;viewMode=" /><param name="allowfullscreen" value="true" /><embed id="doc_107398057351895" type="application/x-shockwave-flash" width="100%" height="500" src="http://d.scribd.com/ScribdViewer.swf?document_id=16554820&amp;access_key=key-21e6erbhy5kh3zigvgh9&amp;page=1&amp;version=1&amp;viewMode=" allowscriptaccess="always" allowfullscreen="true" menu="true" bgcolor="#ffffff" devicefont="false" wmode="opaque" scale="showall" loop="true" play="true" quality="high" align="middle" name="doc_107398057351895"></embed></object></p>
<p>I read this PDF into Adobe Acrobat 8, saved it as &#034;XML 1.0&#034;, messaged the XML a bit by hand to make it easier to apply some XQuery to create a starter OPML 1.0 file, and then did some more manual editing to represent the data in the correct hierarchy to produce: </p>
<p><a href="http://labs.dataunbound.com/doc/2009/06/OMB_A_11_C.xml">http://labs.dataunbound.com/doc/2009/06/OMB_A_11_C.xml</a></p>
<p>My working assumption is that OMB Agency/Bureau codes + Treasury Agency Codes provide the key to unlocking a significant part of the higher levels of the US Federal Government.  More on this assumption later.</p>
<p>You can see this OPML rendered by optimalbrowser.com as such:  </p>
<p><iframe src="http://optimalbrowser.com/?url=http%3A%2F%2Flabs.dataunbound.com%2Fdoc%2F2009%2F06%2FOMB_A_11_C.xml&amp;widget=1&amp;depth=0&amp;linktarget=_top" height="400" width="500" style="border: none;"></iframe></p>
<p>Some Possible Next Steps:</p>
<ul>
<li>I think the OPML is correct &#8212; but I need to do a closer look through the OPML to make sure I didn&#039;t make any errors.</li>
<li>It&#039;d be a good long-term good to get this org chart data in nice machine-readable from from the federal government in the first place.  Might it come from the GSA <a href="http://groups.google.com/group/sunlightlabs/msg/0e6aa04965fd6489">as Clay Johnson wondered on the Sunlight Labs list</a>?</li>
<li>Barring getting the data directly from the Feds in a nice form, we can look at scraping the data from sources such as http://www.usa.gov/Agencies/federal.shtml .  My colleagues Eric Kansa and Erik Wilde did do a scrape of <a href="http://www.usa.gov/Agencies/Federal/All_Agencies/index.shtml">A-Z Index of U.S. Government Departments and Agencies (A): USA.gov</a> into <a href="http://isd.ischool.berkeley.edu/stimulus/agencies/agencies.xml">http://isd.ischool.berkeley.edu/stimulus/agencies/agencies.xml</a>.  I&#039;d like to reconcile my OPML file with this list of agencies so that we can tie OMB/treasury codes with the URLs of websites.</li>
<li>Instead of scraping Appendix C of OMB Circular A-11, one might dive directly into the s<a href="http://www.whitehouse.gov/omb/budget/Supplemental/">preadsheets from the Federal budget database</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://blog.dataunbound.com/2009/06/18/a-first-pass-at-an-org-chart-for-the-us-federal-government/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>copyright status of White House photos on Flickr?</title>
		<link>http://blog.dataunbound.com/2009/05/02/copyright-status-of-white-house-photos-on-flickr/</link>
		<comments>http://blog.dataunbound.com/2009/05/02/copyright-status-of-white-house-photos-on-flickr/#comments</comments>
		<pubDate>Sat, 02 May 2009 18:51:07 +0000</pubDate>
		<dc:creator>Raymond Yee</dc:creator>
				<category><![CDATA[copyright]]></category>
		<category><![CDATA[creative commons]]></category>
		<category><![CDATA[government]]></category>
		<category><![CDATA[Flickr]]></category>
		<category><![CDATA[public domain]]></category>
		<category><![CDATA[White House]]></category>

		<guid isPermaLink="false">http://blog.dataunbound.com/?p=630</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=copyright+status+of+White+House+photos+on+Flickr%3F&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=copyright&amp;rft.subject=creative+commons&amp;rft.subject=government&amp;rft.source=Data+Unbound&amp;rft.date=2009-05-02&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/05/02/copyright-status-of-white-house-photos-on-flickr/&amp;rft.language=English"></span>
On the &#034;noise list&#034; at the School of Information at Berkeley, we recently got into a discussion about the copyright status of The Official White House Photostream&#039;s Photostream on Flickr.   Some of us would agree with the argument presented on the blog of the Creative Commons (Why Did the White House Choose Attribution and not [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=copyright+status+of+White+House+photos+on+Flickr%3F&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=copyright&amp;rft.subject=creative+commons&amp;rft.subject=government&amp;rft.source=Data+Unbound&amp;rft.date=2009-05-02&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/05/02/copyright-status-of-white-house-photos-on-flickr/&amp;rft.language=English"></span>
<p>On the &#034;noise list&#034; at the School of Information at Berkeley, we recently got into a discussion about the copyright status of <a href="http://www.flickr.com/photos/whitehouse/">The Official White House Photostream&#039;s Photostream</a> on Flickr.   Some of us would agree with the argument presented on the blog of the Creative Commons (<a href="http://creativecommons.org/weblog/entry/14237">Why Did the White House Choose Attribution and not Public Domain?</a>) that</p>
<blockquote><p>The photos are likely in the public domain because <a onclick="javascript:pageTracker._trackPageview ('/outbound/en.wikipedia.org');" href="http://en.wikipedia.org/wiki/Copyright_status_of_work_by_the_U.S._government">they are works created by the federal government</a> and not entitled to copyright protection. As you might recall, the <a onclick="javascript:pageTracker._trackPageview ('/outbound/www.whitehouse.gov');" href="http://www.whitehouse.gov/copyright">Whitehouse.gov’s copyright notice</a> indicates as much.</p></blockquote>
<p>At present, Flickr doesn&#039;t allow an ordinary user to state that one of his pictures is in the public domain.    I&#039;ve been waiting for <a href="http://wiki.creativecommons.org/CC0">CC0</a> to be added to list of CC licenses you can use.  Of course, the White House isn&#039;t necessarily a regular joe user,  and there is already a structure in Flickr already to handle public domain-ish photos:  the <a href="http://www.flickr.com/commons">Flickr Commons</a> with its <a href="http://www.flickr.com/commons/usage/">&#034;no known copyright restrictions&#034; provision</a> .    Perhaps putting White House photos in Flickr Commons won&#039;t quite work either.  Would one be able to put images produced by the US government  into the Flickr Commons in general?</p>
<p>BTW, one of the comments on the Creative Commons post pointed to <a class="moz-txt-link-freetext" href="http://www.flickr.com/people/whitehouse/">http://www.flickr.com/people/whitehouse/</a> in which we find the following stipulations:</p>
<blockquote><p>These official White House photographs are being made available for publication by news organizations and/or for personal use printing by the subject(s) of the photographs. The photographs may not be used in materials, advertisements, products, or promotions that in any way suggest approval or endorsement of the President, the First Family, or the White House.</p></blockquote>
<p>Given that these photos are arguably in the public domain (as argued in the creativecommons.org blog post), are these stipulations legally enforceable?</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.dataunbound.com/2009/05/02/copyright-status-of-white-house-photos-on-flickr/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Previous recommendations would say &quot;open the data&quot; to Recovery.gov</title>
		<link>http://blog.dataunbound.com/2009/04/30/previous-recommendations-would-say-open-the-data-to-recoverygov/</link>
		<comments>http://blog.dataunbound.com/2009/04/30/previous-recommendations-would-say-open-the-data-to-recoverygov/#comments</comments>
		<pubDate>Fri, 01 May 2009 00:10:24 +0000</pubDate>
		<dc:creator>Raymond Yee</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.dataunbound.com/?p=621</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Previous+recommendations+would+say+%26%23034%3Bopen+the+data%26%23034%3B+to+Recovery.gov&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=Uncategorized&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-30&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/30/previous-recommendations-would-say-open-the-data-to-recoverygov/&amp;rft.language=English"></span>
As many have jumped into making recommendations on how Recovery data  should be packaged and disseminated, I&#039;m reminded of some important previous work in this area.
The first is the ACM U.S. Public Policy Committee (USACM) Recommendations on Open Government.  I have a tremendous respect for the ACM as  &#034;the world’s largest educational and [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Previous+recommendations+would+say+%26%23034%3Bopen+the+data%26%23034%3B+to+Recovery.gov&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=Uncategorized&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-30&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/30/previous-recommendations-would-say-open-the-data-to-recoverygov/&amp;rft.language=English"></span>
<p>As many have jumped into making recommendations on how Recovery data  should be packaged and disseminated, I&#039;m reminded of some important previous work in this area.</p>
<p>The first is the <a href="http://www.acm.org/public-policy/open-government">ACM U.S. Public Policy Committee (USACM) Recommendations on Open Government</a>.  I have a tremendous respect for the ACM as  &#034;the world’s largest educational and scientific computing society&#034;.  The <a href="http://usacm.acm.org/usacm/about.html">ACM U.S. Public Policy Committee (USACM)</a> &#034;serves as the focal point for ACM&#039;s interaction with U.S. government organizations, the computing community, and the U.S. public in all matters of U.S. public policy related to information technology.&#034;   The policy statement on &#034;open government&#034;  first sets the context for its recommendations:</p>
<blockquote><p>Individual citizens, companies and organizations have begun to use computers to analyze government data, often creating and sharing tools that allow others to perform their own analyses. This process can be enhanced by government policies that promote data reusability, which often can be achieved through modest technical measures. But today, various parts of governments at all levels have differing and sometimes detrimental policies toward promoting a vibrant landscape of third-party web sites and tools that can enhance the usefulness of government data.</p></blockquote>
<p>The recommendations  &#034;for data that is already considered public information&#034; are:</p>
<div>
<ul>
<li> Data published by the government should be in formats and approaches that promote analysis and reuse of that data.</li>
</ul>
<ul>
<li> Data republished by the government that has been received or stored in a machine-readable format (such as online regulatory filings) should preserve the machine-readability of that data.</li>
</ul>
<ul>
<li> Information should be posted so as to also be accessible to citizens with limitations and disabilities.</li>
</ul>
<ul>
<li> Citizens should be able to download complete datasets of regulatory, legislative or other information, or appropriately chosen subsets of that information, when it is published by government.</li>
</ul>
<ul>
<li> Citizens should be able to directly access government-published datasets using standard methods such as queries via an API (Application Programming Interface).</li>
</ul>
<ul>
<li> Government bodies publishing data online should always seek to publish using data formats that do not include executable content.</li>
</ul>
<ul>
<li> Published content should be digitally signed or include attestation of publication/creation date, authenticity, and integrity.</li>
</ul>
</div>
<p>The second is a set of <a href="http://wiki.opengovdata.org/index.php/OpenDataPrinciples">Open Government Data Principles</a> formulated in October 2007  by the <a href="http://public.resource.org/open_government_meeting.html">Open Government Working Group</a>,  &#034;30 open government advocates gathered to develop a set of principles of open government data&#034;:</p>
<blockquote><p>Government data shall be considered open if they are made public in a way that complies with the principles below:</p>
<dl>
<dt> 1. <a title="Talk:OpenDataPrinciples/complete" href="http://wiki.opengovdata.org/index.php/Talk:OpenDataPrinciples/complete">Complete</a> </dt>
<dd> <em>All public data are made available.  <a title="Talk:OpenDataPrinciples/public" href="http://wiki.opengovdata.org/index.php/Talk:OpenDataPrinciples/public">Public</a> <a title="OpenDataPrinciples/data" href="http://wiki.opengovdata.org/index.php/OpenDataPrinciples/data">data</a> are data that are not subject to valid privacy, security or privilege limitations.</em> </dd>
</dl>
<dl>
<dt> 2. <a title="Talk:OpenDataPrinciples/primary" href="http://wiki.opengovdata.org/index.php/Talk:OpenDataPrinciples/primary">Primary</a> </dt>
<dd> <em>Data are collected at the source, with the finest possible level of granularity, not in aggregate or modified forms.</em> </dd>
</dl>
<dl>
<dt> 3. <a title="Talk:OpenDataPrinciples/timely" href="http://wiki.opengovdata.org/index.php/Talk:OpenDataPrinciples/timely">Timely</a> </dt>
<dd> <em>Data are made available as quickly as necessary to preserve the value of the data.</em> </dd>
</dl>
<dl>
<dt> 4. <a title="Talk:OpenDataPrinciples/accessible" href="http://wiki.opengovdata.org/index.php/Talk:OpenDataPrinciples/accessible">Accessible</a> </dt>
<dd> <em>Data are available to the widest range of users for the widest range of purposes.</em> </dd>
</dl>
<dl>
<dt> 5. <a title="Talk:OpenDataPrinciples/machine processable" href="http://wiki.opengovdata.org/index.php/Talk:OpenDataPrinciples/machine_processable">Machine processable</a> </dt>
<dd> <em>Data are reasonably structured to allow automated processing.</em> </dd>
</dl>
<dl>
<dt> 6. <a title="Talk:OpenDataPrinciples/non-discriminatory" href="http://wiki.opengovdata.org/index.php/Talk:OpenDataPrinciples/non-discriminatory">Non-discriminatory</a> </dt>
<dd> <em>Data are available to anyone, with no requirement of registration.</em> </dd>
</dl>
<dl>
<dt> 7. <a title="Talk:OpenDataPrinciples/non-proprietary" href="http://wiki.opengovdata.org/index.php/Talk:OpenDataPrinciples/non-proprietary">Non-proprietary</a> </dt>
<dd> <em>Data are available in a format over which no entity has exclusive control.</em> </dd>
</dl>
<dl>
<dt> 8. <a title="Talk:OpenDataPrinciples/unlicensed" href="http://wiki.opengovdata.org/index.php/Talk:OpenDataPrinciples/unlicensed">License-free</a> </dt>
<dd> <em>Data are not subject to any copyright, patent, trademark or trade secret regulation. Reasonable privacy, security and privilege restrictions may be allowed.</em> </dd>
</dl>
<p>Compliance must be <a title="OpenDataPrinciples/reviewable" href="http://wiki.opengovdata.org/index.php/OpenDataPrinciples/reviewable">reviewable</a>.</p></blockquote>
<p>The final is the paper  “<a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1138083">Government Data and the Invisible Hand</a>.” (<span style="font-style: italic;">Yale Journal of Law &amp; Technology</span> 11: 160.) by David Robinson, Harlan Yu, and Edward Felten.  The abstract contains the following recommendation:</p>
<blockquote><p>Today, government bodies consider their own websites to be a higher priority than technical infrastructures that open up their data for others to use&#8230;.It would be preferable for government to understand providing reusable data, rather than providing websites, as the core of its online publishing responsibility.</p></blockquote>
<p>In  ProgrammableWeb last year, I <a href="http://blog.programmableweb.com/2008/06/18/should-the-feds-focus-on-reusable-data-or-user-interfaces/">distilled the paper&#039;s argument</a> as follows:</p>
<blockquote><p>The conclusion is based on a claim that the executive branch is comparatively ineffective at creating tools for presenting data and should therefore leave that work to a private sector (either nonprofit or commercial entities) that is best able to respond to a wide variety of possible uses for government data. That doesn’t mean that the government should provide no user interface to the data, but rather “should focus on creating a simple, reliable and publicly accessible infrastructure that exposes the underlying data.” Fancier interfaces and tools should be built by others.</p>
<p>Moreover, the authors have recommended a specific mechanism for ensuring that the government does not privilege any user interface over their public data infrastructure: “require that federal websites themselves use the same open systems for accessing the underlying data as they make available to the public at large.”</p></blockquote>
<p>Let me now make sure that these recommendations are at least referenced somewhere at the &#034;National Dialogue&#034; around the Recovery.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.dataunbound.com/2009/04/30/previous-recommendations-would-say-open-the-data-to-recoverygov/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Amazon Web services in education program</title>
		<link>http://blog.dataunbound.com/2009/04/30/amazon-web-services-in-education-program/</link>
		<comments>http://blog.dataunbound.com/2009/04/30/amazon-web-services-in-education-program/#comments</comments>
		<pubDate>Thu, 30 Apr 2009 14:52:25 +0000</pubDate>
		<dc:creator>Raymond Yee</dc:creator>
				<category><![CDATA[Amazon]]></category>

		<guid isPermaLink="false">http://blog.dataunbound.com/?p=617</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Amazon+Web+services+in+education+program&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=Amazon&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-30&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/30/amazon-web-services-in-education-program/&amp;rft.language=English"></span>
Next time I teach my Mixing and Remixing Information course, I&#039;ll probably apply for a grant from the AWS in Education program:
AWS in Education provides a set of programs that enable the worldwide academic community to easily leverage the benefits of Amazon Web Services for teaching and research. With AWS in Education, educators, academic researchers, [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Amazon+Web+services+in+education+program&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=Amazon&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-30&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/30/amazon-web-services-in-education-program/&amp;rft.language=English"></span>
<p>Next time I teach my Mixing and Remixing Information course, I&#039;ll probably apply for a grant from the <a href="http://aws.amazon.com/education/">AWS in Education</a> program:</p>
<blockquote><p>AWS in Education provides a set of programs that enable the worldwide academic community to easily leverage the benefits of Amazon Web Services for teaching and research. With AWS in Education, educators, academic researchers, and students can apply to obtain free usage credits to tap into the on-demand infrastructure of Amazon Web Services to teach advanced courses, tackle research endeavors and explore new projects – tasks that previously would have required expensive up-front and ongoing investments in infrastructure.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://blog.dataunbound.com/2009/04/30/amazon-web-services-in-education-program/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I&#039;ll be teaching a seminar on mashups at the Educause 2009 Annual Conference</title>
		<link>http://blog.dataunbound.com/2009/04/28/ill-be-teaching-a-seminar-on-mashups-at-the-educause-2009-annual-conference/</link>
		<comments>http://blog.dataunbound.com/2009/04/28/ill-be-teaching-a-seminar-on-mashups-at-the-educause-2009-annual-conference/#comments</comments>
		<pubDate>Tue, 28 Apr 2009 21:19:52 +0000</pubDate>
		<dc:creator>Raymond Yee</dc:creator>
				<category><![CDATA[training]]></category>
		<category><![CDATA[tutorial]]></category>
		<category><![CDATA[API]]></category>
		<category><![CDATA[Educause]]></category>
		<category><![CDATA[higher education]]></category>
		<category><![CDATA[mashup]]></category>

		<guid isPermaLink="false">http://blog.dataunbound.com/?p=545</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=I%26%23039%3Bll+be+teaching+a+seminar+on+mashups+at+the+Educause+2009+Annual+Conference&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=training&amp;rft.subject=tutorial&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-28&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/28/ill-be-teaching-a-seminar-on-mashups-at-the-educause-2009-annual-conference/&amp;rft.language=English"></span>
I&#039;m excited to be teaching a pre-conference seminar at the Educause 2009 Annual Conference.  My proposal for running a half-day seminar Creating and Enabling Web Mashups was accepted.  The seminar will take place at Tuesday, November 3, 2009 at 8:30AM.  I&#039;m looking forward to spending some time in Denver.
Here&#039;s a short abstract for the [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=I%26%23039%3Bll+be+teaching+a+seminar+on+mashups+at+the+Educause+2009+Annual+Conference&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=training&amp;rft.subject=tutorial&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-28&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/28/ill-be-teaching-a-seminar-on-mashups-at-the-educause-2009-annual-conference/&amp;rft.language=English"></span>
<p>I&#039;m excited to be teaching a <a href="http://net.educause.edu/Program/Program/1020919?HEADING=Preconference%20Seminars&amp;INCLUDE=SEM">pre-conference seminar</a> at the <a href="http://net.educause.edu/e09/">Educause 2009 Annual Conference</a>.  My proposal for running a half-day seminar <a href="http://net.educause.edu/E09/Program/1020920?PRODUCT_CODE=E09/SEM05A">Creating and Enabling Web Mashups</a> was accepted.  The seminar will take place at Tuesday, November 3, 2009 at 8:30AM.  I&#039;m looking forward to spending some time in Denver.</p>
<p>Here&#039;s a short abstract for the session:</p>
<blockquote><p>There are thousands of web mashups that recombine everything from Google Maps and Flickr with useful data drawn from multiple website.  Mashups are educational, fun, and even transformative.  In this tutorial, you will begin to build mashups that address problems of interest to you.   You will learn how to combine APIs and data into mashups.   You will also learn how to let others recombine content from your website.</p></blockquote>
<p>Here&#039;s a longer abstract:</p>
<blockquote><p>The Web contains thousands of mashups that recombine everything from Google Maps, Flickr, Amazon.com, the New York Times  with useful information about travel, finance, real estate, and more. By fusing elements from multiple web sites, mashups are often informative, fun, and even transformative &#8212; representing the way the Web as a whole is heading.</p>
<p>In this hands-on tutorial, you will learn how to build basic mashups and how to develop mashups to address problems of interest to you.   You will learn how to exploit such web elements as URLs, tags, and RSS feeds in your mashups; and how to combine APIs and data into mashups.   You will also learn how to enable users to recombine content from your website.  Although the most sophisticated mashups demand a wide range of technical knowledge, anyone with a solid knowledge of HTML will be able to learn practical skills from this tutorial.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://blog.dataunbound.com/2009/04/28/ill-be-teaching-a-seminar-on-mashups-at-the-educause-2009-annual-conference/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Congressional Oversight Panel, TARP, and Elizabeth Warren</title>
		<link>http://blog.dataunbound.com/2009/04/28/congressional-oversight-panel-tarp-and-elizabeth-warren/</link>
		<comments>http://blog.dataunbound.com/2009/04/28/congressional-oversight-panel-tarp-and-elizabeth-warren/#comments</comments>
		<pubDate>Tue, 28 Apr 2009 21:13:59 +0000</pubDate>
		<dc:creator>Raymond Yee</dc:creator>
				<category><![CDATA[government]]></category>
		<category><![CDATA[Congressional Oversight]]></category>
		<category><![CDATA[Elizabeth Warren]]></category>
		<category><![CDATA[Jon Stewart]]></category>
		<category><![CDATA[TARP]]></category>

		<guid isPermaLink="false">http://blog.dataunbound.com/?p=555</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Congressional+Oversight+Panel%2C+TARP%2C+and+Elizabeth+Warren&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=government&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-28&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/28/congressional-oversight-panel-tarp-and-elizabeth-warren/&amp;rft.language=English"></span>
I wish I had time to follow the TARP carefully &#8212; following the Stimulus already keeps busy enough.   However, I learned a lot from Jon Stewart&#039;s April 15 interview with Elizabeth Warren the head of the Congressional Oversight Panel:   Part 1 and Part 2.
]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Congressional+Oversight+Panel%2C+TARP%2C+and+Elizabeth+Warren&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=government&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-28&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/28/congressional-oversight-panel-tarp-and-elizabeth-warren/&amp;rft.language=English"></span>
<p>I wish I had time to follow the TARP carefully &#8212; <a href="http://blog.dataunbound.com/category/recoverygov-tracking/">following the Stimulus</a> already keeps busy enough.   However, I learned a lot from Jon Stewart&#039;s April 15 interview with <a href="http://en.wikipedia.org/wiki/Elizabeth_Warren">Elizabeth Warren</a> the head of the <a href="http://cop.senate.gov/">Congressional Oversight Panel</a>:   <a href="http://www.thedailyshow.com/video/index.jhtml?videoId=224261&amp;title=Elizabeth-Warren-Pt.-1">Part 1</a> and <a href="http://www.thedailyshow.com/video/index.jhtml?videoId=224262&amp;title=Elizabeth-Warren-Pt.-2">Part 2</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.dataunbound.com/2009/04/28/congressional-oversight-panel-tarp-and-elizabeth-warren/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Participating in the national online dialogue around recovery.gov</title>
		<link>http://blog.dataunbound.com/2009/04/28/participating-in-the-national-online-dialogue-around-recoverygov/</link>
		<comments>http://blog.dataunbound.com/2009/04/28/participating-in-the-national-online-dialogue-around-recoverygov/#comments</comments>
		<pubDate>Tue, 28 Apr 2009 21:00:36 +0000</pubDate>
		<dc:creator>Raymond Yee</dc:creator>
				<category><![CDATA[government]]></category>
		<category><![CDATA[recovery.gov tracking]]></category>
		<category><![CDATA[recovery]]></category>
		<category><![CDATA[stimulus]]></category>
		<category><![CDATA[thenationaldialogue]]></category>

		<guid isPermaLink="false">http://blog.dataunbound.com/?p=595</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Participating+in+the+national+online+dialogue+around+recovery.gov&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=government&amp;rft.subject=recovery.gov+tracking&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-28&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/28/participating-in-the-national-online-dialogue-around-recoverygov/&amp;rft.language=English"></span>
Yesterday, I wrote a story on ProgrammableWeb (An Online Dialogue to Shape Recovery.gov) to educate readers on recovery.gov (the government website aimed to let American track the spending of money arising from the  American Recovery and Reinvestment Act of 2009 &#8212; the &#034;Stimulus Package&#034;)   and to draw attention to a “national dialogue” this week (until [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Participating+in+the+national+online+dialogue+around+recovery.gov&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=government&amp;rft.subject=recovery.gov+tracking&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-28&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/28/participating-in-the-national-online-dialogue-around-recoverygov/&amp;rft.language=English"></span>
<p>Yesterday, I wrote a story on <a href="http://programmableweb.com">ProgrammableWeb</a> (<a href="http://blog.programmableweb.com/2009/04/27/an-online-dialogue-to-shape-recoverygov/">An Online Dialogue to Shape Recovery.gov</a>) to educate readers on <a href="http://recovery.gov">recovery.gov</a> (the government website aimed to let American track the spending of money arising from the  <a href="http://en.wikipedia.org/wiki/American_Recovery_and_Reinvestment_Act_of_2009">American Recovery and Reinvestment Act of 2009</a> &#8212; the &#034;Stimulus Package&#034;)   and to draw attention to <a href="http://www.thenationaldialogue.org/">a “national dialogue”</a> this week (until May 3) to solicit ideas aimed at answering the key question:</p>
<blockquote><p>What ideas, tools, and approaches can make Recovery.gov a place where all citizens can transparently monitor the expenditure and use of recovery funds?</p></blockquote>
<p>I&#039;ve been reading some of the <a href="http://www.thenationaldialogue.org/ideas/">ideas presented so far</a> and voted on a couple.  I added comments to two so far.   In response to the proposal  <a href="http://www.thenationaldialogue.org/ideas/xml-web-services">XML Web Services</a> (&#034;Make recovery data available as a web service via SOAP XML.&#034;), I wrote:</p>
<blockquote><p>I agree that some type of rigorous programmatic interface that allows developers to access the data from recovery.gov is essential.  I think that SOAP and associated the rest of WS-* stack might be one way to implement such access mechanisms, but I would not want SOAP to the exclusive protocol used.   I would argue, for instance, that a RESTful approach is also an excellent alternative to consider for recovery.gov.</p></blockquote>
<p>On a front closer to what our work has been about, in response to <a href="http://www.thenationaldialogue.org/ideas/making-stimulus-spending-data-accessible-to-the-public">Making stimulus spending data accessible to the public</a>, I wrote</p>
<blockquote><p>I&#039;m one of the Berkeley researchers mentioned above involved with making recommendations on how data feeds should be use to make the recovery more transparent (see <a href="http://www.ischool.berkeley.edu/newsandevents/news/20090417recoveryguidelines">http://www.ischool.berkeley.edu/newsandevents/news/20090417recoveryguidelines</a> and <a href="http://isd.ischool.berkeley.edu/stimulus/2009-029/">http://isd.ischool.berkeley.edu/stimulus/2009-029/</a>)</p>
<p>Although some (but not all) agencies receiving and dispersing recovery funds are using feeds in their reporting (see a list that we compiled at http://isd.ischool.berkeley.edu/stimulus/feeds/feeds.html), the best data on dollars appropriated, obligated, or spent is in the Excel spreadsheets.  Although there are apparently templates for the reports, they keep changing format and there&#039;s nothing to stop agencies from inserting extra fields or omitting other fields.   We know this for a fact since we&#039;ve written programs to scrape the data from the spreadsheets and find it a challenge to keep up with changes that keep breaking our scripts.</p>
<p>The federal government should made the data in the form of  XML feeds in the first place (backed by a schema so that we can check that the data is valid),  instead of making people who want to use that data scrape it out of Excel in a highly fragile process.</p></blockquote>
<p>As I wrote yesterday, it will be interesting to see how well the recovery.gov site actually does at aggregating a large number of proposals and surfacing the best ones.  Moreover,</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.dataunbound.com/2009/04/28/participating-in-the-national-online-dialogue-around-recoverygov/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tracking the stimulus/recovery in the news</title>
		<link>http://blog.dataunbound.com/2009/04/28/tracking-the-stimulusrecovery-in-the-news/</link>
		<comments>http://blog.dataunbound.com/2009/04/28/tracking-the-stimulusrecovery-in-the-news/#comments</comments>
		<pubDate>Tue, 28 Apr 2009 19:47:27 +0000</pubDate>
		<dc:creator>Raymond Yee</dc:creator>
				<category><![CDATA[recovery.gov tracking]]></category>

		<guid isPermaLink="false">http://blog.dataunbound.com/?p=577</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Tracking+the+stimulus%2Frecovery+in+the+news&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=recovery.gov+tracking&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-28&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/28/tracking-the-stimulusrecovery-in-the-news/&amp;rft.language=English"></span>
Over the last couple of months, I&#039;ve been studying the Stimulus through the lens of the weekly reports published on recovery.gov.   My colleagues Erik Wilde and Eric Kansa (at the School of Information at UC Berkeley) and I  made recommendations on how data feeds should be used to foster transparency around stimulus data,  in addition [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Tracking+the+stimulus%2Frecovery+in+the+news&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=recovery.gov+tracking&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-28&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/28/tracking-the-stimulusrecovery-in-the-news/&amp;rft.language=English"></span>
<p>Over the last couple of months, I&#039;ve been studying the <a href="http://www.recovery.gov/">Stimulus</a> through the lens of the <a href="http://www.recovery.gov/?q=content/agency-weekly-reports">weekly reports published on recovery.gov</a>.   My colleagues <a href="http://dret.net/netdret/">Erik Wilde</a> and <a href="http://isd.ischool.berkeley.edu/person/ekansa">Eric Kansa</a> (at the <a href="http://ischoo.berkeley.edu/">School of Information at UC Berkeley</a>) and I  <a href="http://www.ischool.berkeley.edu/newsandevents/news/20090417recoveryguidelines">made recommendations on how data feeds should be used</a> to foster transparency around stimulus data,  in addition to <a href="http://isd.ischool.berkeley.edu/stimulus/2009-029/">developing prototypes</a> of the types of visualizations one could do with such data feeds.   We&#039;re continuing work on that front, specifically scraping data currently found in Excel and transforming that data into XML (Atom) feeds.</p>
<p>It is much easier to transform the financial data  into visualizations and analyses, once it is in the form of feeds (rather than Excel).   The federal government should made the data in the form of  XML in the first place  (backed by a schema so that we can check that the data is valid),  instead of making people who want to use that data scrape the data out of Excel in a highly fragile process.</p>
<p>To discern the meaning of the data we are extracting from various government sites,  I am now trying to keep up with the news around the recovery.  Here are some of the sources I&#039;ve been tracking so far:</p>
<ul>
<li>My first stop is usually <a href="http://www.propublica.org/ion/stimulus">Eye on the Stimulus</a> section of <a href="http://www.propublica.org/">ProPublica</a>, which according to its <a href="http://www.propublica.org/about">About Us page</a> is &#034;an independent, non-profit newsroom that produces investigative journalism in the public interest.&#034;  There&#039;s so much great material on the site that deserves an in-depth description.  For instance, I&#039;ve relied heavily on <a href="http://www.propublica.org/special/the-stimulus-plan-a-detailed-list-of-spending">The Stimulus Plan: A Detailed List of Spending</a> to get an overall pictures of where money is supposed to be going.</li>
<li>I only realized this morning that  ProPublica&#039;s stimulus coverage is part of <a href="http://www.shovelwatch.org/">ShovelWatch</a>, a partnership for &#034;Tracking the Stimulus From Bills to Buildings&#034; consisting of  <a href="http://www.propublica.org/">ProPublica</a> and two sister organizations  <a href="http://www.thetakeaway.org/about/">The Takeaway</a> (a &#034;new national morning news program&#034;) and <a href="http://www.wnyc.org/">WNYC</a>.</li>
<li>The <em>New York Times</em> has extensive coverage  at its topic page <a href="http://topics.nytimes.com/topics/reference/timestopics/subjects/u/united_states_economy/economic_stimulus/index.html">Economic Stimulus</a>.</li>
<li>I will start tracking stories matching the search term  <a href="http://news.google.com/news?pz=1&amp;ned=us&amp;hl=en&amp;q=stimulus+OR+recovery&amp;cf=all&amp;as_qdr=w&amp;as_drrb=q">&#034;recovery or stimulus&#034; on Google News</a>.</li>
<li>I like to listen to <a href="http://www.npr.org/search.php?text=stimulus+or+recovery">NPR&#039;s coverage of the stimulus or recovery</a>.</li>
<li>Of course, there the <a href="http://www.recovery.gov/?q=content/news">news section of recovery.gov</a> itself &#8212; which is naturally missing any critical commentary.</li>
</ul>
<p>This list represents my current starting points.  I naturally expect to find a lot of other useful sources as I go along.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.dataunbound.com/2009/04/28/tracking-the-stimulusrecovery-in-the-news/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Typographical or semantic irregularities at recovery.gov?</title>
		<link>http://blog.dataunbound.com/2009/04/10/irregularities-at-recoverygov/</link>
		<comments>http://blog.dataunbound.com/2009/04/10/irregularities-at-recoverygov/#comments</comments>
		<pubDate>Sat, 11 Apr 2009 01:07:41 +0000</pubDate>
		<dc:creator>Raymond Yee</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.dataunbound.com/2009/04/10/irregularities-at-recoverygov/</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Typographical+or+semantic+irregularities+at+recovery.gov%3F&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=Uncategorized&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-10&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/10/irregularities-at-recoverygov/&amp;rft.language=English"></span>


Irregularities at recovery.gov
Originally uploaded by Raymond Yee

Why are there two reports with the same date?  This screenshot is from the reports from the Department of Labor on recovery.gov.
]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=Typographical+or+semantic+irregularities+at+recovery.gov%3F&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=Uncategorized&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-10&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/10/irregularities-at-recoverygov/&amp;rft.language=English"></span>
<div style="float: right; margin-left: 10px; margin-bottom: 10px;"><a title="photo sharing" href="http://www.flickr.com/photos/raymondyee/3429733115/"><img style="border: solid 2px #000000;" src="http://farm4.static.flickr.com/3323/3429733115_7c6a31812a_m.jpg" alt="" /></a></p>
<p><span style="font-size: 0.9em; margin-top: 0px;"><br />
<a href="http://www.flickr.com/photos/raymondyee/3429733115/">Irregularities at recovery.gov</a></p>
<p>Originally uploaded by <a href="http://www.flickr.com/people/raymondyee/">Raymond Yee</a><br />
</span></div>
<p>Why are there two reports with the same date?  This screenshot is from the <a href="http://www.recovery.gov/?q=content/weekly-report&amp;agency_code=16&amp;startdate=2009-04-03&amp;noofreports=1&amp;status=1&amp;report_id=140">reports from the Department of Labor</a> on recovery.gov.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.dataunbound.com/2009/04/10/irregularities-at-recoverygov/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>pageid/curid as a unique id for Wikipedia pages</title>
		<link>http://blog.dataunbound.com/2009/04/10/pageidcurid-as-a-unique-id-for-wikipedia-pages/</link>
		<comments>http://blog.dataunbound.com/2009/04/10/pageidcurid-as-a-unique-id-for-wikipedia-pages/#comments</comments>
		<pubDate>Fri, 10 Apr 2009 18:00:55 +0000</pubDate>
		<dc:creator>Raymond Yee</dc:creator>
				<category><![CDATA[Wikipedia]]></category>
		<category><![CDATA[freebase]]></category>

		<guid isPermaLink="false">http://blog.dataunbound.com/?p=469</guid>
		<description><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=pageid%2Fcurid+as+a+unique+id+for+Wikipedia+pages&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=Wikipedia&amp;rft.subject=freebase&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-10&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/10/pageidcurid-as-a-unique-id-for-wikipedia-pages/&amp;rft.language=English"></span>
In my learning how to program Freebase, I&#039;ve come across links to the Wikipedia that make use of a curid parameter.  For example,
http://en.wikipedia.org/wiki/index.html?curid=296716
is the same as
http://en.wikipedia.org/wiki/Daniel_Akaka
At least, the two pages seem to be the same thing as far as I can see.
How to do a lookup btween curid and the page title?  One way is [...]]]></description>
			<content:encoded><![CDATA[	
	<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Adc&amp;rfr_id=info%3Asid%2Focoins.info%3Agenerator&amp;rft.title=pageid%2Fcurid+as+a+unique+id+for+Wikipedia+pages&amp;rft.aulast=&amp;rft.aufirst=&amp;rft.subject=Wikipedia&amp;rft.subject=freebase&amp;rft.source=Data+Unbound&amp;rft.date=2009-04-10&amp;rft.type=blogPost&amp;rft.format=text&amp;rft.identifier=http://blog.dataunbound.com/2009/04/10/pageidcurid-as-a-unique-id-for-wikipedia-pages/&amp;rft.language=English"></span>
<p>In my learning how to program Freebase, I&#039;ve come across links to the Wikipedia that make use of a <em>curid </em>parameter.  For example,</p>
<p><a href="http://en.wikipedia.org/wiki/index.html?curid=296716">http://en.wikipedia.org/wiki/index.html?curid=296716</a></p>
<p>is the same as</p>
<p><a href="http://en.wikipedia.org/wiki/Daniel_Akaka">http://en.wikipedia.org/wiki/Daniel_Akaka</a></p>
<p>At least, the two pages seem to be the same thing as far as I can see.</p>
<p>How to do a lookup btween curid and the page title?  One way is ff we&#039;re screen-scraping, the page source of <a href="http://en.wikipedia.org/wiki/Daniel_Akaka">http://en.wikipedia.org/wiki/Daniel_Akaka</a> contains</p>
<pre id="line28">var wgArticleId = "296716";</pre>
<p>And if you go to <a href="http://en.wikipedia.org/wiki/index.html?curid=296716">http://en.wikipedia.org/wiki/index.html?curid=296716</a> lots of indication of what the title is, including the permanent link (e.g., <a href="http://en.wikipedia.org/w/index.php?title=Daniel_Akaka&amp;oldid=278490360">http://en.wikipedia.org/w/index.php?title=Daniel_Akaka&amp;oldid=278490360</a>)</p>
<p>To dig deeper, I might want to understand the <a href="http://www.mediawiki.org/wiki/Page_table">mediawiki data structure</a> and the <a href="http://www.mediawiki.org/wiki/API">mediawiki API</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.dataunbound.com/2009/04/10/pageidcurid-as-a-unique-id-for-wikipedia-pages/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
