<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Island 94 &#187; data</title>
	<atom:link href="http://www.island94.org/tag/data/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.island94.org</link>
	<description>Ben Sheldon&#039;s lost &#38; found</description>
	<lastBuildDate>Wed, 18 Jan 2012 17:04:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>The work itself isn’t inhumane</title>
		<link>http://www.island94.org/2011/10/the-work-itself-isnt-inhumane/</link>
		<comments>http://www.island94.org/2011/10/the-work-itself-isnt-inhumane/#comments</comments>
		<pubDate>Mon, 31 Oct 2011 22:54:02 +0000</pubDate>
		<dc:creator>Ben</dc:creator>
				<category><![CDATA[boring]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[databases]]></category>
		<category><![CDATA[employment]]></category>

		<guid isPermaLink="false">http://www.island94.org/?p=2944</guid>
		<description><![CDATA[Two weeks ago, while riding BART from East Bay to San Francisco, I was offered a transit survey from a BART Survey Team member. Pointing to what seemed to me to be a large number of written-response questions for a survey being delivered on a moving train, I asked her, "Do you have to read [...]<p><a href="http://www.island94.org/2011/10/the-work-itself-isnt-inhumane/">&#9734; Permalink</a></p>


<strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/06/data-divides-and-umbrellafication/' rel='bookmark' title='Data divides and umbrellafication'>Data divides and umbrellafication</a> <small>Jesse Lichtenstein in “Transparency for All”, writing for Wired: The concern that open data may simply empower the empowered is...</small></li>
<li><a href='http://www.island94.org/2011/09/crime-and-data-leadership/' rel='bookmark' title='Crime and Data Leadership'>Crime and Data Leadership</a> <small>This afternoon I have been following the mis-framing of Chicago’s newly announced plan to release 10 years of… not crime...</small></li>
<li><a href='http://www.island94.org/2011/06/data-driven-content-first-design/' rel='bookmark' title='Data-driven, content-first design'>Data-driven, content-first design</a> <small>I’m working on an app for the DonorChoose.org Hacking Education Contest. DonorsChoose works by having teachers submit classroom project/supply needs that...</small></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.island94.org/wp-content/uploads/2011/10/IMG_1116.jpg"><img class="aligncenter size-medium wp-image-2946" title="IMG_1116" src="http://www.island94.org/wp-content/uploads/2011/10/IMG_1116-600x450.jpg" alt="" width="600" height="450" /></a></p>
<p>Two weeks ago, while riding BART from East Bay to San Francisco, I was offered a transit survey from a BART Survey Team member. Pointing to what seemed to me to be a large number of written-response questions for a survey being delivered on a moving train, I asked her, "Do you have to read and code all these surveys?"</p>
<p>The answer was on my mind as I was in the Bay Area to attend a Summit at Code for America, the organization where I will be serving as a Fellow for 2012. During this trip, I observed (though this is by no means a fresh observation) that geeks are drawn to architecting databases, "open" no less, without recognizing or paying much attention to the experience of the people who will be entering and maintaining the data.</p>
<p>As someone who has spent a large amount of life sorting (and resorting) spreadsheets in Excel---and creator of an <a href="http://mappingaccess.org/">open database of my own</a>---the experience of maintaining data isn't particularly liberating (nor is compiling federal grant reports, for that matter). The experience reminds me of what Joel Johnson wrote for <em>Wired Magazine</em> in <a href="http://www.wired.com/magazine/2011/02/ff_joelinchina/all/1">"1 Million Workers. 90 Million iPhones. 17 Suicides. Who’s to Blame?"</a>:</p>
<blockquote><p>"...the work itself isn’t inhumane---unless you consider a repetitive, exhausting, and alienating workplace over which you have no influence or authority to be inhumane. And that would pretty much describe every single manufacturing or burger-flipping job ever."</p></blockquote>
<p>Which is to say that maintaining a database is <em>work</em> and doubly so when the "data driven decisions" are not the maintainers' to make. I think of this whenever the licensing costs of "closed" commercial databases are cited as driving the need for "open"; I agree that data in the public interest should be "free",  but I recognize that freedom comes with a cost.</p>
<p>At the Summit, Tim O'Reilly said, "Data is infrastructure." If that's the case, I hope us data "architects" recognize the experience of people---the data pavers, data plumbers, and data janitors---who maintain such "virtual" infrastructure if we expect to gain greater liberties than our current physical landscape provides.</p>
<p>The BART Survey Team member's answer on that shaky train was "Yes".</p>


<p><strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/06/data-divides-and-umbrellafication/' rel='bookmark' title='Data divides and umbrellafication'>Data divides and umbrellafication</a> <small>Jesse Lichtenstein in “Transparency for All”, writing for Wired: The concern that open data may simply empower the empowered is...</small></li>
<li><a href='http://www.island94.org/2011/09/crime-and-data-leadership/' rel='bookmark' title='Crime and Data Leadership'>Crime and Data Leadership</a> <small>This afternoon I have been following the mis-framing of Chicago’s newly announced plan to release 10 years of… not crime...</small></li>
<li><a href='http://www.island94.org/2011/06/data-driven-content-first-design/' rel='bookmark' title='Data-driven, content-first design'>Data-driven, content-first design</a> <small>I’m working on an app for the DonorChoose.org Hacking Education Contest. DonorsChoose works by having teachers submit classroom project/supply needs that...</small></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.island94.org/2011/10/the-work-itself-isnt-inhumane/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Crime and Data Leadership</title>
		<link>http://www.island94.org/2011/09/crime-and-data-leadership/</link>
		<comments>http://www.island94.org/2011/09/crime-and-data-leadership/#comments</comments>
		<pubDate>Wed, 14 Sep 2011 21:36:30 +0000</pubDate>
		<dc:creator>Ben</dc:creator>
				<category><![CDATA[criticism]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[information]]></category>
		<category><![CDATA[knowledge]]></category>
		<category><![CDATA[management]]></category>

		<guid isPermaLink="false">http://www.island94.org/?p=2829</guid>
		<description><![CDATA[This afternoon I have been following the mis-framing of Chicago's newly announced plan to release 10 years of... not crime statistics as has been reported, but... police incident reports. From the Boston Globe: Chicago to publish crime stats online CHICAGO—Long a city with a reputation for withholding information, Chicago now wants to make public every [...]<p><a href="http://www.island94.org/2011/09/crime-and-data-leadership/">&#9734; Permalink</a></p>


<strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/06/data-divides-and-umbrellafication/' rel='bookmark' title='Data divides and umbrellafication'>Data divides and umbrellafication</a> <small>Jesse Lichtenstein in “Transparency for All”, writing for Wired: The concern that open data may simply empower the empowered is...</small></li>
<li><a href='http://www.island94.org/2011/06/uncrime-mapping/' rel='bookmark' title='Uncrime Mapping'>Uncrime Mapping</a> <small>I really don’t understand the appeal of crime maps. Trulia, a housing search tool, just launched a crime map, too...</small></li>
<li><a href='http://www.island94.org/2011/10/community-mapping-class-interview-outcomes/' rel='bookmark' title='Community mapping class interview &amp; outcomes'>Community mapping class interview &amp; outcomes</a> <small>  Last year I interviewed Richard (Dick) Howe, Lowell’s Registrar of Deeds about the impact of his participation in a...</small></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>This afternoon I have been following the mis-framing of Chicago's newly announced plan to release 10 years of... not crime statistics as has been reported, but... police incident reports. From the <a href="http://www.boston.com/news/nation/articles/2011/09/14/apnewsbreak_chicago_to_publish_crime_stats_online/">Boston Globe</a>:</p>
<blockquote><p><strong>Chicago to publish crime stats online</strong></p>
<p>CHICAGO—Long a city with a reputation for withholding information, Chicago now wants to make public every crime over the past 10 years -- a highly unusual move among the nation's major police departments.</p>
<p>Starting Wednesday, millions of crime statistics dating to 2001 will be posted online in a searchable database. It will be updated daily, providing fodder for residents to evaluate their own neighborhoods, academics to study crime and techie types to create websites or apps.</p></blockquote>
<p>I'll harp on that last sentence in which the "techie type's" activities are described so rudderlessly: this data will fuel <a href="http://www.island94.org/2011/06/uncrime-mapping/">a hundred red and green heatmaps</a>, but probably provide little opportunity for reflection on the true nature of this data---police incident reports---let alone the politics, policies and policing that generated this data in the first place. Just as an example, this is from a WBEZ story entitled "<a href="http://www.wbez.org/story/downside-hiring-more-cops-chicago-90944">The downside of hiring more cops in Chicago</a>":</p>
<blockquote><p>But there are also costs with increasing the number of police on the street and those costs can be tough to measure. "The good intentions of actually creating the uniformed presence to lower the immediate problems of crime may have an unintended result when you're looking further down the line," according to George Gascon. He's the district attorney for the city of San Francisco, and before that he was the chief of police. He says low-income, often minority communities, are flooded with police, and residents are over-criminalized. Lots of people are arrested, sometimes for small infractions.</p>
<p>Kids get criminal records, they're cut off from educational and employment opportunities, and all of that ultimately makes the crime problems worse. "I'm not saying that we should look the other way to crime, to the contrary. What I'm saying is that the strategies that we used in the past have not worked well, and we need to evolve away from that. In many neighborhoods basically we have been at war with our people," Gascon said.</p></blockquote>
<p>An incident does not make an indictment, and an infraction need not be a crime, let alone a conviction; nor should we forget what goes unreported. And yet this dataset is being distributed and---this is the actual problem---represented by the media and (soon) a myriad of websites and apps <em>as full-stop crime</em>.</p>
<p>This is not a criticism of the police or the City of Chicago, who should be commended for being more transparent and making this information more readily available; this is a plea for the media and application makers to appropriately label this data and use it with an understanding of its limitations.</p>
<p>Ben Fry, <a href="http://flowingdata.com/2011/05/13/ben-fry-on-visualization-future-and-data-literacy/">on visualization future and data literacy</a> looks toward the future:</p>
<blockquote><p>"I think the real thing that's going to change is that we're going to start understanding that visualization isn't this sort of monolithic thing... I like to look at it a lot like writing. You have novels and poetry and haikus. You know there's lots of different types of writing and styles of writing — and I think the same thing happens in visualization... some things are tools for analysis and some things are purely for entertainment, and there's not so much a spectrum that there is different ways of addressing it.</p></blockquote>
<p>But this understanding and widespread "data literacy" is not here yet. Its up to us so called techie types to exhibit "Data Leadership" and work to better interpret the and explain the complexity and nuance of our analysis---the absence of which I've <a href="http://www.island94.org/2011/06/data-divides-and-umbrellafication/">complained about before</a>.</p>
<p>Data leadership is appropriately labeling data. Data leadership is presenting data with a recognition of its limitations. Data leadership is consideration for how your presentation of data may be interpreted <em>and responsibility</em> for the consequences.</p>
<p>Data leadership is ultimately a recognition of the broader context of human experience and how information is collected, analyzed and integrated into our lives and decision-making processes---both individually and socially. I realize that's awfully heady for discussing glorified spreadsheets, but to riff off the old chestnut, you can't manage what you misrepresented.</p>
<p><em>(Thanks <a href="http://justinmassa.com">Justin</a>, <a href="http://circuitous.org">Bec</a> and <a href="http://freegeekchicago.org/">David</a> for drawing my attention to this.)</em></p>


<p><strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/06/data-divides-and-umbrellafication/' rel='bookmark' title='Data divides and umbrellafication'>Data divides and umbrellafication</a> <small>Jesse Lichtenstein in “Transparency for All”, writing for Wired: The concern that open data may simply empower the empowered is...</small></li>
<li><a href='http://www.island94.org/2011/06/uncrime-mapping/' rel='bookmark' title='Uncrime Mapping'>Uncrime Mapping</a> <small>I really don’t understand the appeal of crime maps. Trulia, a housing search tool, just launched a crime map, too...</small></li>
<li><a href='http://www.island94.org/2011/10/community-mapping-class-interview-outcomes/' rel='bookmark' title='Community mapping class interview &amp; outcomes'>Community mapping class interview &amp; outcomes</a> <small>  Last year I interviewed Richard (Dick) Howe, Lowell’s Registrar of Deeds about the impact of his participation in a...</small></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.island94.org/2011/09/crime-and-data-leadership/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data divides and umbrellafication</title>
		<link>http://www.island94.org/2011/06/data-divides-and-umbrellafication/</link>
		<comments>http://www.island94.org/2011/06/data-divides-and-umbrellafication/#comments</comments>
		<pubDate>Mon, 20 Jun 2011 02:41:47 +0000</pubDate>
		<dc:creator>Ben</dc:creator>
				<category><![CDATA[civic literacy]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data leadership]]></category>

		<guid isPermaLink="false">http://www.island94.org/?p=2716</guid>
		<description><![CDATA[Jesse Lichtenstein in "Transparency for All", writing for Wired: The concern that open data may simply empower the empowered is not an argument against open data; it's an argument against looking at open data as an end in itself. Massive data dumps and even friendly online government portals are insufficient. Ordinary people need to know [...]<p><a href="http://www.island94.org/2011/06/data-divides-and-umbrellafication/">&#9734; Permalink</a></p>


<strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/09/crime-and-data-leadership/' rel='bookmark' title='Crime and Data Leadership'>Crime and Data Leadership</a> <small>This afternoon I have been following the mis-framing of Chicago’s newly announced plan to release 10 years of… not crime...</small></li>
<li><a href='http://www.island94.org/2011/06/data-driven-content-first-design/' rel='bookmark' title='Data-driven, content-first design'>Data-driven, content-first design</a> <small>I’m working on an app for the DonorChoose.org Hacking Education Contest. DonorsChoose works by having teachers submit classroom project/supply needs that...</small></li>
<li><a href='http://www.island94.org/2011/10/the-work-itself-isnt-inhumane/' rel='bookmark' title='The work itself isn’t inhumane'>The work itself isn’t inhumane</a> <small>Two weeks ago, while riding BART from East Bay to San Francisco, I was offered a transit survey from a...</small></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Jesse Lichtenstein in "Transparency for All", writing for <em>Wired:</em></p>
<blockquote><p>The concern that open data may simply empower the empowered is not an argument against open data; it's an argument against looking at open data as an end in itself. Massive data dumps and even friendly online government portals are insufficient. Ordinary people need to know what information is available, and they need the training to be conversant in it. And if people are to have more than theoretical access to the information, it needs to be easy and cheap to use. That means investing in the kinds of organizations doing outreach, advocacy, and education in the communities least familiar with the benefits of data transparency. If we want truly open government, we still have to do the hard work of addressing basic and stubborn inequalities. However freely it flows, the data alone isn't enough.</p></blockquote>
<p>Yes and yes. Most government data is policy-level, which means to understand and act on that data, you not only need to be data literate, but also civically literate to transform knowledge into power. Call me a constructionist, but community organizing creates a stronger sense of agency than statistics.</p>
<p>And "cheap" data inconsiderately presented can be harmful. The worst data abuses come from trying to use policy-level, non-contextualized or incomplete data to inform individual decision-making. For example, I had to add this caveat to the <a href="http://bostoncyclistsunion.org/resources/crash-map/">Boston Bike Crash Map</a> after getting several anxiety-producing inquiries:</p>
<blockquote><p>This data alone is not appropriate for making routing decisions. The presence or absence of incidents in a location should not be used to determine the relative safety of biking there as this data does not include ridership or traffic information; i.e. a location may contain less incidents because bikers know to avoid it.</p></blockquote>
<p>For decision making, I've come to call this phenomenon the "umbrellafication" of data---after the <a href="http://umbrellatoday.com/">service</a> that boils the weather forecast down to a simple yes/no answer to "Do you need an umbrella today?" Unfortunately, like trying to portray <a href="http://www.island94.org/2011/06/uncrime-mapping/">crime as a spectrum of green to red</a>, issues and datasets that can be easily synthesized and presented are the exception, not the rule.</p>


<p><strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/09/crime-and-data-leadership/' rel='bookmark' title='Crime and Data Leadership'>Crime and Data Leadership</a> <small>This afternoon I have been following the mis-framing of Chicago’s newly announced plan to release 10 years of… not crime...</small></li>
<li><a href='http://www.island94.org/2011/06/data-driven-content-first-design/' rel='bookmark' title='Data-driven, content-first design'>Data-driven, content-first design</a> <small>I’m working on an app for the DonorChoose.org Hacking Education Contest. DonorsChoose works by having teachers submit classroom project/supply needs that...</small></li>
<li><a href='http://www.island94.org/2011/10/the-work-itself-isnt-inhumane/' rel='bookmark' title='The work itself isn’t inhumane'>The work itself isn’t inhumane</a> <small>Two weeks ago, while riding BART from East Bay to San Francisco, I was offered a transit survey from a...</small></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.island94.org/2011/06/data-divides-and-umbrellafication/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data-driven, content-first design</title>
		<link>http://www.island94.org/2011/06/data-driven-content-first-design/</link>
		<comments>http://www.island94.org/2011/06/data-driven-content-first-design/#comments</comments>
		<pubDate>Sat, 04 Jun 2011 05:11:27 +0000</pubDate>
		<dc:creator>Ben</dc:creator>
				<category><![CDATA[contest]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[design]]></category>
		<category><![CDATA[development]]></category>
		<category><![CDATA[layout]]></category>

		<guid isPermaLink="false">http://www.island94.org/?p=2682</guid>
		<description><![CDATA[<p><figure title=""><img src="http://www.island94.org/wp-content/uploads/2011/06/DonorsChoose-TextLengths-600x503.png" class="attachment-h5bp-post-image wp-post-image" alt="DonorsChoose-TextLengths" title="DonorsChoose-TextLengths" /></figure></p>I'm working on an app for the DonorChoose.org Hacking Education Contest. DonorsChoose works by having teachers submit classroom project/supply needs that people can then donate to pay for through the internet. Right now the only way to "share" those projects is through the usual email, Facebook and Twitter; my idea for am app is to create [...]<p><a href="http://www.island94.org/2011/06/data-driven-content-first-design/">&#9734; Permalink</a></p>


<strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/06/print-share-not-everyone-is-a-social-media-ninja-nor-need-they-be/' rel='bookmark' title='Print &amp; Share: not everyone is a social media ninja (nor need they be)'>Print &amp; Share: not everyone is a social media ninja (nor need they be)</a> <small>Today is the deadline for DonorsChoose’s Hacking Education Contest, and fortunately I have completed and submitted Print and Share (with no...</small></li>
<li><a href='http://www.island94.org/2011/07/donorschoose-contest-update-consolation-prize-edition/' rel='bookmark' title='DonorsChoose Contest Update: Consolation Prize Edition'>DonorsChoose Contest Update: Consolation Prize Edition</a> <small>DonorsChoose announced the winners for their Hacking Education contest today and unfortunately Print &amp; Share, the app I developed with...</small></li>
<li><a href='http://www.island94.org/2011/04/belief-based-design/' rel='bookmark' title='Belief-based design'>Belief-based design</a> <small>Matt Webb posted “Inbox Hero” about a month back (via AJ): Rand: The question isn’t who is going to let...</small></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p><figure title=""><img src="http://www.island94.org/wp-content/uploads/2011/06/DonorsChoose-TextLengths-600x503.png" class="attachment-h5bp-post-image wp-post-image" alt="DonorsChoose-TextLengths" title="DonorsChoose-TextLengths" /></figure></p><p><img class="aligncenter size-medium wp-image-2683" title="pdfpulltabs" src="http://www.island94.org/wp-content/uploads/2011/06/pdfpulltabs-600x342.png" alt="" width="600" height="342" /></p>
<p>I'm working on an app for the <a href="http://www.donorschoose.org/hacking-education">DonorChoose.org Hacking Education Contest</a>. DonorsChoose works by having teachers submit classroom project/supply needs that people can then donate to pay for through the internet. Right now the only way to "share" those projects is through the usual email, Facebook and Twitter; my idea for am app is to create custom printable 4-up (quartersheet) flyers for handing out, and pull-tab sheets for posting-up. Despite the "power of teh internets", I think consideration of local resources and capacity is an important value to recognize---and helping teachers and allies advertise their needs (and the positive outcomes they hope to create) more easily within their physical communities would go far towards advancing DonorsChoose.org's mission.</p>
<p>This project has the typical technical difficulties: my goal is to use the DonorsChoose API to fetch a particular project and then populate an HTML+CSS preview---allowing the teacher to then live-edit the text (both for layout and because writing for a local audience is different than an internet one)---and then use <a href="http://code.google.com/p/dompdf/">dompdf</a> to convert that HTML+CSS to a printable PDF. The dompdf library is nifty: it supports CSS 2d transformations, custom fonts, images (necessary for QR code hotness), light positioning (no floats, so tables it is), and media-queries  (@media screen and @media print) to fix all the little layout and typography issues in translating from the web to print, as well as dompdf's own myriad quirks. Basically it's everything I need to make a not-too-ugly flyer and pull-tab sheet; the image at the top of this post is the ugly proof-of-concept.</p>
<p>But the biggest challenge is designing a layout scheme that is flexible to the wide range of DonorsChoose project content. Designing for an 8.5 x 11" sheet of paper is way different than designing for the web: creating a balanced---or at least aesthetically-acceptable---design is no easy matter when there is no such thing as overflow. I hope to get around some issues by providing a live-preview so that teachers can fix any egregious text over-/under- runs, but the goal is to get teachers to Click-Print-Post as quickly as possible.</p>
<p>Fortunately I have a designer (the awesome <a href="http://b.illbrown.com">Billy Brown</a>) helping with the layout, but I need to give him some idea of what to expect. Fortunately DonorsChoose made available project data for ~296,000 projects. So I parsed through the lengths for the 4 main pieces of content I want to use in order to get the distribution of lengths. Sure, saying "Design for a title that is 10-50 characters" doesn't have the highest specificity, but it's a whole lot more useful than the alternative of blind experimentation. The data below is charts of those lengths---I probably will have to limit my data to just the past year or so since DonorsChoose has changed their requirements/text-fields over time, but its interesting so far to see the full distributions.</p>
<p><a href="http://www.island94.org/wp-content/uploads/2011/06/DonorsChoose-TextLengths.png"><img class="aligncenter size-medium wp-image-2684" title="DonorsChoose-TextLengths" src="http://www.island94.org/wp-content/uploads/2011/06/DonorsChoose-TextLengths-600x503.png" alt="" width="600" height="503" /></a></p>


<p><strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/06/print-share-not-everyone-is-a-social-media-ninja-nor-need-they-be/' rel='bookmark' title='Print &amp; Share: not everyone is a social media ninja (nor need they be)'>Print &amp; Share: not everyone is a social media ninja (nor need they be)</a> <small>Today is the deadline for DonorsChoose’s Hacking Education Contest, and fortunately I have completed and submitted Print and Share (with no...</small></li>
<li><a href='http://www.island94.org/2011/07/donorschoose-contest-update-consolation-prize-edition/' rel='bookmark' title='DonorsChoose Contest Update: Consolation Prize Edition'>DonorsChoose Contest Update: Consolation Prize Edition</a> <small>DonorsChoose announced the winners for their Hacking Education contest today and unfortunately Print &amp; Share, the app I developed with...</small></li>
<li><a href='http://www.island94.org/2011/04/belief-based-design/' rel='bookmark' title='Belief-based design'>Belief-based design</a> <small>Matt Webb posted “Inbox Hero” about a month back (via AJ): Rand: The question isn’t who is going to let...</small></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.island94.org/2011/06/data-driven-content-first-design/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Notes from the City of Boston’s Open311 / Citizens Connect API Developer Meeting</title>
		<link>http://www.island94.org/2011/01/notes-from-the-city-of-bostons-open311-developer-meeting/</link>
		<comments>http://www.island94.org/2011/01/notes-from-the-city-of-bostons-open311-developer-meeting/#comments</comments>
		<pubDate>Tue, 01 Feb 2011 00:54:16 +0000</pubDate>
		<dc:creator>Ben</dc:creator>
				<category><![CDATA[Boston]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[govdata]]></category>
		<category><![CDATA[government]]></category>
		<category><![CDATA[meeting]]></category>

		<guid isPermaLink="false">http://www.island94.org/?p=2463</guid>
		<description><![CDATA[Today I attended a developer meeting at Boston City Hall for their Citizen Connect API, a to-be-launched Open311 implementation. The city currently has official iPhone and Android apps that allow community members to submit broken streetlights, potholes, graffiti and snow removal, but the intent of the "open" part is to allow unaffiliated developers to integrate [...]


<strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/08/7-years-in-boston/' rel='bookmark' title='7 years in Boston'>7 years in Boston</a> <small>This August marks the completion of my 7th year in Boston, with the loose exceptions of the 1 month I...</small></li>
<li><a href='http://www.island94.org/2011/05/apps-off-the-approved-vendor-list/' rel='bookmark' title='Apps off the approved vendor list'>Apps off the approved vendor list</a> <small>I ran across a year-old article I had bookmarked from GovTech entitled “Do Apps for Democracy and Other Contests Create...</small></li>
<li><a href='http://www.island94.org/2011/09/crime-and-data-leadership/' rel='bookmark' title='Crime and Data Leadership'>Crime and Data Leadership</a> <small>This afternoon I have been following the mis-framing of Chicago’s newly announced plan to release 10 years of… not crime...</small></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Today I attended a developer meeting at Boston City Hall for their <a href="http://www.cityofboston.gov/doit/apps/citizensconnect.asp">Citizen Connect API</a>, a to-be-launched <a href="http://open311.org/">Open311</a> implementation. The city currently has official <a href="http://itunes.apple.com/us/app/boston-citizens-connect/id330894558">iPhone</a> and <a href="http://www.androidzoom.com/android_applications/tools/boston-citizens-connect_najk.html">Android</a> apps that allow community members to submit broken streetlights, potholes, graffiti and snow removal, but the intent of the "open" part is to allow unaffiliated developers to integrate the system into their own applications. The other developers attending were the 7-person Boston <a href="http://codeforamerica.org/boston/">Code for America</a> crew (who just arrived a few days before), <a href="http://www.seeclickfix.com/">SeeClickFix</a> and a university researcher (the latter 2 via teleconference). </p>
<p>The City of Boston uses a <a href="http://www.lagan.com/">Lagan CRM</a> system to create and track tickets and cases. To feed that ticketing system the city offers <a href="http://www.cityofboston.gov/mayor/24/">constituent services</a> in person, via telephone (the meeting was held next to the call center which had ~10 agents active at the time), via the web, and through the smartphone apps. The Lagan system tracks 150-170 types of tickets, but Boston currently exposes only 6 of them through Open311 API (streetlights, potholes, graffiti, 2 types of snow removal, and other); this decision was explained as being driven by the UI needs of the official smartphone apps. The Open311 system is only a data bridge into the Lagan CRM and thus won't support any additional metadata or external decisioning (this dismissed a Code for America fellow's suggestion of voting on tickets).</p>
<p>The API is not currently available. The city estimates 2 - 3 weeks until they have a test server up, and from there they will evaluate whether to give applications access to the live system. The test server will be a sandbox that is either refreshed every 24 hours, has new data streamed to it, or may simulate workflows (e.g. submit, review, comment, close)---it was still in discussion.</p>
<p>In addition to the Open311 API, the city also offers data dumps of its entire ticketing system, offset by 24 hours. Unfortunately, those data dumps don't include the channel through which they were inputted, e.g. it's not recorded whether the ticket came thru Open311, telephone, web or in person. This is allegedly through the city's <a href="http://hubmaps1.cityofboston.gov/datahub/">Data Hub</a> / <a href="http://www.cityofboston.gov/doit/databoston/app/data.aspx">Data Dashboard</a>, but I can't find it.</p>
<p>The City of Boston is taking a more deliberate and restrictive approach than <a href="http://www.eot.state.ma.us/developers/">MassDOT/MBTA</a> in opening up their data, though CRM tickets are clearly different than bus/train route and dispatch data. The university researcher's (Ben Clark) use case for the data was spot-on: determining who is utilizing these smartphone tools, and importantly who isn't. Time will tell how the influence of outside software developers will push the city's implementation---and how will it effect less-technology focused solutions.</p>
<p>I was really excited to meet the Code for America crew as they are bringing a lot of excitement and energy to the gov data scene. I did get the impression that they were unprepared for managing institutional forces: there was a question about why the city couldn't devote more IT resources to the project that were answered with some allusions to Dilbert (without acknowledging that the current capital budget was probably set 14 months ago). Open311 doesn't seem like CfA's <a href="http://codeforamerica.org/boston/">primary focus</a> in Boston, but if this was their first City of Boston meeting, I think they learned a lot.</p>


<p><strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/08/7-years-in-boston/' rel='bookmark' title='7 years in Boston'>7 years in Boston</a> <small>This August marks the completion of my 7th year in Boston, with the loose exceptions of the 1 month I...</small></li>
<li><a href='http://www.island94.org/2011/05/apps-off-the-approved-vendor-list/' rel='bookmark' title='Apps off the approved vendor list'>Apps off the approved vendor list</a> <small>I ran across a year-old article I had bookmarked from GovTech entitled “Do Apps for Democracy and Other Contests Create...</small></li>
<li><a href='http://www.island94.org/2011/09/crime-and-data-leadership/' rel='bookmark' title='Crime and Data Leadership'>Crime and Data Leadership</a> <small>This afternoon I have been following the mis-framing of Chicago’s newly announced plan to release 10 years of… not crime...</small></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.island94.org/2011/01/notes-from-the-city-of-bostons-open311-developer-meeting/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Public data vs. Self-reporting</title>
		<link>http://www.island94.org/2010/12/public-data-vs-self-reporting/</link>
		<comments>http://www.island94.org/2010/12/public-data-vs-self-reporting/#comments</comments>
		<pubDate>Thu, 02 Dec 2010 19:54:44 +0000</pubDate>
		<dc:creator>Ben</dc:creator>
				<category><![CDATA[access]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[digital divide]]></category>
		<category><![CDATA[maps]]></category>
		<category><![CDATA[participation gap]]></category>
		<category><![CDATA[public data]]></category>

		<guid isPermaLink="false">http://www.island94.org/?p=2322</guid>
		<description><![CDATA[A comment by Pete Stidman of the Boston Cyclists Union, in response to a proposal to build a Chicago Bike Crash Map: I would advise against self reporting—because what you get is only crashes within whatever bike subculture is reached by the map and by the web. Boston did that before our map and the [...]


<strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/06/data-divides-and-umbrellafication/' rel='bookmark' title='Data divides and umbrellafication'>Data divides and umbrellafication</a> <small>Jesse Lichtenstein in “Transparency for All”, writing for Wired: The concern that open data may simply empower the empowered is...</small></li>
<li><a href='http://www.island94.org/2011/09/crime-and-data-leadership/' rel='bookmark' title='Crime and Data Leadership'>Crime and Data Leadership</a> <small>This afternoon I have been following the mis-framing of Chicago’s newly announced plan to release 10 years of… not crime...</small></li>
<li><a href='http://www.island94.org/2011/06/data-driven-content-first-design/' rel='bookmark' title='Data-driven, content-first design'>Data-driven, content-first design</a> <small>I’m working on an app for the DonorChoose.org Hacking Education Contest. DonorsChoose works by having teachers submit classroom project/supply needs that...</small></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>A <a href="http://www.stevevance.net/planning/i-want-to-make-a-crash-reporting-tool/#comment-106046851">comment</a> by Pete Stidman of the Boston Cyclists Union, in response to a <a href="http://www.stevevance.net/planning/i-want-to-make-a-crash-reporting-tool/">proposal to build a Chicago Bike Crash Map</a>:</p>
<blockquote><p>I would advise against self reporting—because what you get is only crashes within whatever bike subculture is reached by the map and by the web. Boston did that before our map and the spread of accidents across the city is very different in each—and could give people the wrong ideas about priority streets and areas. (This is particularly true in communities of color, which were heavily under-reported in Boston's self-reported crash survey.)</p></blockquote>
<p>Boston's <a href="http://bostoncyclistsunion.org/resources/crash-map/">bike crash map</a> is generated from (somewhat-) public data on on ambulance runs and EMT reports; Portland has a <a href="http://bikeportland.org/closecall/home.php">self-reported bike incident map</a>.</p>


<p><strong>Related posts:</strong><ol><li><a href='http://www.island94.org/2011/06/data-divides-and-umbrellafication/' rel='bookmark' title='Data divides and umbrellafication'>Data divides and umbrellafication</a> <small>Jesse Lichtenstein in “Transparency for All”, writing for Wired: The concern that open data may simply empower the empowered is...</small></li>
<li><a href='http://www.island94.org/2011/09/crime-and-data-leadership/' rel='bookmark' title='Crime and Data Leadership'>Crime and Data Leadership</a> <small>This afternoon I have been following the mis-framing of Chicago’s newly announced plan to release 10 years of… not crime...</small></li>
<li><a href='http://www.island94.org/2011/06/data-driven-content-first-design/' rel='bookmark' title='Data-driven, content-first design'>Data-driven, content-first design</a> <small>I’m working on an app for the DonorChoose.org Hacking Education Contest. DonorsChoose works by having teachers submit classroom project/supply needs that...</small></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.island94.org/2010/12/public-data-vs-self-reporting/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Good enough data</title>
		<link>http://www.island94.org/2010/02/good-enough-data/</link>
		<comments>http://www.island94.org/2010/02/good-enough-data/#comments</comments>
		<pubDate>Tue, 23 Feb 2010 01:10:51 +0000</pubDate>
		<dc:creator>Ben</dc:creator>
				<category><![CDATA[ability]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[parsing]]></category>
		<category><![CDATA[portfolio]]></category>
		<category><![CDATA[yawn]]></category>

		<guid isPermaLink="false">http://www.island94.org/?p=1776</guid>
		<description><![CDATA[<p><figure title=""><img src="http://www.island94.org/wp-content/uploads/2010/02/btop-map-combined-600x554.png" class="attachment-h5bp-post-image wp-post-image" alt="btop-map-combined" title="btop-map-combined" /></figure></p>I’ve been spending some time at work scraping data. Long story short: government transparency is not transparent when the only access they give you is a pile of poorly structured html. That’s better than government opacity but not past the level of frosted glass: titillating but unsatisfying. If your expected audience is pencil pushers, please [...]<p><a href="http://www.island94.org/2010/02/good-enough-data/">&#9734; Permalink</a></p>


No related posts.]]></description>
			<content:encoded><![CDATA[<p><figure title=""><img src="http://www.island94.org/wp-content/uploads/2010/02/btop-map-combined-600x554.png" class="attachment-h5bp-post-image wp-post-image" alt="btop-map-combined" title="btop-map-combined" /></figure></p><p><img class="aligncenter size-medium wp-image-1783" title="btop-map-combined" src="http://www.island94.org/wp-content/uploads/2010/02/btop-map-combined-500x462.png" alt="" width="500" height="462" /></p>
<p>I’ve been spending some time at work scraping data. Long story short: government transparency is not transparent when the only access they give you is a pile of poorly structured html. That’s better than government opacity but not past the level of frosted glass: titillating but unsatisfying. If your expected audience is pencil pushers, please release your data in a spreadsheet. <a href="http://transmissionproject.org/current/2009/11/ntia-broadband-access-data">That’s what I did</a>.</p>
<p>Notes for nerds:</p>
<p><strong>Regular Expressions vs. Parsing Engines: </strong>I wrote a the first parser in Python with Regular Expressions, then rewrote it in BeautifulSoup (a Python parser). It took me about 2 hours to write it the first time with RegExp. It took me about 2 days to do it with BeautifulSoup. It’s slightly easier to maintain now, but you tell me which one is more semantically correct:</p>
<p><code>project_title = re.search('&lt;tr&gt;&lt;td&gt;&lt;b&gt;Project&amp;nbsp;title&lt;/b&gt;&lt;/td&gt;&lt;td&gt;(.+)&lt;/td&gt;&lt;/tr&gt;', line)</code></p>
<p>versus</p>
<p><code>project_title = app.find(text="Project&amp;nbsp;title").parent.parent.nextSibling.string</code></p>
<p>Yep, it’s written in 2-column tables with each row being a different data-set: the first column holds a key (if there is a key; sometimes there isn’t) and the second column being the data . With RegExp, I know exactly what I’m looking for. With the parser, I have to find the element in the tree, then traverse up, over and down (if there isn’t a key, I have to go up, up, over, over, over, down, over, down). The data itself is a big set of applications (about 2000+ total) and each application has about 15 different data-sets (some with keys, some just follow a consistent-ish pattern).</p>
<p>Fortunately, I have an <a href="http://www.media-democracy.net/">appreciative audience</a> for my troubles and it lets me <a href="http://transmissionproject.org/current/2010/2/btop-applications-and-awards-by-state">draw pretty maps</a> like the ones above. Also <a href="http://flowingdata.com/2009/11/12/how-to-make-a-us-county-thematic-map-using-free-tools/">done with Python</a> by parsing an SVG vector image.</p>
<p><strong>Michigan boaters beware</strong>: there is now an isthmus between Mackinaw City and St. Ignace. Rather than rewrite the process for grouped-shapes—Michigan being in 2 parts—it was good enough to make Michigan 1. Hawaii somehow endured.</p>


<p>No related posts.</p>]]></content:encoded>
			<wfw:commentRss>http://www.island94.org/2010/02/good-enough-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic
Page Caching using disk: enhanced

Served from: www.island94.org @ 2012-02-03 22:01:23 -->
