<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Gothos</title>
	<atom:link href="http://gothos.info/feed/" rel="self" type="application/rss+xml" />
	<link>http://gothos.info</link>
	<description>A Geospatial Librarian's World</description>
	<lastBuildDate>Mon, 26 Jul 2010 13:18:07 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>NYC Subway and Transit GIS Layers</title>
		<link>http://gothos.info/2010/07/nyc-subway-and-transit-gis-layers/</link>
		<comments>http://gothos.info/2010/07/nyc-subway-and-transit-gis-layers/#comments</comments>
		<pubDate>Sat, 24 Jul 2010 21:50:50 +0000</pubDate>
		<dc:creator>Frank</dc:creator>
				<category><![CDATA[Data Sources]]></category>
		<category><![CDATA[Resources]]></category>
		<category><![CDATA[ArcGIS]]></category>
		<category><![CDATA[bus]]></category>
		<category><![CDATA[gis data]]></category>
		<category><![CDATA[mta nyc]]></category>
		<category><![CDATA[new york city]]></category>
		<category><![CDATA[qgis]]></category>
		<category><![CDATA[subway]]></category>
		<category><![CDATA[transit]]></category>

		<guid isPermaLink="false">http://gothos.info/?p=522</guid>
		<description><![CDATA[I&#8217;ve started outlining a one-day, introductory GIS practicum / workshop that I hope to offer in the coming academic year. One of the primary examples I want to use in the workshop is site selection for a retail store, and I thought it would be great to use a subway layer as part of the [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve started outlining a one-day, introductory GIS practicum / workshop that I hope to offer in the coming academic year. One of the primary examples I want to use in the workshop is site selection for a retail store, and I thought it would be great to use a subway layer as part of the exercise. But alas, I searched high and low for a layer late last year (for a site selection project) and couldn&#8217;t find a publicly available one. I had purchased some proprietary layers, but really don&#8217;t want to use them for this workshop because I want to be able to freely distribute all of the materials to anyone; the layer I purchased is also outdated now because the MTA cut many services (including two subway lines) last month.</p>
<p>But thanks to Steve Romalewski at the CUNY Mapping Service, there&#8217;s now an alternative! Steve&#8217;s work is a HUGE contribution to the GIS community in New York and fills a glaring hole in the city&#8217;s collection of freely available GIS data. The MTA does host a <a href="http://mta.info/developers/download.html" target="_blank">data feed service</a> (based on the General Transit Feed Specification created by Google) where it provides the geography of all its transit services, among other things. Steve downloaded and processed this raw data and turned it into shapefiles. He quickly discovered that it required a fair amount of scrubbing to be usable, and he&#8217;s cleaned it up and documented the entire process in great detail in several posts on his blog (<a href="http://spatialityblog.com/" target="_blank">Spatiality</a>). Links to download individual shapefiles are available at the bottom of each post, following his discussion of issues and methodology for each set of layers. The CUNY Center for Urban Research has created an index page with each post, which you can <a href="http://www.urbanresearch.org/resources/subway-data-gis-format" target="_blank">access here</a>.</p>
<p>In addition, he&#8217;s created a lyr file for the subway lines in order to symbolize them correctly by color and a separate mxd file for labels. While the shapefiles represent where the lines are, there are some problems representing them as they appear cartographically on the MTA&#8217;s subway maps. Many lines, including some with different colors, share the same trunk line. For example the A and C trains (blue lines) share the same trunk with the B and D trains (orange lines) along 8th Ave from 59th St to 145th St. Depending on how you sort your symbol categories, you&#8217;ll only see one color (and line) depending on which one you have on top. Steve points out two ways for solving this issue &#8211; you can edit the geography and offset one of the lines, which is tedious and creates problems as you change scale (he has some great screen shots that depict this). If you&#8217;re using ArcGIS, he shows off some cartographic tools that you can use to offest lines by prioritizing values in the attribute table. This is more ideal, as it gives the illusion that the lines are side by side cartographically while keeping the geometry of the shapefile intact.</p>
<p>So if you&#8217;re using ArcGIS you&#8217;ll be good to go. I&#8217;ve downloaded the files to play around with, but as I&#8217;m at home and using QGIS I had some more work to do, since lyr and mxd files are proprietary ESRI formats that the open source packages can&#8217;t handle. I&#8217;ve assigned the appropriate colors to each subway line and saved them a QGIS style file (.qml), which you can import in the symbology window to quickly and easily get the right colors (which I plucked from the MTA&#8217;s website). I&#8217;ve also saved the RGB and hex values for each line in a text file, if you&#8217;re using some other GIS software and need to input them manually. As far as I know there isn&#8217;t an easy way to circumvent the shared-line subway problem if you&#8217;re using QGIS (see screenshot below), so you&#8217;d have your work cut out for you if you want to faithfully represent the lines the way they appear on the MTA maps. But if you&#8217;re using the layers for analysis (which is what I&#8217;ll be doing) or you don&#8217;t need to emulate &#8220;the&#8221; subway map in exact detail, it shouldn&#8217;t matter.</p>
<ul>
<li><a href="http://gothos.info/resource_files/nyc_subway_colors.qml" target="_blank">NYC subway QGIS style file (.qml)</a></li>
<li><a href="http://gothos.info/resource_files/nyc_subway_colors.txt" target="_blank">NYC subway colors RGB and Hex (.txt)</a></li>
</ul>
<div id="attachment_525" class="wp-caption aligncenter" style="width: 616px"><a href="http://gothos.info/wp-content/uploads/2010/07/nycsub_qgis.png"><img class="size-full wp-image-525 " title="nycsub_qgis" src="http://gothos.info/wp-content/uploads/2010/07/nycsub_qgis.png" alt="NYC subway layers from CUNY Mapping Service in QGIS" width="606" height="504" /></a><p class="wp-caption-text">NYC subway layers from CUNY Mapping Service in QGIS</p></div>
<p>Footnote &#8211; for anyone who is interested, the proprietary data that I purchase for the college is from a company called <a href="http://www.halcrow.com/maps/products/gis_data.htm" target="_blank">Halcrow</a>. The entire NYC transportation package costs $465. It includes NYC subways and buses (lines and stations for each, along with ridership statistics from 2008 and a historical bus stops layer from 1998), LIRR and Metro North (lines and stations), but also includes the PATH train, freight lines, and truck routes.</p>
]]></content:encoded>
			<wfw:commentRss>http://gothos.info/2010/07/nyc-subway-and-transit-gis-layers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Learning Python at PyCamp</title>
		<link>http://gothos.info/2010/06/learning-python-at-pycamp/</link>
		<comments>http://gothos.info/2010/06/learning-python-at-pycamp/#comments</comments>
		<pubDate>Thu, 10 Jun 2010 19:39:09 +0000</pubDate>
		<dc:creator>Frank</dc:creator>
				<category><![CDATA[Data Processing]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Resources]]></category>
		<category><![CDATA[programming]]></category>
		<category><![CDATA[pycamp]]></category>
		<category><![CDATA[python]]></category>

		<guid isPermaLink="false">http://gothos.info/?p=513</guid>
		<description><![CDATA[I got back from leave a couple week ago, and spent part of it at a Python boot camp. I&#8217;ve gotten tired of hacking away at data in spreadsheets and read in several places that Python is a good language to learn for beginning programmers &#8211; it&#8217;s also open source, flexible, and is used by [...]]]></description>
			<content:encoded><![CDATA[<p>I got back from leave a couple week ago, and spent part of it at a Python boot camp. I&#8217;ve gotten tired of hacking away at data in spreadsheets and read in several places that Python is a good language to learn for beginning programmers &#8211; it&#8217;s also open source, flexible, and is used by many in the GIS community for processing data and building plugins and software (the instructor for the camp, Chris Calloway, pointed me to this presentation on <a href="http://proceedings.esri.com/library/userconf/devsummit09/papers/pythonscriptingadvancedtechniques.pdf target="_blank"">Python scripting techniques for ArcGIS</a>).</p>
<p>The workshop was a three-day event hosted at Penn State by the Triangle Zope and Python Users Group (<a href="http://trizpug.org/" target="_blank">TriZPUG</a>). It was geared towards beginners and non-programmers (although many of my fellow classmates were IT and systems people) and provided a pretty thorough review of all of the elements of the language &#8211; now it&#8217;s up to me to tie it all together! The price was extremely reasonable (only $300 for a 3 day class!) and I&#8217;d certainly recommend it if there&#8217;s a camp in your area; although I would also recommend reading a book or taking a tutorial to familiarize yourself with the basics BEFORE attending the class; I did, and as a result I think I got more out of it than I would have had going in cold.</p>
<p>The next <a href="http://trizpug.org/boot-camp/" target="_blank">PyCamp</a> is being held in LA in a few days, and the following one will be in Toronto from Aug 30th to Sept 3rd (although this isn&#8217;t posted on the website yet); the normal workshop is a five day affair, the one I attended was a mini 3 day version which suited my needs pretty well.</p>
<p>There are tons of Python tutorials on the web and Python&#8217;s site is pretty definitive. If you&#8217;re looking for a book, I&#8217;d recommend <a href="http://www.worldcat.org/title/practical-programming-an-introduction-to-computer-science-using-python/oclc/251217630" target="_blank">Practical Programming: An Introduction to Computer Science Using Python</a>. Unlike the &#8220;Learn Language X&#8221; books, this one introduces you to general theory and practice in programming, and the authors illustrate the applications with practical examples using Python &#8211; it&#8217;s been immensely helpful to me. Now that I&#8217;m around the initial learning curve, I&#8217;ve been relying more on <a href="http://www.worldcat.org/title/beginning-python-from-novice-to-professional/oclc/62150088" target="_blank">Beginning Python: From Novice to Professional</a>, which is better as a reference book and good for illustrating many of the uses for individual objects, methods, etc (which I had a hard time grasping before I covered the basics of programming).</p>
]]></content:encoded>
			<wfw:commentRss>http://gothos.info/2010/06/learning-python-at-pycamp/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google Maps to Create a Census Finding Aid</title>
		<link>http://gothos.info/2010/05/google-maps-to-create-a-census-finding-aid/</link>
		<comments>http://gothos.info/2010/05/google-maps-to-create-a-census-finding-aid/#comments</comments>
		<pubDate>Thu, 13 May 2010 20:19:17 +0000</pubDate>
		<dc:creator>Frank</dc:creator>
				<category><![CDATA[Data Processing]]></category>
		<category><![CDATA[Maps]]></category>
		<category><![CDATA[Resources]]></category>
		<category><![CDATA[american community survey]]></category>
		<category><![CDATA[census data]]></category>
		<category><![CDATA[google maps]]></category>
		<category><![CDATA[kml]]></category>
		<category><![CDATA[new york city]]></category>
		<category><![CDATA[pumas]]></category>

		<guid isPermaLink="false">http://gothos.info/?p=472</guid>
		<description><![CDATA[Yikes! It&#8217;s been quite awhile since my last post (the past couple months have been a little tough for me), but I just finished an interesting project that I can share.
I constantly get questions from students who are interested in getting recent demographic and socio-economic profiles for neighborhoods in New York City. The problem is [...]]]></description>
			<content:encoded><![CDATA[<p>Yikes! It&#8217;s been quite awhile since my last post (the past couple months have been a little tough for me), but I just finished an interesting project that I can share.</p>
<p>I constantly get questions from students who are interested in getting recent demographic and socio-economic profiles for neighborhoods in New York City. The problem is that neighborhoods are not officially defined, so we have to look for a surrogate. The City has created neighborhood-like areas out of census tracts called community districts and they publish profiles for them, but this data is from the decennial census  and not current enough for their needs.  ZIP code data is also only available from the decennial census.</p>
<p>We can use PUMAs (Public Use Microdata Areas) to approximate neighborhoods in large cities, and they are published as part of the 3 year estimates of the American Community Survey. The problem is, in order to look up the data from the census you need to search by PUMA number &#8211; there are no qualitative place names. The city and the census have worked together to assign names to neighborhoods as part of the <a href="http://www.census.gov/hhes/www/housing/nychvs/nychvs.html" target="_blank">NYC Housing and Vacancy Survey</a>, but this is the only place (I&#8217;ve found) that uses these names. You need to look in several places to figure out what the PUMA number and boundaries for an area are and then navigate through the census site to find it. Too much for the average student who visits me at the reference desk or emails me looking for data.</p>
<p>My solution was to create a finding aid in Google maps that tied everything together:<br />
<iframe width="425" height="350" frameborder="0" scrolling="no" marginheight="0" marginwidth="0" src="http://maps.google.com/maps?f=q&amp;source=embed&amp;hl=en&amp;geocode=&amp;q=http:%2F%2Fwww.baruch.cuny.edu%2Fgeoportal%2Fkml%2Fpumas_nyc.kml&amp;sll=37.0625,-95.677068&amp;sspn=31.922255,56.513672&amp;ie=UTF8&amp;ll=40.697488,-73.979681&amp;spn=0.364413,0.583649&amp;output=embed"></iframe><br /><small><a href="http://maps.google.com/maps?f=q&amp;source=embed&amp;hl=en&amp;geocode=&amp;q=http:%2F%2Fwww.baruch.cuny.edu%2Fgeoportal%2Fkml%2Fpumas_nyc.kml&amp;sll=37.0625,-95.677068&amp;sspn=31.922255,56.513672&amp;ie=UTF8&amp;ll=40.697488,-73.979681&amp;spn=0.364413,0.583649" style="color:#0000FF;text-align:left">View Larger Map</a></small></p>
<p><a href="http://gothos.info/wp-content/uploads/2010/05/puma_kml_4.png"><img class="size-thumbnail wp-image-491  alignleft" style="margin: 10px;" title="puma_kml_4" src="http://gothos.info/wp-content/uploads/2010/05/puma_kml_4-150x150.png" alt="" width="150" height="150" /></a> I downloaded PUMA boundaries from the Census TIGER file site in a shapefile format. I opened them up in ArcGIS and used an excellent script that I downloaded called <a href="http://arcscripts.esri.com/details.asp?dbid=14273" target="_blank">Export to KML</a>. ArcGIS 9.3 does support KML exports via the toolbox, and there are a number of other scripts and stand-alone programs that can do this (I tried several) but Export to KML was best (assuming you have access to ArcGIS) in terms of the level of customization and the thoroughness of the user documentation. I symbolized the PUMAs in ArcGIS using the colors and line thickness that I wanted and fired up the tool. It allows you to automatically group and color features based on the layer&#8217;s symbology. I was able to add a &#8220;snippet&#8221; to each feature to help identify it (I used the PUMA number as the attribute name and the neighborhood name as my snippet, so both appear in the legend) and added a description that would appear in the pop up window when that feature is clicked. In that description, I added the URL from the ACS census profile page for a particular PUMA &#8211; the cool part here is that the URL is consistent and contains the PUMA number. So, I replaced the specific number and inserted the [field] name from the PUMAs attribute table that contained the number. When I did the export, the URLs for each individual feature were created with their PUMA number inserted into the link.</p>
<p>There were a few quirks &#8211; I discovered that you can&#8217;t automatically display labels on a Google Map without subterfuge, like creating the <a href="http://www.google.com/support/forum/p/maps/thread?tid=489ca54d8665228a&amp;hl=en" target="_blank">labels as images and not text</a>. Google Earth (but not Maps) supports labels if you <a href="http://www.google.com/support/forum/p/earth/thread?tid=5b0422ebdb13f5ea&amp;hl=en" target="_blank">create multi-geometry</a> where you have a point for a label and a polygon for the feature. If you select a labeling attribute on the initial options screen of the Export to KML tool, you create an icon in the middle of each polygon that has a different description pop-up (which I didn&#8217;t want so I left it to none and lived without labels). I made my features 75% transparent (a handy feature of Export to KML) so that you could see the underlying Google Map features through the PUMA, but this made the fill AND the lines transparent, making the features too difficult to see. After the export I opened the KML in a text editor and changed the color values for the lines / boundaries by hand, which was easy since the styles are saved by feature group (boroughs) and not by individual feature (pumas). I also manually changed the value of the folder open element (from 0 to 1) so that the feature and feature groups (pumas and boroughs) are expanded by default when someone opens the map.</p>
<p>After making the manual edits, I uploaded the KML to my webserver and pasted the url for it into the Google Maps search box, which overlayed my KML on the map. Then I was able to get a persistent link to the map and code for embedding it into websites via the Google Map Interface. No need to add it to Google My Maps, as I have my own space. One big quirk &#8211; it&#8217;s difficult to make changes to an existing KML once you&#8217;ve uploaded and displayed it. After I uploaded what I thought would be my final version I noticed a typo. So I fixed it locally, uploaded the KML and overwrote the old one. But &#8211; the changes I made didn&#8217;t appear. I tried reloading and clearing the cache in my browser, but no good &#8211; once the KML is uploaded and Google caches it, you won&#8217;t see any of your changes until Google re-caches. The conventional wisdom is to change the name of the file every single time &#8211; which is pretty dumb as you&#8217;ll never be able to have a persistent link to anything. There are <a href="http://www.google.com/support/forum/p/maps/thread?tid=5a442227daf9e5fd&amp;hl=en" target="_blank">ways to circumvent the problem</a>, or you can just wait it out. I waited one day and by the next the file was updated; good enough for me, as I&#8217;ll only need to update it once a year.</p>
<p>I&#8217;m hosting the map, along with some static PDF maps and a spreadsheet of PUMA names and neighborhood numbers, from the <a href="http://guides.newman.baruch.cuny.edu/content.php?pid=95819&amp;sid=985209" target="_blank">NYC Data LibGuide</a> I created (part of my college&#8217;s collection of <a href="http://guides.newman.baruch.cuny.edu/index.php" target="_blank">research guides</a>). If you&#8217;re looking for neighborhood names to associate with PUMA numbers for your city, you&#8217;ll have to hunt around and see if a local planning agency or non-profit has created them for a project or research study (as the Census Bureau does not create them). For example, the County of Los Angeles Department of Mental Health uses pumas in a <a href="http://dmh.lacounty.gov/AboutDMH/MHSA/MHSA_Plans/PEI/Data/service_area_profiles.html" target="_blank">large study</a> they did where they associated local place names with each puma.</p>
<p>If you&#8217;re interested in dabbling in some KML, there&#8217;s <a href="http://code.google.com/apis/kml/documentation/kml_tut.html" target="_blank">Google&#8217;s KML tutorial</a>. I&#8217;d also recommend <a href="http://www.worldcat.org/title/kml-handbook-geographic-visualization-for-the-web/oclc/227921862" target="_blank">The KML Handbook</a> by Josie Wernecke. The catch for any guide to KML is that while all KML elements are supported by Google Earth, there&#8217;s only <a href="http://code.google.com/apis/kml/documentation/mapsSupport.html" target="_blank">partial support for Google Maps</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gothos.info/2010/05/google-maps-to-create-a-census-finding-aid/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Track 2010 Census Participation Rates</title>
		<link>http://gothos.info/2010/03/track-2010-census-participation-rates/</link>
		<comments>http://gothos.info/2010/03/track-2010-census-participation-rates/#comments</comments>
		<pubDate>Tue, 30 Mar 2010 15:57:37 +0000</pubDate>
		<dc:creator>Frank</dc:creator>
				<category><![CDATA[Maps]]></category>
		<category><![CDATA[2010 Census]]></category>
		<category><![CDATA[census]]></category>
		<category><![CDATA[Take 10 Map]]></category>
		<category><![CDATA[united states]]></category>

		<guid isPermaLink="false">http://gothos.info/?p=467</guid>
		<description><![CDATA[The 2010 Census is in full swing &#8211; the target date of April 1st is coming up soon. I mailed my form back last week. If you&#8217;re curious as to how many others have mailed theirs back, check out the bureau&#8217;s interactive Take 10 Map. Built on top of a Google Map interface, it allows [...]]]></description>
			<content:encoded><![CDATA[<p>The 2010 Census is in full swing &#8211; the target date of April 1st is coming up soon. I mailed my form back last week. If you&#8217;re curious as to how many others have mailed theirs back, check out the bureau&#8217;s interactive <a href="http://2010.census.gov/2010census/take10map/" target="_blank">Take 10 Map</a>. Built on top of a Google Map interface, it allows you to track participation rates by state, county, place, reservation, and census tract. You can zoom in to change the scale and select different geography, or enter a zip code, city, or state to zoom to an area of choice. Clicking on an area will display it&#8217;s participation rate to date, compared to the state and national rates.</p>
<p>Data is updated daily, Monday through Friday. Once you click on a particular area, if you click the Track Participation Rate link it will create a widget that you can embed in a website to provide the updated rate. Unlike a lot of the other interactive web maps floating around these days, the bureau does give you the ability to download the actual data behind the map, if you want to do some analysis of your own.</p>
<p><a href="http://gothos.info/wp-content/uploads/2010/03/Take-10-Map.png"><img class="aligncenter size-medium wp-image-468" title="Take 10 Map" src="http://gothos.info/wp-content/uploads/2010/03/Take-10-Map-300x216.png" alt="" width="300" height="216" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://gothos.info/2010/03/track-2010-census-participation-rates/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Evaluating Open Source GIS for Libraries</title>
		<link>http://gothos.info/2010/03/evaluating-open-source-gis-for-libraries/</link>
		<comments>http://gothos.info/2010/03/evaluating-open-source-gis-for-libraries/#comments</comments>
		<pubDate>Wed, 17 Mar 2010 15:56:08 +0000</pubDate>
		<dc:creator>Frank</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Resources]]></category>
		<category><![CDATA[academic articles]]></category>
		<category><![CDATA[GIS]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[software]]></category>

		<guid isPermaLink="false">http://gothos.info/?p=458</guid>
		<description><![CDATA[I&#8217;ve hit a couple of milestones this month.
I had my first peer-reviewed journal article published, Evaluating open source GIS for libraries. After my initial exploration of open source GIS that I documented on this blog over a year and a half ago, I took a systematic approach to evaluating a number of software packages for [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve hit a couple of milestones this month.</p>
<p>I had my first peer-reviewed journal article published, <em>Evaluating open source GIS for libraries</em>. After my initial exploration of open source GIS that I documented on this blog over a year and a half ago, I took a systematic approach to evaluating a number of software packages for thematic mapping. This article documents the tests and results and provides the requisite background on open source software, GIS, and how both are manifest in academic libraries. Given the lengthy process of academic publishing (the whole process began in Dec 2008 with my first test and ended in March 2010 with publication), some of my observations of individual software packages have changed with the release of bug fixes, new features, and new versions. Generally, individual software packages and open source GIS as a whole have improved during this short span of time, but my primary observations and the big picture still hold.</p>
<p>Title: <a href="http://gothos.info/resource_files/fpd_libhitech_foss_gis_march2010.pdf">Evaluating open source GIS for libraries</a><br />
Author(s): Francis P. Donnelly<br />
Journal: <a href="http://www.emeraldinsight.com/0737-8831.htm">Library Hi Tech</a><br />
Year: 2010 Volume: 28 Issue: 1 Page: 131 &#8211; 151<br />
ISSN: 0737-8831<br />
DOI: <a href="http://www.emeraldinsight.com/10.1108/07378831011026742">10.1108/07378831011026742</a><br />
Publisher: Emerald Group Publishing Limited</p>
<p>I&#8217;ve previously mentioned Steiniger and Bocher&#8217;s excellent article, <em>An overview on current free open source desktop GIS developments</em> in the International Journal of Geographic Information Science, which Steiniger has posted on <a href="http://www.spatialserver.net/osgis/">his website</a>. I recently discovered he&#8217;s written a second article with Hay entitled <em>Free and Open Source Geographic Information Tools for Landscape Ecology</em> in Ecological Informatics, which is also available there. The second article provides an in-depth look and great summary tables of landscape analysis applications for eight different open source GIS apps, focusing on advanced tools for researchers. In contrast, my article focuses on basic mapping capabilities for novice to intermediate users.</p>
<p>The other milestone is this blog &#8211; I just noticed that we&#8217;ve passed the two year mark. While there have only been a few public comments here and there, I have received a number of emails over the years with questions and comments and the number of visitors to the site has grown consistently from month to month. I&#8217;m glad that it&#8217;s been useful to so many people; it&#8217;s certainly been useful to me (as an extension to my feeble brain) and I&#8217;ll endeavor to keep it going. Thanks to everyone for your comments and feedback. Best &#8211; frank</p>
]]></content:encoded>
			<wfw:commentRss>http://gothos.info/2010/03/evaluating-open-source-gis-for-libraries/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Mapping Hard to Count Areas for Census 2010</title>
		<link>http://gothos.info/2010/02/mapping-hard-to-count-areas-for-census-2010/</link>
		<comments>http://gothos.info/2010/02/mapping-hard-to-count-areas-for-census-2010/#comments</comments>
		<pubDate>Wed, 24 Feb 2010 02:37:15 +0000</pubDate>
		<dc:creator>Frank</dc:creator>
				<category><![CDATA[Resources]]></category>
		<category><![CDATA[census]]></category>
		<category><![CDATA[Census 2010]]></category>
		<category><![CDATA[census tract]]></category>
		<category><![CDATA[Mapping]]></category>
		<category><![CDATA[new york city]]></category>

		<guid isPermaLink="false">http://gothos.info/?p=452</guid>
		<description><![CDATA[There was an interesting article in the New York Times today about neighborhoods in New York that typically get under-counted in the Census. These include areas with high immigrant populations as well as places that have had new construction since the last census, as the buildings haven&#8217;t been added to the Census Bureau&#8217;s master address [...]]]></description>
			<content:encoded><![CDATA[<p>There was an interesting <a href="http://www.nytimes.com/2010/02/24/nyregion/24census.html" target="_blank">article in the New York Times</a> today about neighborhoods in New York that typically get under-counted in the Census. These include areas with high immigrant populations as well as places that have had new construction since the last census, as the buildings haven&#8217;t been added to the Census Bureau&#8217;s master address file.</p>
<p>What the article didn&#8217;t mention is that CUNY&#8217;s Center for Urban Research has created a great online ap called the <a href="http://www.urbanresearch.org/resources/Census-2010-HTC-site" target="_blank">Census 2010 Hard to Count mapping site</a>. The site is built on the Census Bureau’s Tract Level Planning Database, which identified twelve population and housing variables, such as language isolation, recent movers, poverty, and crowded housing, that were associated with low mail response in the 2000 Census. This tool was designed to help Census reps, local government officials, and community activists identify traditionally under-counted areas to insure a more complete count this time around.</p>
<p>The database is national in scope, and you can easily map tracts for a particular state, county, city, metro area, or tribal area, and you can search for an area using an individual address. The map is built on a Google Maps interface, and zooming in will change the units mapped from larger units (states, counties, etc) to tracts. You can easily select one of the twelve variables color-coded in the menu to the left of the map, or a Hard to Count index of all the variables.</p>
<p><a href="http://gothos.info/wp-content/uploads/2010/02/Screenshot-Census-2010-Mapping-the-Hard-to-Count-Population-Mozilla-Firefox.png"><img src="http://gothos.info/wp-content/uploads/2010/02/Screenshot-Census-2010-Mapping-the-Hard-to-Count-Population-Mozilla-Firefox-300x228.png" alt="" title="Census 2010 - Mapping the Hard to Count Population" width="300" height="228" class="aligncenter size-medium wp-image-453" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://gothos.info/2010/02/mapping-hard-to-count-areas-for-census-2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Calculated Fields in SpatiaLite / SQLite</title>
		<link>http://gothos.info/2010/02/calculated-fields-in-spatialite-sqlite/</link>
		<comments>http://gothos.info/2010/02/calculated-fields-in-spatialite-sqlite/#comments</comments>
		<pubDate>Wed, 03 Feb 2010 14:22:48 +0000</pubDate>
		<dc:creator>Frank</dc:creator>
				<category><![CDATA[Data Processing]]></category>
		<category><![CDATA[calculated fields]]></category>
		<category><![CDATA[geodatabase]]></category>
		<category><![CDATA[joining tables]]></category>
		<category><![CDATA[new york city]]></category>
		<category><![CDATA[pumas]]></category>
		<category><![CDATA[qgis]]></category>
		<category><![CDATA[spatialite]]></category>
		<category><![CDATA[sql]]></category>
		<category><![CDATA[sqlite]]></category>
		<category><![CDATA[Tutorial]]></category>

		<guid isPermaLink="false">http://gothos.info/?p=418</guid>
		<description><![CDATA[After downloading data, it&#8217;s pretty common that you&#8217;ll want to create calculated fields, such as percent totals or change, to use for analysis and mapping. The next step in my QGIS / SpatiaLite experiment was to create a calculated field (aka derived field). I&#8217;ll run through three ways of accomplishing this, using my subway commuter [...]]]></description>
			<content:encoded><![CDATA[<p>After downloading data, it&#8217;s pretty common that you&#8217;ll want to create calculated fields, such as percent totals or change, to use for analysis and mapping. The next step in my QGIS / SpatiaLite experiment was to create a calculated field (aka derived field). I&#8217;ll run through three ways of accomplishing this, using my subway commuter data to calculate the percentage of workers in each NYC PUMA who commute to work. Just to keep everything straight:</p>
<blockquote>
<ul>
<li> sub_commuters is a census data table for all PUMAs in NY State
<ul>
<li>[SUBWAY] field that has the labor force that commutes by subway</li>
<li>[WORKERS_16] field with the total labor force</li>
<li>[SUB_PER] a calculated field with the % of labor force that commutes by subway</li>
<li>[GEO_ID2] the primary key field, FIPS code that is the unqiue identifier</li>
</ul>
</li>
<li> nyc_pumas is a feature class with all PUMAs in NYC
<ul>
<li>[PUMA5ID00] is the primary key field, FIPS code that is the unqiue identifier</li>
</ul>
</li>
<li>pumas_nyc_subcom is the data table that results from joining sub_commuters and nyc_pumas; it can be converted to a feature class for mapping</li>
</ul>
</blockquote>
<h3>Spreadsheet</h3>
<p>The first method would be to add the calculated field to the data after downloading it from the census in a spreadsheet, as part of the cleaning / preparation stage. You could then save it as a delimited text file for import to SpatiaLite. No magic there, so I&#8217;ll skip to the second method.</p>
<h3>SpatiaLite</h3>
<p>The second method would be to create the calculated field in the SpatiaLite database. I&#8217;ll go through the steps I used to figure this out. The basic SQL select query:</p>
<blockquote><p>SELECT *, (SUBWAY / WORKERS_16) AS SUB_PER FROM sub_commuters</p></blockquote>
<p>This gives us the proper result, but there are two problems. First, the data in my SUBWAY and WORKERS_16 field are stored as integers, and when you divide the result is rounded to the nearest whole number. Not very helpful here, as my percentage results get rounded to 0 or 1. There are many ways to work around this: set the numeric fields as double, real, or float in the spreadsheet before import (didn&#8217;t work for me), specify the field types when importing (didn&#8217;t get that option with the SpatiaLite GUI, but maybe you can with the command line), add * 100 to the expression to multiply the percentage to a whole number (ok unless you need decimals in your result) or use the CAST operator. CAST converts the current data type of a field to a specified data type in the result of the expression. So:</p>
<blockquote><p>SELECT *, (CAST (SUBWAY AS REAL)/ CAST(WORKERS_16 AS REAL)) AS SUB_PER FROM sub_commuters</p></blockquote>
<p><a href="http://gothos.info/wp-content/uploads/2010/02/sql1.png"><img class="alignleft size-thumbnail wp-image-424" style="margin-left: 20px; margin-right: 20px;" title="sql1" src="http://gothos.info/wp-content/uploads/2010/02/sql1-150x150.png" alt="" width="90" height="90" /></a>This gave me the percentages with several decimal places (since we&#8217;re casting the fields as real instead of integer), which is what I needed. The second problem is that this query just produces a temporary view; in order to map this data, we need to create a new table to make the calculated field permanent and join it to a feature class. Here&#8217;s how we do that:</p>
<blockquote><p>CREATE TABLE pumas_nyc_subcom AS<br />
SELECT *, (CAST (SUBWAY AS REAL)/ CAST(WORKERS_16 AS REAL)) AS SUB_PER<br />
FROM sub_commuters, nyc_pumas<br />
WHERE nyc_pumas.PUMA5ID00=sub_commuters.geo_id2</p></blockquote>
<p>The CREATE TABLE AS statement let&#8217;s us create a new table from the existing two tables &#8211; the data table of subway commuters and the feature class table for NYC PUMAs. We select all the fields in both while throwing in the new calculated field, and we join the data table to the feature class all in one step, and via the join we end up with just data from NYC (the data for the rest of the state gets dropped). After that, it&#8217;s just a matter of taking our new table and enabling the geometry to make it a feature class (as explained in the <a href="http://gothos.info/?p=378">previous post</a>).</p>
<p><a href="http://gothos.info/wp-content/uploads/2010/02/sql2.png"><img class="alignleft size-thumbnail wp-image-425" style="margin-left: 20px; margin-right: 20px;" title="sql2" src="http://gothos.info/wp-content/uploads/2010/02/sql2-150x150.png" alt="" width="90" height="90" /></a>This seems like it should work &#8211; but I discovered another problem. The resulting calculated field that has the percentage of subway commuters per PUMA, SUB_PER, has no data type associated with it. Looking at the schema for the table in SpatiaLite shows that the data type is blank. If I bring this into QGIS, I&#8217;m not able to map this field as a numeric value, because QGIS doesn&#8217;t know what it is. I have to define the data type for this field. SpatiaLite (SQLite really) doesn&#8217;t allow you to re-define an existing field &#8211; we have to create and define a new blank field, and the set the value of our calculated field equal to it. Here are the SQL statements to make it all happen:</p>
<blockquote><p>ALTER TABLE sub_commuters ADD SUB_PER REAL</p></blockquote>
<blockquote><p>UPDATE sub_commuters SET SUB_PER=(CAST (SUBWAY AS REAL)/ CAST(WORKERS_16 AS REAL))</p></blockquote>
<blockquote><p>CREATE TABLE pumas_nyc_subcom AS<br />
SELECT * FROM sub_commuters, nyc_pumas<br />
WHERE nyc_pumas.PUMA5ID00=sub_commuters.geo_id2</p></blockquote>
<p><a href="http://gothos.info/wp-content/uploads/2010/02/sql4.png"><img class="alignleft size-thumbnail wp-image-426" style="margin-left: 20px; margin-right: 20px;" title="sql4" src="http://gothos.info/wp-content/uploads/2010/02/sql4-150x150.png" alt="" width="90" height="90" /></a>So, we add a new blank field to our data table and define it as real. Then we update our data table by seting that blank field equal to our expression, thus filling the field with the result of our expression. Once we have the defined calculated field, we can create a new table from the data plus the features based on the ID they share in common. Once the table is created, then we can activate the geometry (right click on geometry field in the feature class and activate &#8211; see <a href="http://gothos.info/?p=378">previous post</a> for details) so we can map it in QGIS. Phew!</p>
<h3>QGIS</h3>
<p><a href="http://gothos.info/wp-content/uploads/2010/02/sql3.png"><img class="alignleft size-thumbnail wp-image-428" style="margin-left: 20px; margin-right: 20px;" title="sql3" src="http://gothos.info/wp-content/uploads/2010/02/sql3-150x150.png" alt="" width="90" height="90" /></a>The third method is to create the calculated field within QGIS, using the new field calculator. It&#8217;s pretty easy to do &#8211; you select the layer in the table of contents and go into an edit mode. Open the attribute table for the features and click the last button in the row of buttons underneath the table &#8211; this is the field calculator button. Once we&#8217;re in the field calculator window, we can choose to update an existing field or create a new field. We give the output field a name and a data type, enter our expression SUBWAY / WORKERS_16, hit OK, and we have our new field. Save the edits and we should be good to go. HOWEVER &#8211; I wasn&#8217;t able to add a calculated fields to features in a SpatiaLite geodatabase without getting errors. I posted to the QGIS forum &#8211; initially it was thought that the SpatiaLite driver was read only, but it turns out that&#8217;s not the case and so and the developers are investigating a possible bug. The investigation continues &#8211; stay tuned. I have tried the field calculator with shapefiles and it works perfectly (incidentally, you can export SpatiaLite features out of the database as shapefiles).</p>
<p>I&#8217;m providing the database I created <a href="http://gothos.info/lib3010/subway_test.zip">here</a> for download, if anyone wants to experiment.</p>
]]></content:encoded>
			<wfw:commentRss>http://gothos.info/2010/02/calculated-fields-in-spatialite-sqlite/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SpatiaLite and QGIS: Loading, Joining, Mapping Shapefiles and Tables</title>
		<link>http://gothos.info/2010/01/spatialite-and-qgis-loading-joining-mapping-shapefiles-and-tables/</link>
		<comments>http://gothos.info/2010/01/spatialite-and-qgis-loading-joining-mapping-shapefiles-and-tables/#comments</comments>
		<pubDate>Sat, 30 Jan 2010 21:52:30 +0000</pubDate>
		<dc:creator>Frank</dc:creator>
				<category><![CDATA[Data Processing]]></category>
		<category><![CDATA[GIS]]></category>
		<category><![CDATA[joining data]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[qgis]]></category>
		<category><![CDATA[shapefile]]></category>
		<category><![CDATA[spatialite]]></category>
		<category><![CDATA[sqlite]]></category>
		<category><![CDATA[Tutorial]]></category>

		<guid isPermaLink="false">http://gothos.info/?p=378</guid>
		<description><![CDATA[I stuck with with the Long Term Support Version of QGIS (1.02) last semester while I was teaching, but now I finally have had a chance to experiment with the latest version (1.4) which has a lot of great new features including: improved symbolization, labeling, print layouts, and support for SpatiaLite &#8211; a personal (single [...]]]></description>
			<content:encoded><![CDATA[<p>I stuck with with the Long Term Support Version of QGIS (1.02) last semester while I was teaching, but now I finally have had a chance to experiment with the latest version (1.4) which has a lot of great new features including: improved symbolization, labeling, print layouts, and support for SpatiaLite &#8211; a personal (single file) geodaatbase based on SQLite. For a summary of the new QGIS features check out <a href="http://blog.qgis.org/node/142" target="_blank">the QGIS blog</a> and <a href="http://linfiniti.com/2009/11/some-more-sneak-peeks-into-qgis-trunk/" target="_blank">this developer&#8217;s blog</a>, and for an overview of SpatialLite you can go <a href="http://www.gaia-gis.it/spatialite/docs.html" target="_blank">to the official docs page</a> and <a href="http://bostongis.org/?content_name=spatialite_tut01#195" target="_blank">this tutorial</a>. The latter will show you the obvious strengths of SpatialLite &#8211; the ability to store features and attributes in one container, with the ability to run standard SQL and spatial queries on both. Since that&#8217;s covered pretty well, I thought I&#8217;d  run through a basic operation &#8211; how do you load a shapefile and an attribute table in SpatialLite, join them, connect to the database in QGIS and map the data. I&#8217;m using the SpatialLite GUI, but for those more inclined you could use the command line tool instead.</p>
<p><a href="http://gothos.info/wp-content/uploads/2010/01/spatlite2.png"><img class="size-thumbnail wp-image-382  alignleft" style="margin-left: 20px; margin-right: 20px;" title="spatlite2" src="http://gothos.info/wp-content/uploads/2010/01/spatlite2-150x150.png" alt="Loading shapefile in Spatialite" width="90" height="90" /></a>Fire up the GUI, and create a new, empty geodatabase under the File menu.Once we have a container, we can hit the load a shapefile button. I have a  census PUMA layer for NYC that I&#8217;ve formatted by erasing water features. Click load, go to the path, give the file a nice brief name, and specify the SRID &#8211; the EPSG code that specifies what coordinate system my shapefile is in. In this case, it&#8217;s 4269 as the layer is in NAD83 (you can check your files by opening the prj file in a text editor or by using the OGR tools).</p>
<p><a href="http://gothos.info/wp-content/uploads/2010/01/spatlite3.png"><img class="size-thumbnail wp-image-383  alignleft" style="margin-left: 20px; margin-right: 20px;" title="spatlite3" src="http://gothos.info/wp-content/uploads/2010/01/spatlite3-150x150.png" alt="Table view" width="90" height="90" /></a>Once it&#8217;s loaded, you can expand the listing in the table of contents to see all the field names of the feature, and you can right click on it and choose the edit option to see all of the data in the attribute table.</p>
<p>Next we can load a data table. I have a 2006-2008 ACS census table in tab-delimited text format that I&#8217;ve pre-formatted. The table has the number of workers (labor force age 16+) and number of workers who commuted to work via the subway for every PUMA in the State of New York (it&#8217;s faster to download the whole state and filter out the city PUMAs later). Hit the load txt/csv button, specify a path, a new table name (subway_commuters), the delimiter used, and load the table. It&#8217;s given a different icon in the table of contents (toc), to distinguish a regular data table from a feature class.</p>
<p><a href="http://gothos.info/wp-content/uploads/2010/01/spatlite4.png"><img class="alignleft size-thumbnail wp-image-384" style="margin-left: 20px; margin-right: 20px;" title="spatlite4" src="http://gothos.info/wp-content/uploads/2010/01/spatlite4-150x150.png" alt="spatlite4" width="90" height="90" /></a>The next step is to join them together; I already insured that they both share a common, unique identifier; a FIPS code that has a state and PUMA code. If I run a standard SELECT query I can join the tables in a temporary view &#8211; but that&#8217;s not what I want. I can save the query as a view, but I won&#8217;t be able to access the view within QGIS (at least not with this current stable version of SpatialLite, 2.31). What we have to do here is create a brand new table that combines both the puma feature class and the subway commuter table (referred to in Microsoft Access land as a Make Table Query). Here&#8217;s the SQL that we type in the command window:</p>
<blockquote><p>CREATE TABLE pumas_nyc_subcom AS<br />
SELECT *<br />
FROM nyc_pumas, sub_commuters<br />
WHERE PUMA5ID00=GEO_ID2</p></blockquote>
<p>Execute the query, and we get a message that an empty results set was generated. Uh, ok. But then if we select the database path at the top of the TOC , right-click, and refresh, we&#8217;ll see our new combined table, pumas_nyc_subcom, and we can expand it and take a look at the data. The join worked, but we&#8217;re not done yet. Right now this is just a regular old data table (notice the icon?) We have to turn this into a feature class next.</p>
<p><a href="http://gothos.info/wp-content/uploads/2010/01/spatlite6.png"><img class="size-thumbnail wp-image-385  alignleft" style="margin-left: 20px; margin-right: 20px;" title="spatlite6" src="http://gothos.info/wp-content/uploads/2010/01/spatlite6-150x150.png" alt="Joined and created feature class" width="90" height="90" /></a>Expand the fields for the new table in the TOC, select the Geometry field, right click, and check the geometry. We&#8217;ll see that it&#8217;s MULTIPOLYGON geometry, the projection is still NAD83, and there are 55 features (the non-NYC PUMAS were filtered out during the join, leaving us just with NYC data). Right click on Geometry again, choose the option to Recover Geometry. Specify the geometry type and the SRID, run, refresh the database, and success. A little globe appears next to pumas_nyc_subcom, indicating that it&#8217;s now a feature class.</p>
<p><a href="http://gothos.info/wp-content/uploads/2010/01/spatlite7.png"><img class="aligncenter size-full wp-image-408" style="margin-top: 20px; margin-bottom: 20px;" title="spatlite7" src="http://gothos.info/wp-content/uploads/2010/01/spatlite7.png" alt="spatlite7" width="258" height="77" /></a></p>
<p><a href="http://gothos.info/wp-content/uploads/2010/01/spatlite8.png"><img class="size-thumbnail wp-image-386  alignleft" style="margin-left: 20px; margin-right: 20px;" title="spatlite8" src="http://gothos.info/wp-content/uploads/2010/01/spatlite8-150x150.png" alt="QGIS Spatialite connection interface" width="90" height="90" /></a>At this point we can fire up QGIS. In the toolbar for versions post 1.02, there should be a connect to SpatialLite button. Hit connect, add a New database, and browse to get to it. Once it&#8217;s loaded, then we can hit connect to connect to it, and we&#8217;ll be able to see all feature classes (but NOT data tables, which is why we had to go through the join). Select pumas_nyc_subcom, which has features and data, and click add.</p>
<p style="text-align: center;">
<p>As with any GIS, now we have to symbolize the features to map the subway commuters. Right click on the layer in the table of contents, select properties, and you&#8217;ll get to the recently redesigned properties menu. Go to Symbology, map the subway commuters field by graduated values, change some colors, and voila, a map!</p>
<p><a href="http://gothos.info/wp-content/uploads/2010/01/spatlite12.png"><img class="size-thumbnail wp-image-387  alignleft" style="margin-left: 20px; margin-right: 20px;" title="spatlite12" src="http://gothos.info/wp-content/uploads/2010/01/spatlite12-150x150.png" alt="QGIS map with data and new labels" width="90" height="90" /></a>The developers are still experimenting with improvements &#8211; there&#8217;s a button in the upper right-hand corner of the symbology tab that asks you if you want to try the New Symbology &#8211; this is a new layout, with the introduction of graduated color palettes. It&#8217;s pretty slick, but still a work in progress (color ranges are assigned from dark to light, with the lowest values getting the darkest color; the opposite of cartographic convention). The same label properties are there too, but you can experiment with the improved labeling engine under the Plugins menu. The automatic placement of labels is vastly improved.</p>
<p>Mapping totals for subway commuters isn&#8217;t as interesting as mapping the percentage of commuters in each PUMA who ride the subway. So I&#8217;ll share my experiments working with calculated fields (in SpatialLite and QGIS) in my next post.</p>
]]></content:encoded>
			<wfw:commentRss>http://gothos.info/2010/01/spatialite-and-qgis-loading-joining-mapping-shapefiles-and-tables/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Natural Earth Vector and Raster Data</title>
		<link>http://gothos.info/2009/12/natural-earth-vector-and-raster-data/</link>
		<comments>http://gothos.info/2009/12/natural-earth-vector-and-raster-data/#comments</comments>
		<pubDate>Tue, 15 Dec 2009 19:35:51 +0000</pubDate>
		<dc:creator>Frank</dc:creator>
				<category><![CDATA[Data Sources]]></category>
		<category><![CDATA[country data]]></category>
		<category><![CDATA[GIS]]></category>
		<category><![CDATA[gis data]]></category>
		<category><![CDATA[global data]]></category>
		<category><![CDATA[natural earth]]></category>

		<guid isPermaLink="false">http://gothos.info/?p=363</guid>
		<description><![CDATA[I haven&#8217;t been posting regularly as I&#8217;ve been swamped this semester &#8211; but now that it&#8217;s coming to an end I should be able to crank out a post or two each month.
I recently saw a message on Maps-L about a new GIS data source, Natural Earth, and just got around to taking a look [...]]]></description>
			<content:encoded><![CDATA[<p>I haven&#8217;t been posting regularly as I&#8217;ve been swamped this semester &#8211; but now that it&#8217;s coming to an end I should be able to crank out a post or two each month.</p>
<p>I recently saw a message on Maps-L about a new GIS data source, <a href="http://www.naturalearthdata.com/" target="_blank">Natural Earth</a>, and just got around to taking a look at it. It&#8217;s run by a volunteer organization dedicated to providing free, integrated, public domain map layers for producing high-quality maps at small scales. They have a pretty comprehensive website that includes a blog, feature list, contributor information, and details on how to volunteer.</p>
<p>Natural Earth provides smooth, generalized vector and raster layers at three scales: 1:10m, 1:50m, and 1:110m. See my screen shot of the Delmarva peninsula to see the distinctions (beige area is 110m, red line is 50m, and blue line is 10m).</p>
<p><a href="http://gothos.info/wp-content/uploads/2009/12/nat_earth.png"><img class="alignleft size-medium wp-image-364" style="margin: 10px;" title="nat_earth" src="http://gothos.info/wp-content/uploads/2009/12/nat_earth-300x217.png" alt="nat_earth" width="300" height="217" /></a></p>
<p>Having a choice of scales with vector and raster data layers from the same source is a huge plus (many other country-level boundary files available on the web are detailed and suitable for large scale maps, but look messy when you zoom out to a smaller scale). Natural Earth also provide outlines for land and water (including legal water boundaries for all the Pacific islands), hydrographic features generalized to the different scales, ice shelves, urban areas, and several lat/long grid line layers.</p>
<p>For country boundaries they&#8217;ve gotten around the tangled issue of country definitions by providing different layers for different definitions, so you can choose the one that&#8217;s most appropriate &#8211; sovereign states (so, Greenland would be part of the Denmark polygon, Alaska and Puerto Rico part of the US, and French Guiana part of France), countries (Greenland separate from the Denmark polygon, Puerto Rico separate from the US, Alaska part of the US, and French Guiana part of France), and subunits (each place its own polygon). As you move down this hierarchy, places are linked back to their whole (so there are fields in the subunit file that list which country and sovereign state it&#8217;s part of).</p>
<p>At this point subdivisions (states / provinces) are only provided for the US and Canada. They do provide some descriptive metadata for each layer on the website, but the metadata doesn&#8217;t follow any standardized format for geographic data. The biggest missing link is unique identifiers &#8211; none of the countries have ISO or FIPS codes, so there aren&#8217;t any fields to join attribute data to for thematic mapping (except country name, which never works smoothly given the amount of variation with names).</p>
<p>Overall this looks like a great resource. Vector data is in shapefile format, raster data is in tiff, and everything is defined as simple WGS 84, so these files should work with almost any GIS package, ready to go.</p>
]]></content:encoded>
			<wfw:commentRss>http://gothos.info/2009/12/natural-earth-vector-and-raster-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Update on Some Data Sources</title>
		<link>http://gothos.info/2009/10/update-on-some-data-sources/</link>
		<comments>http://gothos.info/2009/10/update-on-some-data-sources/#comments</comments>
		<pubDate>Sat, 31 Oct 2009 18:36:16 +0000</pubDate>
		<dc:creator>Frank</dc:creator>
				<category><![CDATA[Data Sources]]></category>
		<category><![CDATA[american community survey]]></category>
		<category><![CDATA[census data]]></category>
		<category><![CDATA[country profiles]]></category>
		<category><![CDATA[GIS]]></category>
		<category><![CDATA[gis data]]></category>
		<category><![CDATA[pumas]]></category>
		<category><![CDATA[qgis]]></category>

		<guid isPermaLink="false">http://gothos.info/?p=358</guid>
		<description><![CDATA[Here&#8217;s my last chance to squeeze in a post before the month is over. There have been a lot of changes and updates with some key data sites lately. Here&#8217;s a summary:

The homepage for gdata, which provides global GIS data that was created as part of UC Berkeley&#8217;s Biogeomancer project, has moved to the DIVA-GIS [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s my last chance to squeeze in a post before the month is over. There have been a lot of changes and updates with some key data sites lately. Here&#8217;s a summary:</p>
<ul>
<li>The homepage for gdata, which provides global GIS data that was created as part of UC Berkeley&#8217;s Biogeomancer project, has moved to the <a href="http://www.diva-gis.org/gData" target="_blank">DIVA-GIS</a> website. DIVA-GIS is a free GIS software project designed specifically for biology and ecology applications, with support from UC Berkeley as well as several other research institutions and independent contributors. It looks like the old download interface has been incorporated into the DIVA-GIS page.</li>
<li>The US Census Bureau has recently released its latest iteration of the <a href="http://www.census.gov/geo/www/tiger/" target="_blank">TIGER shapefiles</a>, the 2009 TIGER/Line Shapefiles. Since they seem to be making annual updates, which has involved changing the URLs around, it may be better to link to their main TIGER shapefile page where you can get to the latest and previous versions of the files.</li>
<li>The bureau has released its latest American Community Survey (ACS) data: 2008 annual estimates for geographic areas with 65,000 plus people, and three year 2006-2008 estimates for geographic areas with 20,000 plus people. Available through the <a href="http://factfinder.census.gov/home/saff/main.html?_lang=en" target="_blank">American Factfinder</a>.</li>
<li>Over the summer, UM Information Studies student Clint Newsom and I created a 2005-2007 PUMA-level <a href="http://www.baruch.cuny.edu/geoportal/data/nymag/" target="_blank">New York Metropolitan ACS Geodatabase (NYMAG)</a>. It&#8217;s available for download on the new <a href="http://www.baruch.cuny.edu/geoportal/" target="_blank">Baruch Geoportal</a>, which was re-launched as a public website this past September. It&#8217;s a personal geodatabase in Microsoft Access format, so it can only be directly used with ArcGIS. I plan on creating the 2006-2008 version sometime between January and March 2010, and hope to release an Access and SQLite version, as the latest development versions of QGIS now offer direct support for SQlite geodatabases in the Spatialite format (which is awesome!).</li>
<li>While it&#8217;s not a source for GIS data or attribute tables, it&#8217;s still worth mentioning that the <a href="https://www.cia.gov/library/publications/the-world-factbook/" target="_blank">CIA World Factbook</a> completely revised their website this past summer. The previous web versions of the factbook took their design cues from the old paper copies of the report. The CIA revamped the entire site and apparently will be using a model of continuous rather than annual updates. It&#8217;s a great site for getting country profiles &#8211; another good option is the <a href="http://data.un.org/CountryProfile.aspx" target="_blank">UN World Statistics Pocketbook</a>, which is part of the <a href="http://data.un.org/Default.aspx" target="_blank">UNdata</a> page.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://gothos.info/2009/10/update-on-some-data-sources/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
