Eldiario.es writes about wikimaps

The Spanish language news website Eldiario.es has recently written about my work with the Wikimaps Warper on Wikimedia Commons and historical maps.

http://www.eldiario.es/hojaderouter/internet/Wikipedia-mapas-cartografia-creative_commons_0_494851710.html

It also talks about the new role of maps and OpenStreetMap within the wikipedia ecosystem.

mapa-centro-ciudad-alemana-Dresde_EDIIMA20160322_0446_18

Advertisement

A “Frightful Number!” – Mapping Daniel Defoe’s A Journal of the Plague Year

Two wonderful historical maps using Mapwarper.net by Vimala C. Pasupathi @Exhaust_Fumes showing Melville in Rome and in London during the Great Plaque

http://hofstra.github.io/itinerary/

Screenshot from 2016-03-24 00:05:44

“A Frightful Number!” Mapping Daniel Defoe’s A Journal of the Plague Year

“A Frightful Number” uses data culled from Daniel Defoe’s “creative non-fiction” Journal of the Plague Year and from historical sources to track the spread of the 1665 London plague. Created with Hofstra DRC’s innovative mapping tool Itinerary, the site annotates the epidemic’s progress month by month, parish by parish.

As a supplement to the novel, “‘A Frightful Number’” represents and engages with Defoe’s epidemiological mapping of the plague as it makes its way across London and its suburbs, enabling readers to gain a sense of the geography of London to which Defoe refers and a better grasp of the plague’s progression through and impact on the city. “‘A Frightful Number’” also seeks to dissect and analyze the relationship between the novel as a literary form and cartography and how their respective histories intersect throughout the early modern period. Presenting A Journal of the Plague Year in cartographic form helps underscore for readers the difficulties and obstacles that the narrator, H.F., faces as he attempts to “map” and, ultimately, survive the plague, an invisible and elusive entity whose presence is only manifest in its gruesome effects on London’s population. H.F.’s map of the plague is, in this sense, a useful and necessary technology that can make the invisible visible, giving the disease a shape and form in order to better understand its origins and causes, as well as its potential consequences for London and the nation. That said, such a task proves to be immensely difficult, if not impossible, for Defoe’s narrator, who survives the plague but doesn’t seem to come much closer to grasping it in all of its deadly complexity. “‘A Frightful Number’” thus seeks to reflect this sense of ambivalence in Defoe’s novel: to offer a map of the Great Plague that, at the same time, exposes the very real problems and limitations in fully realizing such a project.

Melville in Rome

Screenshot from 2016-03-24 00:07:48

Wikimaps on Maps Mania: Maps in the Age of Cholera (Leeds) & The Vintage Maps of Berlin

Cross posted from The Wikimaps Blog wikimaps.wikimedia.fi

The wonderful, prolific and very popular Maps Mania blog  featured the Wikimaps Warper a few times recently, do check them out!

The first interactive map: The Vintage Maps of Berlin uses the Wikimaps Warper.

mapsmania

Keir writes:

This collection of old historical maps of Berlin centers around what is now Museum Island in Berlin.

In the oldest maps you can clearly see the two towns of Cölln and Altberlin, on opposite banks of the River Spree. As you progress through the maps you can see how this area of Berlin’s has changed and developed over the centuries.

Do check out the 11 maps of Berlin from 1652 to today  here: http://homepage.ntlworld.com/keir.clarke/leaflet/berlin.htm

The second post, and interactive map entitled Maps in the Age of Cholera based on an epidemiological map of Leeds (co-incidentally my home!).

logo (1)

This was also created by Keir and he writes:

Twenty Years before John Snow famously mapped the locations of cholera victims in Broad Street, London, Robert Baker plotted the deaths of cholera victims in Leeds.

Maps in the Age of Cholera is a story map based around Robert Baker’s ‘Sanitary Map of the Town of Leeds’ exploring the 1832 cholera epidemic in the Yorkshire town. Baker never made the link between cholera and contaminated water. However, in his map and in the accompanying report to the Leeds Board of Health, Baker noted that “the disease was worst in those parts of the town where there is often an entire want of sewage, drainage and paving”.

The map itself uses this Leaflet Story Map plug-in. The Leaflet Story Map library uses jQuery to create a scroll driven story map. The map tiles scheme for Robert Baker’s 1832 ‘Sanitary Map of the Town of Leeds’ comes from Wikimaps Warper.

do go check out the interactive story map here http://homepage.ntlworld.com/keir.clarke/leaflet/cholera.htm

strace and ruby-prof to identify slow rails startup on wikimaps

The Wikimaps map warper – a free georeferencing georectification tool for historical maps and images hosted on Wikimedia Commons had some issues recently. Here’s a short document describing some diagnostics to identify the problems.

The wikimaps warper runs on the Wikimedia Labs infrastructure  – essentially an OpenStack cluster of servers. The Labs is run by some great admins and dev ops and they are on the ball when it comes to rolling out needed updates to al the servers and instances. However each instance is generally admin’ed by people within individual projects. It was after they applied a kernel update due to a security patch and restarted all the machines that the warper was seen to be offline. The ultimate issue was that there was something wrong with the NFS – as many other instances and tools were having problems – I’m not going to blame the NFS because it showed up where the warper application was encountering issues.

The problem

The application is a ruby on rails application which runs using Apache and Passenger. It was timing out, it wouldn’t start. I ssh’d into the server. The passenger version was an old one (via a package, rather than a gem) and didn’t support increasing the timeout version. So I removed that, and installed a new version. The library would also be running using a faster version of Ruby to get running. Did all that, but the warper wouldn’t start.

I ssh’ed into the server and couldn’t see any errors anywhere. But running a console command, or a “rake -T” which would load up the Rails environment, took 11 minutes. ELEVEN MINUTES!

Eleven Minutes vs 22 Seconds  – Diagnostics

I use a little old atom cpu, 2gig memory netbook, and it takes less than 22 seconds to load up there.  Enter Strace and ruby-prof. (Note that the screenshots and reports following are based on me getting the loading time down to something workable, but there still work to do)

strace is a linux command line tool which traces system calls and signals. It’s amazing for debugging things. I’ve hardly ever used it. I ran strace

strace rake -T

and watched for around 5 minutes the Rails loading process. I stopped it as it wasn’t telling me much and I wasn’t prepared to wait 11 minutes! But what it did was show me something like the following – the inefficient way Rails loads up all the libraries, the gems:

strace -e trace=open,stat,read,write bin/rake -T

to trace only certain file related calls: here is an example of what it looks like on my netbook

strace-raw

It shows, firstly that its doing lots of file calls, and secondly that its getting lots of misses. Not very efficient but also, totally normal and not revealing.

strace with summary report is what I wanted.

 strace -o strace.txt -c bin/rake -T

gave something better:

strace-summary

You can see in this case that it’s taking around 5 seconds with open calls (the errors are the expected misses, I think). Compare this from my netbook

strace-summary-netbook

5 seconds vs 0.04 seconds. But what files was it opening, what was Ruby up to? Enter ruby-prof

Ruby-Prof

ruby-prof -s total -p flat bin/rake -- -T > ruby_prof_flat.txt

ruby prof is more of a profiling tool. I’m using the command line utility, but its most often used in code, for example around a block or class. Check it out.

The ruby-prof can output its data in a few different formats or profiles. I tried the more colourful ones, which didn’t tell me much:

graph:

this is somewhat useful, but the document was massive  and hard to compare or see where the bottlenecks were. You can see that “require”  -the loading of the libraries was troublesome though.

ruby-prof-warper-graph

colourful stack – not sure what this told me more but it looked pretty.

ruby-prof-warper-stack

Before finally choosing the default option – flat:

here it is on the server

ruby-prof-warper-flat

and compare that with my crappy netbook

ruby-prof-netbook-flat

So it gave a much better diagnostic. It’s slowing right down reading in the libraries, the gems. Now, on both machines, I am using rvm and both are loading from a home directory. But on the server, the home directory was on NFS, not a more local partition!

I copied the rvm directory to a local partition and symlinked things and now the application can start much quicker.

There’s probably more I can do to optimise this – I believe it’s still hitting the NFS in a few places, but it’s given me a good pointer in the right direction to making the wikimaps warper run better and stronger for the future.