Dealing with historical geographic places (Southall et al., 2011) is important in museums, libraries, archives, and media companies, but challenging: 1) Historical places change over time. 2) It is difficult to understand the spatial and temporal context of the places. 3) Historical place names can often be found only on historical maps. 4) Historical geographic data is scattered across multiple sources that can be incomplete and/or mutually conflicting. 5) To preserve semantic interoperability across Cultural Heritage (CH) datasets, there is a need to find out how the same place is represented in different repositories. 6) If a place is nowhere to be found, which is quite common, there should be a mechanism to suggest and share new place concepts among the CH community.
To tackle these challenges, we have developed a Linked Open Data brokering service model, HIPLA, for using and maintaining historical place gazetteers and maps based on distributed SPARQL endpoints. Using Linked Data technologies, HIPLA provides a common search interface to historical geographic data such as place names with coordinates and historical maps. Contextual information, e.g. historical events or photographs related to a geographic location, is provided to help the user gain a deeper understanding of the historical place. HIPLA also serves as a sustainable and evolving repository of historical places by implementing Dynamic Ontology Services for Evolving Ontologies (Hyvönen et al., 2015). Cultural Heritage organizations can connect their legacy cataloguing systems to HIPLA using a widget or an API, in the same vein as in the ONKI ontology service (Tuominen et al., 2009).
The general HIPLA model is being implemented to create and manage a national-level gazetteer and map service, Hipla.fi. Hipla.fi is based on four Finnish datasets in SPARQL endpoints totalling some 840,000 geocoded places, on 450 historical maps from two atlas series aligned on modern maps, and on the Getty Thesaurus of Geographic Names (TGN) SPARQL endpoint in the US.
This paper first presents Hipla.fi’s user groups (section 2) and the end-user interface (section 3), complementing the crowdsourcing view of the system (Hyvönen et al., 2015). Then the system architecture is outlined (section 4), and finally lessons learned are discussed (section 5). Hipla.fi is available at http://hipla.fi.
The audiences of HIPLA are 1) collaborative geo-ontology developers, 2) cataloguers of historical content, 3) information searchers, and 4) application developers. For group 1 HIPLA facilitates a sustainable model for aggregating historical place names in shared data repositories as time goes by. For groups 2 and 3 HIPLA provides a combination of historical and contemporary maps, linked contextual data, and semantic federated search to find and understand historical places. User group 4 can utilize distributed SPARQL endpoints, URI resolving services, and an autocompletion text search widget.
Our first focus in developing Hipla.fi has been on modeling, storing, and searching Finnish place names in multiple SPARQL endpoints, and on displaying them on historical and contemporary maps. The datasets used are stored in separate RDF graphs, which makes it possible to offer dynamic selection of data sources for the user interface or external data consumers. Table 1 presents the datasets currently connected to Hipla.fi, most of them available on the Linked Data Finland platform 1 (Hyvönen et al., 2014).
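As an illustration of how separate RDF graphs allow dynamic selection of data sources, the following minimal sketch builds a prefix-search SPARQL query restricted to the graphs a user has ticked. The graph URIs, the dataset keys, and the use of `skos:prefLabel` for place names are assumptions made for the example; the text does not specify the actual Hipla.fi graph names or data model.

```python
from typing import Iterable

# Hypothetical graph URIs: the real Hipla.fi graph names are not given in the text.
GRAPHS = {
    "municipalities": "http://example.org/graph/municipalities-1939-44",
    "karelian": "http://example.org/graph/karelian-map-names",
}


def escape_literal(text: str) -> str:
    """Escape backslashes and double quotes for use inside a SPARQL string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def build_prefix_query(prefix: str, selected: Iterable[str], limit: int = 20) -> str:
    """Build a case-insensitive prefix search limited to the selected named graphs."""
    graph_uris = [GRAPHS[key] for key in selected]
    if not graph_uris:
        raise ValueError("select at least one dataset")
    values = " ".join(f"<{uri}>" for uri in graph_uris)
    needle = escape_literal(prefix.lower())
    # Assumption: place names are modelled with skos:prefLabel.
    return f"""PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
SELECT ?place ?label ?g WHERE {{
  VALUES ?g {{ {values} }}
  GRAPH ?g {{ ?place skos:prefLabel ?label . }}
  FILTER(STRSTARTS(LCASE(STR(?label)), "{needle}"))
}}
ORDER BY ?label
LIMIT {int(limit)}"""


if __name__ == "__main__":
    print(build_prefix_query("hel", ["municipalities", "karelian"]))
```

Binding the graph with `VALUES` keeps the query text identical apart from the list of graphs, so adding or removing a data source only changes one line of the generated query.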
Figure 1 depicts the Hipla.fi user interface. For finding, disambiguating, and examining historical places, there is an autocompletion search input field (a). Place names can be searched from multiple SPARQL endpoints at the same time based on the user's choice (checkboxes above (b)) with the following functionalities:
Dataset | Original source | Place type | Size | Description |
Finnish Municipalities 1939–44 | National Archives of Finland | municipality | 612 | The Finnish National Archives research project “Finland, prisoners of war and extraditions 1939–1955” produced a map application, from which the wartime municipalities were obtained. |
Karelian map names 1922–44 | Jyrki Tiittanen / National Land Survey of Finland | village, house, etc. | 34 938 | Historical places in the Karelia region of Finland and Russia. |
Finnish Spatio-Temporal Ontology | SeCo | municipality | 1 261 | A spatio-temporal ontology of Finnish municipalities. |
Finnish Geographic Names Registry | National Land Survey of Finland | 61 place types | 800 000 | The place name dataset comprises natural and cultural names whose spelling has been checked by the Institute for the Languages of Finland. |
The Getty Thesaurus of Geographic Names | J. Paul Getty Trust | 1800 place types | 2 156 896 | TGN is a structured vocabulary containing names and other information about places. Names for a place may include names in the vernacular language, English, other languages, historical names, and names in natural order and inverted order. Among these names, one is flagged as the preferred name. |
Senate atlas | National Archives of Finland | map | 414 | Series of maps of Southern Finland drawn by the Russian Army topographic troops at the end of the 19th and the beginning of the 20th centuries at a scale of 1:21 000. |
Karelian maps | National Land Survey of Finland | map | 47 | The National Board of Survey and Topografikunta produced four-colour topographic maps at a scale of 1:100 000 during 1928–1951. |
If the user does not know the name of the place, but has some idea where the place is located, she can pan and zoom the map view to the area. After this it’s possible to use the “View all places on current map view” button. This way places from different datasets connected to Hipla.fi are rendered on the map, and the user can check if the place exists already in some of the datasets, and compare places in different gazetteers.
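A "view all places on the current map view" request can be pictured as a bounding-box query against each selected endpoint. The sketch below is only an illustration: it assumes that places carry `skos:prefLabel` names and W3C Basic Geo `wgs84:lat` / `wgs84:long` coordinates, and that the view does not cross the antimeridian. None of these details are stated in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BBox:
    """Map view in decimal degrees (assumed not to cross the antimeridian)."""
    south: float
    west: float
    north: float
    east: float


def build_bbox_query(box: BBox, limit: int = 500) -> str:
    """Build a SPARQL query for places whose point coordinates fall inside the view."""
    if box.south > box.north or box.west > box.east:
        raise ValueError("bounding box corners are in the wrong order")
    # Assumption: coordinates are stored as wgs84:lat / wgs84:long literals.
    return f"""PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
PREFIX wgs84: <http://www.w3.org/2003/01/geo/wgs84_pos#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>
SELECT ?place ?label ?lat ?long WHERE {{
  ?place skos:prefLabel ?label ;
         wgs84:lat ?lat ;
         wgs84:long ?long .
  FILTER(xsd:decimal(?lat) >= {box.south} && xsd:decimal(?lat) <= {box.north} &&
         xsd:decimal(?long) >= {box.west} && xsd:decimal(?long) <= {box.east})
}}
LIMIT {int(limit)}"""


if __name__ == "__main__":
    print(build_bbox_query(BBox(south=60.0, west=24.5, north=60.5, east=25.5)))
```

Running the same query against every selected dataset and plotting the results together is what lets a user check whether a place already exists in one gazetteer and compare it with entries in another.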
The "Historical maps" tab (Figure 1 (b)) provides a list of old maps that intersect the current map view. The map images are fetched from Hipla.fi's Map Warper service 2, and their metadata is queried with SPARQL from the map RDF graph of the HIPLA service. Each map has a checkbox for rendering the map on the main map view, a thumbnail image, information about map series, scale and type, and a link to view the map in Map Warper. All map series are visible by default, but the series button can be used to filter the maps by series. Once one or more historical maps have been selected with the checkboxes, their opacity can be controlled with the slider located in the top right corner of the map. If the user pans or zooms the main map view, clicking the "Refresh map list" button updates the map list.
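The map list logic described above amounts to a rectangle-overlap test followed by an optional series filter. The following sketch shows one way to express it; the `Extent` and `HistoricalMap` types and the sheet titles are hypothetical, and the actual service may compute intersections differently (for example in the SPARQL query itself).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Extent:
    """Rectangular extent in decimal degrees."""
    south: float
    west: float
    north: float
    east: float


@dataclass(frozen=True)
class HistoricalMap:
    title: str      # hypothetical sheet title
    series: str     # e.g. "Senate atlas" or "Karelian maps"
    extent: Extent  # footprint of the georectified map


def intersects(a: Extent, b: Extent) -> bool:
    """True if the two rectangles overlap or touch."""
    return (a.west <= b.east and b.west <= a.east
            and a.south <= b.north and b.south <= a.north)


def maps_in_view(maps: List[HistoricalMap], view: Extent,
                 series: Optional[str] = None) -> List[HistoricalMap]:
    """Maps whose footprint intersects the view, optionally limited to one series."""
    return [m for m in maps
            if intersects(m.extent, view) and (series is None or m.series == series)]


if __name__ == "__main__":
    sheets = [
        HistoricalMap("Sheet A", "Senate atlas", Extent(60.0, 24.0, 60.5, 25.0)),
        HistoricalMap("Sheet B", "Karelian maps", Extent(61.0, 29.0, 62.0, 31.0)),
    ]
    view = Extent(60.2, 24.5, 60.4, 24.9)
    print([m.title for m in maps_in_view(sheets, view)])
```

Re-running `maps_in_view` with the new view extent corresponds to pressing the "Refresh map list" button after panning or zooming.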
When the user selects a place, contextual data (Figure 1 (c)) is provided for connecting the place to other relevant data sources. This functionality is first piloted with the spatial datasets of the WarSampo portal (Hyvönen et al., 2016), providing, e.g. 160 000 historical photos of the Second World War related to the places, and a timeline of historical events. In addition to this, the spatial perspective 3 of the WarSampo portal uses customized Hipla.fi user interface elements to visualize wartime places and their connections to other WarSampo datasets.
The HIPLA model is adaptable to various geographic data models and to both contemporary and historical gazetteers. The only requirement is that the gazetteer is published in a SPARQL endpoint. Because there is no standard for expressing the temporal extent of spatial data, the temporal dimension of gazetteer data can be utilized in the user interface (e.g. when disambiguating place names) through individual, per-gazetteer configurations.
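One way to picture such individual configurations is a small record per gazetteer that names the endpoint and, where available, the properties holding the start and end of a place's existence. Everything in this sketch is hypothetical: the `GazetteerConfig` fields, the example property URIs, and the label format are illustrations, not the configuration format used by Hipla.fi.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class GazetteerConfig:
    """Hypothetical per-gazetteer settings for a HIPLA-like client."""
    name: str
    endpoint: str
    label_property: str
    begin_property: Optional[str] = None  # property for the start of the time span, if any
    end_property: Optional[str] = None    # property for the end of the time span, if any


def temporal_pattern(cfg: GazetteerConfig) -> str:
    """Return OPTIONAL SPARQL patterns for the time span, or '' if none is configured."""
    parts = []
    if cfg.begin_property:
        parts.append(f"OPTIONAL {{ ?place <{cfg.begin_property}> ?begin . }}")
    if cfg.end_property:
        parts.append(f"OPTIONAL {{ ?place <{cfg.end_property}> ?end . }}")
    return "\n".join(parts)


def disambiguation_label(name: str, begin: Optional[str] = None,
                         end: Optional[str] = None) -> str:
    """Append a time span to a place name when the gazetteer provides one."""
    if begin is None and end is None:
        return name
    return f"{name} ({begin or '?'}–{end or '?'})"


if __name__ == "__main__":
    cfg = GazetteerConfig(
        name="example-gazetteer",
        endpoint="http://example.org/sparql",
        label_property="http://www.w3.org/2004/02/skos/core#prefLabel",
        begin_property="http://example.org/schema/begin",
        end_property="http://example.org/schema/end",
    )
    print(temporal_pattern(cfg))
    print(disambiguation_label("Placename", "1922", "1944"))
```

A gazetteer without temporal data simply leaves both properties unset, in which case no temporal pattern is added to the query and names are shown without a time span.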
Figure 2 depicts the components of the HIPLA model. The Hipla.fi prototype is implemented using the Linked Data Finland platform (Hyvönen et al., 2014), based on Fuseki 4 with a Varnish 5 front end for serving the linked data. The end-user interface of Hipla.fi is a lightweight HTML5 single-page map application, which provides access to multiple data sources with SPARQL queries and autocomplete search functionality using typeahead.js 6. An embedded Google Maps view is used to visualize historical places. Hipla.fi's Map Warper is an instance of the open source Map Warper tool of the New York Public Library for georectifying old maps on top of modern ones.
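To illustrate how a client might combine answers from several SPARQL endpoints into one autocomplete list, the sketch below merges result sets in the standard SPARQL 1.1 Query Results JSON format. The variable names `?place` and `?label`, the dataset keys, and the sorting rule are assumptions for the example; the actual Hipla.fi client is a JavaScript application and may work differently.

```python
from typing import Any, Dict, List


def suggestions_from_results(dataset: str, results: Dict[str, Any]) -> List[Dict[str, str]]:
    """Turn one SPARQL JSON result set into suggestion records."""
    suggestions = []
    for binding in results.get("results", {}).get("bindings", []):
        # Skip rows that lack either of the (assumed) ?place and ?label variables.
        if "place" in binding and "label" in binding:
            suggestions.append({
                "dataset": dataset,
                "uri": binding["place"]["value"],
                "label": binding["label"]["value"],
            })
    return suggestions


def merge_suggestions(per_dataset: Dict[str, Dict[str, Any]],
                      limit: int = 10) -> List[Dict[str, str]]:
    """Merge suggestions from all datasets, sorted by label, keeping the source dataset."""
    merged: List[Dict[str, str]] = []
    for dataset, results in per_dataset.items():
        merged.extend(suggestions_from_results(dataset, results))
    merged.sort(key=lambda s: (s["label"].casefold(), s["dataset"]))
    return merged[:limit]


if __name__ == "__main__":
    sample = {
        "dataset-a": {"head": {"vars": ["place", "label"]}, "results": {"bindings": [
            {"place": {"type": "uri", "value": "http://example.org/a/1"},
             "label": {"type": "literal", "value": "Beta"}},
        ]}},
    }
    print(merge_suggestions(sample))
```

Keeping the dataset name on every suggestion means the same place name from two gazetteers stays distinguishable in the list, which supports the disambiguation use case.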
HIPLA is an ontology library service (d'Aguin and Noy, 2012) for historical places. Complementing traditional gazetteers, HIPLA publishes the data not only for humans but also for machines through the SPARQL endpoint API. In addition, historical maps and contextual linked data about the places are provided.
Thesauri of historical places, published as Linked Data, include the Getty TGN 7 of some 1.5 million records, 'Pelagios: Enable Linked Ancient Geodata In Open Systems' 8, and Pleiades 9. Pelagios and Pleiades are based on crowdsourcing volunteers' work in ontology development. The novelty of HIPLA from a user interface viewpoint lies in the idea of combining multiple geographic data sources, offering a unified view for examining and comparing them. In addition, HIPLA makes it possible to crowdsource the creation of the gazetteer to cataloguers of Cultural Heritage content, as a side effect of their daily work (Hyvönen et al., 2015).
The Historical Gazetteer of England's Place-names 10 is a service of over 4 million names that can be searched and viewed on modern maps as well as on historical ones. HIPLA has a similar local flavor, focusing on places in Finland, but is based on Linked Open Data. OldMapsOnline 11 is a search engine for finding historical maps covering a given area. In contrast to the systems above, HIPLA includes a map service for aligning and viewing georectified historical maps, as in the New York Public Library's Chronology of Place gazetteer 12. HIPLA also publishes the metadata of the historical maps as Linked Open Data, and the dynamic and transparent selection of data sources makes it possible to understand the origins of the data.
Our research was supported by the Finnish Cultural Foundation and Wikidata Finland.