![]()
|
1.
Introduction |
|
|
|
|
|
The Cartographic Information
Systems Research Group (CISRG) of the University of Hull is
researching the automatic recognition of spatial objects based on
their spatial descriptions alone. The feasibility study, reported in
this paper, formed a part of an SERC CASE project (January 1990
January 1993); the collaborating body was the Ordnance Survey of
Great Britain. The project involved the recognition of objects
implied in large scale topographic maps. We define object
recognition as the process of identification of objects from their
forms alone; i.e. without recourse to semantic labels. Unlike
recognition, extraction uses all available information, including
semantic labels manually associated with line segments. This paper
is concerned with the problem of road extraction. The principal aim
of this feasibility study was to assess whether it would be possible
to recognise roads given the structure and content of an
experimental topographic database designed and created by the
Ordnance Survey. The emphasis in the feasibility study was on
acquiring a good understanding of large scale topographic data and
of the data processing problems. Although some attention was paid to
the efficiency of processing, this was not the primary concern at
this stage. Efficient processing of geo-spatial
data is a major topic of research which has to take into account a
wider range of considerations (Frank, 1991). The late 1980s has witnessed a growth in Geographical Information Systems (GIS). In Britain, the Chorley Report (DoE, 1987) has led to the formation of the UK Association for Geographic Information (AGI) and GIS Specialist Groups within many learned and professional societies, including the British Computer Society. The Chorley Report defines a GIS as "a system for capturing, storing, checking, integrating, manipulating, analysing and displaying data which are spatially referenced to the earth" (p 132). It refers to the diversity of users and uses of geo‑referenced data which give rise to different types of GIS (Visvalingam, 1990). In Britain, Land and Property Information Systems (LIS) use large‑scale (1:1250 and 1:2500) topographic data as a base reference against which they record, manage and inter‑relate their own data. The Ordnance Survey of Great Britain (OS) has been responsible for creating the topographic base maps used by numerous bodies, such as Her Majesty's Land Registry, the utilities and Local Authorities. The OS is renowned internationally for its pioneering activities in digital mapping since the early 1970s and its Research and Development Division continues to experiment with alternative database designs to meet changing market needs.
Users of topographic data are ultimately concerned with objects of interest to them. Different applications may link different data with the same objects. Topographic objects are described by spatial and other locational references. A road network, for example, will have multiple polygons which describe its extent and which exclude regions, such as central reservations and roundabouts, which do not form a part of the road. At the application‑level there is growing interest in object‑oriented databases. However, the design of the class structure of objects is application dependent. For example, vehicle routing, highway maintenance, cutting of grass verges and other activities which relate to roads will tend to develop their own object structures. Consequently data vendors, such as the Ordnance Survey, tend to supply only partially structured data and it is the users' responsibility to extract the necessary object descriptions. The vendor may add value to data by making explicit, through a combination of automated and manual processing, some objects implied by the basic topographic data. For example, the OS itself defines some objects, such as vegetation, for its own map production. However, the automatic recognition and labelling of objects encoded in digital topographic maps remains an outstanding problem.
The OS vector topographic data consist of point, line and text features. A semantic label, called a feature code, is associated with each feature. Features are partial descriptions of objects. An object may be described by one or more features and a feature may form a part of several higher‑level features and objects. Object recognition seeks to add value to data by grouping features so that they form higher‑level entities, both features and objects, and by either relating currently free‑standing, or else manually input, information with these derived entities.
In the past, data suppliers have supplied feature‑coded data in unstructured, so‑called spaghetti, form. In spaghetti form, lines are broken when the feature code changes but may otherwise cross each other and themselves. This simple structure is sufficient for semi‑automated map production but it does not facilitate the chaining of the polygons which define areal objects. For this, spaghetti vectors must be structured into a link‑and‑node form. In this structure, lines are not allowed to cross themselves or other features. Where such intersections occur, lines are split forming segments, called links by the Ordnance Survey. All links begin and end at nodes, which establish the connectivity of the links. The process of link‑and‑node structuring is usually automated. The output of this process can contain a few errors such as overshoots, which need to be trimmed, and is thus subject to a cleaning process. The OSBASE data we received was link‑and‑node structured and cleaned.
The feasibility study on road extraction had to achieve the following tasks. It had to:
|
|
|
|
|
|
|
|
|
| © Dr Mahes Visvalingam, University of Hull, Uploaded June 2003 |
Cartographic Information Systems Research Group, University of Hull