![]()
|
|
|
Abstract This paper outlines a simple strategy for recognising prototypical cases of urban roads given only their geometric forms. Two new indicators, namely exits and occupancy, are used together with average width to label regions as roads. Even in its present simplistic form, the methodology proposed here proved to be useful for identifying prototypical cases. It is equally useful for flagging some roads as untypical. These were labelled as roads by a previous study on road extraction which exploited available semantic information. However, owing to the absence of logical links, some roads became combined with neighbouring regions making them untypical. The recognition strategy therefore provides an additional means for validating topographic data. |
|
1 Introduction This paper arose out of an SERC CASE project (Jan 1990 - Dec 1992) undertaken in collaboration with the Ordnance Survey of Great Britain. It is concerned with the identification and labelling of areal objects given feature-coded vector topographic data. Geographic objects may be identified in a variety of ways. Varley and Visvalingam [1] used the method of extraction, which uses available semantic data. They distinguished extraction from recognition; the latter is based only on the outline forms and juxtapositions of regions of uncut space. Their studyonroad extraction used semantic codes associated with linear features, such as road metalling links, and road centre lines to extract roads and validate the input data. The extracted roads were then used to detect inconsistencies in the data and violations of the data specification. However, the process of extraction is affected by deficiencies and errors in the data. The aim of this study was to establish whether prototypical urban roads had distinctive geometric properties which could be used to provide an independent means for checking the results of extraction. If we were to plot only the lines on 1:1250 Ordnance Survey maps in a single line style in black and white and omit all the area symbolism, such as for roofed areas, map users would still be able to recognise and distinguish various classes of objects from their shapes and context alone. Although no road on an urban map is exactly the Same as another, we would be able to correctly identify most of the roads. Yet, the automatic recognition of urban roads on large scale maps has been a continuing topic of research. The identification and appropriate definition of roads is important to many urban Geographical Information Systems. In the next, background, section we briefly review relevant research in this area. We then describe the ideas underpinning our recognition strategy. Previous studies have attempted to abstract a single set of rules for recognising all roads. Our experience with road extraction [1], indicated that even extraction requires quite different strategies for different categories of roads. Rosch's work [2] on categorisation and cognition encouraged us to concentrate initially on the prototypical cases. In edge matched databases, these prototypical cases may be used to identify further roads by extension of these roads across map sheets, without reference to semantic information. Further study of the prototypical cases may in turn lead to the abstraction of patterns which identify less distinctive cases. We were interested in establishing whether prototypical cases could be identified directly using easily computed metrics. This paper demonstrates that
prototypical roads can be immediately and directly recognised using
three easily calculated metrics, two of which are new to the literature.
The ideas were tested using large scale data for three urban
environments but it needs further testing. The results suggest that a
single pseudo dimension may be sufficient for recognising prototypical
cases. The other two indicators are useful for detecting those cases of
extracted roads which deviate from the mental prototype in some aspects.
It also suggests that these metrics may be used within more complex
rules for recognising less typical cases of roads. |
|
2 Background 2.1 Literature on Road Recognition The term, recognition, has been used quite loosely to describe a variety of quite dissimilar approaches to object identification. This is inevitable given that some studies, such as that by Marr [3] and others in Humphreys [4] are more concerned with understanding human vision while other studies have been oriented instead towards solving practical problems. There are a number of factors which have led to a variety of approaches. These include:
Thus techniques for road recognition
are not universally applicable. In this study we have attempted to
formulate a strategy which is more dependent on the general nature,
rather than precise form, of urban roads.
Only two other British studies have
been based on data digitised from the OS 1:1250 maps although their
digital data conformed to different specifications. Of these, only the
study by De Simone [11] is noteworthy; it provided the inspiration for
this work. De Simone devised an elaborate staged strategy for
recognising objects in three groups of topographic entities, namely
railways, roads and land parcels. Each group was characterised by a
complex of objects which he called superstructures. Since elements of
railway superstructures have the same patterns as those in road
complexes, his strategy required the recognition and elimination of
railway superstructures prior to road recognition; rails are easily
distinguished by their parallel configurations and narrow gauges. |
|
3
Research Strategy Figure 1 shows a sample map. Although this map is devoid of text and symbolism, map users can still pick out the major roads on the map from their forms and juxtaposition vis-a-vis other regions. In this section, we consider how these prototypical urban roads may be described in qualitative and quantitative terms. 3.1 Prototypical Roads The principles underpinning categorisation and classification have been explored by psychologists, linguists and anthropologists in Rosch and Lloyd [13]; some of these ideas continue to influence the literature on Visual Cognition [14]. A category is a set of objects which are equivalent in some respects. A taxonomy is a system by which categories are related to one another by class inclusion. Categories and their members are distinguished from other categories by a set of defining characteristics. Rosch [2] suggested that basic categories are the categories that best mirror the correlational structure of the environment and that objects are first seen and recognised as members of their basic category and that only with the aid of additional processing can they be identified as members of a super- or sub- class. The computational models for road recognition normally place urban roads at a basic level of the taxonomy as distinct from others, such as buildings. We differentiate basic objects by their perceptual and functional attributes. At a higher level, roads may belong to a superclass of linear networks which would include other basic objects, such as rivers. Superclasses generally share fewer attributes. Thus, De Simone was able to differentiate between railways and roads according to their width. At a lower level, we can sub-categorise roads on several criteria. Motorways, freeways and rural roads differ from urban roads. Equally, roads in newer planned settlements and suburbs have a different profile compared with those in the core of historic towns, such as Canterbury. Although objects are often seen as distinctly different in their geographical context, they are not necessarily discontinuous or clustered in statistical space. Since the categories in the taxonomy seldom have clear-cut boundaries in this property space, it is difficult to recognise them by a formal set of rules. Rosch [2] noted that we appear to circumvent this problem by thinking of each category in terms of its clear, or prototypical, cases rather than in terms of its statistical boundaries. People tend to easily agree on whether a case belongs to a category even if there is some disagreement over the precise location of these boundaries. Rosch's experimental studies also indicated that learning, recognition, recall and classification were more easily accomplished through the use of prototypical instances of classes. Past research on road recognition has not sought to distinguish between prototypical and less typical cases but have attempted to formulate a single set of rules to identify all cases of urban roads. Varley and Visvalingam found it necessary to distinguish between trivial and more tricky cases when formulating algorithms for road extraction. Even map users can recognise only some roads instantly; a greater degree of cognitive effort is needed to resolve others. It was quite clear that some of the problems facing extraction would also affect recognition. We therefore decided to focus initially on prototypical cases of roads. Before we can recognise prototypical cases, we need to specify what we mean by them. When we think of urban roads we think of arterial branched networks. No doubt, there are many other networked objects since branching is a quality of the superclass of linear networked objects. However, roads are recent superimpositions on the urban scene and they often cross over waterways and railways. Roads therefore tend to segment other objects and form the most extensive networks connecting intra- and inter-urban spaces. Roads also penetrate urban space and extend beyond the map sheet. The mental models we use for holistic recognition of urban roads would be different to those we would adopt for recognising rural roads or for incremental tracking of motorways. This suggests that urban roads form a separate basic category. The Department of Transport classification of roads usually applies to component parts of a connected network and additional processing is needed to identify these sub-categories. Within the basic category, the difficult cases would tend to be those which do not match our mental model of urban roads, as seen later. 3.2 Global Dimensions The next problem was to determine the characteristic properties of prototypical cases. We pursued some context based analysis initially but found it difficult to recognise even fairly prominent roads. This was partly because the data specification does not require that all objects are defined by closed polygons. Consequently, several objects including roads, pavements, car parks and fields can become amalgamated with some of their neighbours. De Simone manually edited his maps to separate these features. It must be accepted that mass digitised data is likely to exhibit this problem since the boundaries between these objects are often conceptual rather than physical and may not be indicated on the visual map. Manual digitisation of these logical links is unreliable. Moreover, if the input to the recognition process consists of auto-vectorised scanned images, they are likely to be missing. We were keen to establish whether the main arterial roads could be recognised despite the absence of these logical links. The long term intention was to devise procedures for automatically separating these objects and truncating minor features on roads as part of the research on map generalisation and scale free mapping [15]. Since contextual information could not be determined reliably at this stage of the analysis, we decided to explore the attributes of objects instead. Garner [16] made a distinction between features and dimensions. He used the term, feature, to refer to some distinguishing component of an object which may be either present or absent. For example, the presence or absence of a single line can serve to distinguish an O from a Q. Garner proposed that the visual system prefers feature descriptions in many information processing tasks. Suzuki et al [10] and De Simone [11] incorporated features, such as right angled corners and parallel sides, in their recognition systems. However, the identification of in-line features is computationally demanding. In his formal definition of terms, Garner differentiated between these optional features and dimensions. He defined dimensions as potentially variable properties of components which are always present. Objects also have global dimensions. Garner was aware that aspects of stimuli could be studied using either dimensions or features. For example, the letters E and F are distinguished both by the dimension, number of horizontal lines, and by the presence or absence of the lowest horizontal line. Some features can be tracked more easily through such pseudo-dimensions. The choice of the term, dimensions, to denote these intrinsic aspects of an object is somewhat confusing since the word dimension has specific meanings which are different in physics and mathematics. We used the word to imply the phrase, global dimension, which refers to the intrinsic properties of the object. The study reported in this paper was concerned with assessing whether easily computable global dimensions enable direct recognition of prototypical urban roads and reveal the defining aspects or essence of these roads. Since large scale maps depict roads by their boundaries rather than by stylised conventions, a single dimension, such as width at the map edge, is inherently unreliable. In his textual analysis, De Simone noted that roads were extensive but his recognition strategy did not quantify this adequately. He therefore had to use contextual information, such as adjacency, full and partial containment, line continuation, alignment of shapes and shape combination. The problem in recognition is one of identifying and quantifying the definitive aspects or the essence of the class. We investigated the following dimensions.
The first four dimensions were also
used by De Simone; whereas De Simone attempted to measure these
dimensions as accurately as possible, we were only concerned with
quantifying them as crude indicators. We introduced the last three
indicators; exits and occupancy, in
particular, were designed to capture the intrinsic nature, rather than
precise form, of roads. Except where a section of a road becomes
truncated by overhead features, such as bridges, roads by their very
nature will eventually connect with those on adjacent sheets at the map
edge. The feasibility study on road extraction initially selected only
those regions with at least one map edge link as candidate regions since
urban roads tend to extend beyond a single map sheet. Indeed, in the 35
sheets we have studied to-date, there were only 2 detached sections of
road. This led to the insight that arterial roads have a number of exits
(or intersections with an imposed window) and that this could be used as
one indicator. As a corollary, an arterial urban road would tend to
extend over a large part of the map. To test this we included the
envelope or bounding box of the region as another indicator. This is not
entirely reliable. Since urban roads will branch out in several
directions and will segment other objects, it was worth considering.
Although it proved to be unreliable on its own, it led to the inclusion
of a third dimension, occupancy. With an extensive and branched object,
as gauged by exits, occupancy is indicative of the netted character of
roads. |
|
4 Observations In this section we describe the strategy for visual analysis of statistics which was used to identify the key dimensions of prototypical roads and their Gut¬off values. 4.1 Key Dimensions There are 5277 regions with at least one exit. Of these, 100 are roads (see Table 1). Figure 2 only shows the 183 regions (3% of all regions) with three or more exits; these include 44 roads. This selection was justified on the grounds that arterial roads are likely to have more than 2 exits in a 500m x 500m area. Three exits is also a definitive proof of the existence of at least one branch. Furthermore, these 44 road regions account for over 93% of the area of all roads. The bulk of the remaining cases with one or two exits do not exemplify a mental model of urban roads. They are segments resulting from the superimposition of a viewing window on the urban scene, in this case the map sheet boundary; most of these are very small segments which just intrude into the map. The road regions with zero exits are those which have become detached by overhead features.
Table 1: Counts of regions with at least one exit. Figure 2 shows the univariate
distributions of 6 of the global dimensions for roads, their neighbours
and other objects as labelled by Varley and Visvalingam [1]. In each
diagram, the regions in that class are sorted on the depicted dimension.
The point symbols record the values for the regions arranged in
ascending numeric order along the x-axis. The same y-scale is used to
facilitate comparisons of values across the graphs for all categories.
These plots show that area and shape (Figures 2a and 2b) are the least
useful for constraining the search. Figures 2c and 2d indicate that
average width and occupancy are useful for eliminating unlikely
candidates. Although both average width and shape are based on area and
perimeter, average width is a better measure which also has the
advantage of having a physical meaning relating to common knowledge.
Perimeter, envelope and exits have similar univariate distributions and
are all useful for distinguishing the larger roads. Perimeter is less
helpful since it is difficult to select statistical boundaries on a
priori grounds. An envelope of 250000 metres sq, covering the map sheet,
is particularly distinctive since roads are likely to segment other
objects (Figure 2e). However, with one notable error of commission,
exits is equally discriminating (Figure 2f). Since envelope is only a
crude indicator, exits was favoured as a more meaningful and reliable
dimension for focusing on the most distinctive cases.
4.2 Selection of Cut off
Values
Figure 3a also shows that there are some roads with very narrow average widths for extensive roads. The cut off of 5.6 metres is arbitrary but it illustrates that the average width dimension may also be used to locate other types of composite objects. For example, the light grey region in Figure 5a is a road with an average width of only 4.3m and it is quite clear that it has merged with pavements and paths; the same is true of the road in Figure 5b. Unusually high values for occupancy appear to be due either to structuring errors, giving rise to convoluted regions, or to roads which have merged with fields and car parks. Low average widths indicate merging with adjoining pavements, back lanes and paths in the urban rural fringe. Some of these merged regions can be automatically detected using simple topological rules. Given the data structures in DAM [12], a link which has the same boundary on either side, flags an error [1]. However, this will only identify some inconsistencies in the data. The above checks on prototypical attributes help to locate other cases.
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
5 Conclusion This paper has made a number of contributions. First, it has conceptualised the form of urban roads in terms of their context and functions. Their transport and access functions specify an `arterial' networked form. In the current state of transport development, roads appear to be imposed on other urban objects to link and connect urban spaces in an extensive way. Roads serve urban spaces without swamping them. Second, this paper has shown that given topologically structured data, it is possible to recognise prototypical urban roads simply and efficiently through use of novel global dimensions. The conception of dimensions involved a great deal of mental visualisation aided by data visualization with the latter serving the former. The key characteristics which seem to typify urban roads are extent, netness, low occupancy and width of roads. Third, the paper has shown that these characteristics may be quantified as three global dimensions of regions, namely average width, exits and occupancy. Two of these indicators are new to the literature an road recognition. They are exits and occupancy. Exits is a pseudo-dimension since its main function is to detect the presence or absence of features, such as branches and connections with external spaces. They are both dimensionless variables. Unlike average width, they should be applicable in a wide range of urban environments since they are indicative of extensive networks, rather than measures of the precise size and form of roads. These dimensions are easily calculated using data normally held in attribute tables by GIS without accessing co-ordinates on disc. Fourth, the paper has suggested that exits alone is sufficient for identifying extensive arterial roads with more than 9 exits. The other dimensions served to pick out deficiencies in data or roads which had become combined with neighbouring regions. All three dimensions have to be used to distinguish between roads and other objects with three to nine exits. Fifth, it has shown that the method is sensitive to data conditions and that it is capable of picking out untypical cases for investigation. With one exception, the untypical cases of extracted roads are either not roads or are amalgams of objects. The method is thus useful both for flagging the absence of logical links in road boundaries and for locating some residual structural and semantic errors. This method of recognition is more reliable than extraction. It correctly rejected objects, such as pavements, which had been extracted as roads since they contained erroneous road metalling links. This suggests that the process of road extraction must be even more intelligent than that described by Varley and Visvalingam [1]. Finally, it has demonstrated the value of visualization showing how visual and statistical summaries may be used in conjunction with maps to assist in the processes of ideation and verification as suggested by Muehrcke [17]. In conclusion, we would like to stress
the following. The recognition of prototypical roads forms only part of
a programme of study on spatial data models and algorithms for
topographic data and just one of the methods used for identifying roads
and validating data. The ideas presented have only been tested with
link-and-node structured data for British urban areas as modelled on
Ordnance Survey 1:1250 maps. It must also be remembered that two of the
three dimensions, namely exits and occupancy, are both dependent on the
dimensions of the viewing window. The map sheet boundary provides a
convenient window. It can be varied to enhance recognition but it should
not be so small that it no longer measures the extensive, arterial
nature of urban road nets. Neither should it be so large that it misses
the urban environment and intersects inter- urban highways instead.
Equally, rectilinear windows may not be optimal for detecting grid
patterned roads. |
|
6
Acknowledgements We would like to thank the UK Science and Engineering Council for award of a CASE studentship to Dominic Varley and a quota award to Chris Wright. We are also grateful to the Ordnance Survey of Great Britain (OS), the collaborating body on the SERC CASE project, for providing access to their digital topographic data. We are particularly grateful to John Farrow, formerly of the OS for his support and encouragement. |
|
References [1] D.A.Varley and M.Visvalingam, Road Extraction and Topographic Data Validation Using Area Topology, Computer J., 37(1), 1994, pp. 3-15. [2] E.Rosch, Principles of Categorization, in Cognition and Categorization,(E.Rosch and B.B.Lloyd Eds.), pp. 27 48, Lawrence Erlbaum Associates, 1978. [3] D.Marr, Vision, W.H.Freeman and Company, 1982. [4] G.W.Humphreys (Ed.), Understanding Vision: An Interdiseiplinary Perspective, Blackwell, 1992. [5] P.Suetens, P.Fua and A.J.Hanson, Computational Strategies for Object Recognition, ACM Computing Surveys, 24(1), 1992, pp. 5-61. [6] D.H.McKeown and J.L.Denlinger, Cooperative Methods for Road Tracking in Aerial Imagery, in Proceedings, IEEE Conf. CVPR '88, 1988, pp. 662-672. [7] L.H.Quam, Road Tracking and Anomaly Detection in Aerial Imagery (SRI International AI Center Tech. Note 158), in Proceedings, ARPA Image Understanding Workshop, 1978, pp. 51-55. [8] Z.Zhu and Y.Kim, Algorithm for Automatic Road Recognition an Digitized Map Images, Optical Engineering, 28(9), 1989, pp. 949-954. [9] T.Nagao, T.Agui and N.Masayuki, An Automatic Road Vector Extraction Method From Maps, in Proceedings, 9th Int. Conf. Pattern Recognition, 1, 1988, pp. 585-587. [10] S.Suzuki and T.Yamada, MARIS: Map Recognition Input System, Pattern Recognition, 23(8), 1990, pp. 919-933. [11] M.D.De Simone, Data Structures and Feature Recognition: From the Graphic Map to a Digital Data Base (Unpublished thesis), N.E. London Polytechnic, 1985. [12J G.H.Kirby, M.Visvalingam and P.Wade, Recognition and Representation of a Hierarchy of Polygons with Holes, Computer J., 32(6), 1989, pp. 554-562. [13] E.Rosch and B.B.Lloyd (Eds), Cognition and Categorization, Lawrence Erlbaum Associates, 1978. [14] G.W.Humphreys and V.Bruce, Visual Cognition: Computational, Experimental, and Neuropsychological Perspectives, Lawrence Erlbaum Associates, 1989. [15]
M.Visvalingam and P.Williamson, Generalising Roads an Large Scale Maps:
A Comparison of Two Algorithms, (CISRG Discussion Paper 13),
University of Hull, 1994. |
| © Dr Mahes Visvalingam, University of Hull, Uploaded September 2005 |
Cartographic Information Systems Research Group, University of Hull