It is impossible to prove
quantitatively that the presence of digitising errors can be ignored
since the results would be dependent upon the selected line
configurations. We can however prove the converse, namely that
digitising errors impair the performance of this algorithm. For
example, in Figure 7a it can be seen that the algorithm results in the
choice of C rather than D. Since all points are subject to digitising
error, point D lying within 5 metres of C is an equally valid but
perceptually more significant point. In scale related generalisations,
which conceal the inadequacies of the algorithm to some extent, the
rigid use of the maximum offset is acceptable only at the two extreme
levels of generalisation. In minimal simplifications, there is a high
probability that both points will be included. In very small scale
displays, the absolute position of the point is irrelevant. At
intermediate levels, the choice could matter, as the point C once
selected is retained at more detailed levels. The adverse implications
of this were discussed elsewhere (Visvalingam and Whyatt, 1990). It is
sufficient to re state here that the retention of C leads to the non
selection of D even when 40% of points are retained. As a result, the
algorithm can exhibit a known weakness of the n'th point method,
namely a tendency for cutting perceptually important corners (Figure
8). Also as shown in Figure 7 some candidates communicate very much
less visual information and appear to be more dispensable than others.
This makes the algorithm particularly unsuitable for scale independent
generalisation. Jenks (1979) was justified in being dissappointed with
the method although he thought that it might have been due to some
peculiarity in his version (implementation) of the algorithm; he was
probably right in both respects.
The selection of relatively unimportant pointsonthe basis of numerical
distances not only prejudices the selection of visually more important
ones, but it also means that the algorithm is unnecessarily
extravagant it uses more points than necessary to represent lines.
This property of the algorithm was noted by Ramer (1972), who was
concerned with the approximation of arbitrary 2D curves by polygons.
Researchers before him had pursued the ideal objective of representing
lines and boundaries by polygons satisfying a given fit criterion,
using a minimum number of vertices. Ramer observed that a fit
criterion of the maximum distance from the curve to the approximating
polygon does not satisfy the ideal objective of locating a minimum
number of vertices.
Duda and Hart (1973) noted that this algorithm is strongly influenced
by individual points and that a single 'wild' point can drastically
change the final result. They stressed that many of the heuristics
used in image processing and pattern recognition are not dignified by
much supporting theory and that they must be used judiciously. They
advised that the use of this particular heuristic should be restricted
to data that are initially error free. Some researchers (for example,
Jones and Abraham, 1987 and McMaster, 1989) have incorrectly assumed
that weeding and/or smoothing remove digitising errors. Weeding cannot
make the retained points more accurate; and smoothing can blur the
distinctive features of the line.
4.
Discussion
Researchers in cartography and
pattern recognition have used Attneave's (1954) famous caricature of a
sleeping cat to illustrate the concept of "information loaded"
critical points. Attneave proposed that people perceive points of high
curvature along lines as perceptually significant and high in
information content. Despite Whites (1983) observation that there is
only a 45% overlap between these critical points and points with
maximum offsets, others have continued to assume that the Douglas
Peucker method can be used to define a hierarchy of critical points.
This assumption is questioned here and elsewhere (Visvalingam and
Whyatt, 1990). The method can be shown to select non critical points
and miss critical ones and thereby distort the shape of features.
No doubt all generalisations are inaccurate in some respects but this
algorithm can never approximate the performance of skilled
cartographers. Does this matter? This depends upon the purpose of
research. Basic research seeks to develop knowledge and understanding.
The discipline of cartography should seek to understand cartographic
proceses and the cartographer's skills in meaningful and explicit
terms so that we have a good grasp of the utility and limitations of
our knowledge, techniques and data. The continued promotion of the
Douglas Peucker algorithm by leading researchers stifles innovation
and creativity, especially by the young. What is more disconcerting is
that this algorithm has already inspired and has become a primitive
within secondary spatial analysis and the design of scale independent
databases. Further extensions to the algorithm are also advocated. For
example, Goodchild (1988) after considering issues relating to the
accuracy of spatial databases expressed in a separate section that
many of the standard methods for planar spatial analysis, including
the Douglas Peucker line generalisation algorithm, have yet to be
adapted to the spherical global context. Those inclined to do so
should at least recognise the problems of implementation and resolve
them in some rational manner. Even then, the Douglas Peucker algorithm
cannot provide more than a partial and shaky foundation for R & D in
line generalisation for it is difficult to envisage how we could
standardise the implementation of the algorithm in a meaningful and
universally applicable manner. There is also a need to accommodate
digitising errors in cartographically meaningful terms.
5. Conclusion
In this paper we used
the widely known Douglas-Peucker algorithm
to focus attention on the lack of rigour in
the expression, interpretation, implementation and evaluation of
cartographic software. We also demonstrated that measurement errors
can adversely influence the intended effect of such simple algorithms,
couched solely in geometric terms. Such
algorithms are neither capable of emulating the skills of the
cartographer nor are they helpful for evolving theories about
cartographic processes. Automatic line simplification remains one of
the outstanding challenges in digital cartography.
Rather than reiterate earlier observations and discussions, we wish to
conclude this paperonthe wider implications of this study. Our
research has been greatly facilitated by the past practice of detailed
publication of research methods; and, access by other means not just
to algorithms but also their implementations. No doubt those committed
to the advancement of knowledge will continue to exchange details of
their experimental design and observations (even if they are unable to
provide input data provided by research sponsors) so that they can
check each other's reasoning and conclusions to mutual benefit.
Researchers in computational geometry have pointed out that much
spatial software is erectedonshaky foundations. Digital cartography
buildsoncomputational geometry and computer graphics and Geographical
Information Systems in turn embody the academic output of these
contributing disciplines within their structures. We hope that this
paper has demonstrated in a small way the need for maintaining open
and public discussion of the knowledge, techniques and data which
underpin the development and use of modern information systems, such
as GIS.
Acknowledgments
We are grateful to the Science and Engineering Research Council for
the Quota Studentship held by Duncan Whyatt. We wish to thank Phil
Wade of the Cartographic Information Systems Research Group (CISRG) of
the University of Hull for his implementation of the Douglas Peucker
algorithm, Mike Blakemore of the Geography Department University of
Durham for the source of the original implementation by Douglas, and
Drs Graham Kirby, Mike Turner of the CISRG for their critical
commentsonthe paper. Some of the material in this publication has been
produced using data or other material relating to the digitised
boundary information and these data or other material remain the
property and copyright of the Crown.
References
Attneave, F. (1954) "Some informational
aspects of visual perception", Psychological Review 61 (3)
183-193.
Blakemore, M. (1984) "Generalisation and error in spatial data bases",
Cartographica 21 (2 & 3) 131-139.
Buttenfield, B.P. (1986) "Digital definitions of scale dependent line
structure", in M. Blakemore (ed) Proc. Auto Carto London (ICA,
1986), 497-507.
Douglas, D. H. and Peucker, T. K. (1973) "Algorithms for the reduction
of the number of points required to represent a digitized line or its
caricature", Canadian Cartographer 10 (2) 112-122.
Douglas, D. H. (1975) Collected Algorithms, Laboratory for Computer
Graphics and Spatial Analysis, Harvard University, Cambridge,
Massachusetts.
Duda, R. 0. and Hart, P. E. (1973) Pattern Classification and Scene
Analysis, (Wiley, NY), 337-339.
Forrest, A. R. (1985) "Computational geometry in practice", in R A
Earnshaw (ed.) Fundamental Algorithms for Computer Graphics ,
NATO ASI Series F17 (Springer Verlag, Berlin), 707-724.
Goodchild, M. F. (1988) "The issue of accuracy in global databases",
in Mounsey, H. (ed.) Building Databases for Global Science
(Taylor & Francis, Basingstoke), 31-48.
Jenks, G. F. (1979) "Thoughtsonline generalisation", in R T
Aangeenbrug (ed.) Auto Carto 4 : Proc. of Int.
Symp.onCartography and Computing : Applications in Health and
Environment (American CongressonSurveying and Mapping),
209-220.
Jenks, G. F. (1981) "Lines, computers, and human frailties', Annals
of the Association of American Geographers 71 (1) 1-10.
Jones, C. B. and Abraham, I. M. (1987) "Line generalization in a
global cartographic database", Cartographica 24 (3) 32-45.
McMaster, R. B. (1987a) "Automated line generalization',
Cartographica 24 (2) 74-111.
McMaster, R. B. (1987b) "The geometric properties of numerical
generalization", Geographical Analysis 19 (4) 330-346.
Muller, J. C. (1987) "Fractal and automated line generalization',
Cartographic Journal 24 27-34.
Perkal, J. (1966) "An attempt at objective generalization', Translated
by W. Jackowski from Julian Perkal, Proba obiektywnej generalizacji.
Geodezia es Kartografia, Tom VII, Zeszyt 2, 1958, 130 142. In J.
Nystuen (ed)
Michigan Inter University Community of Mathematical Geographers,
Discussion Paper 10. Ann Arbor, University of Michigan.
Peucker, T. K. (1975) "A theory of the cartographic line', Proc.
Auto Carto 2, Reston, Virgina 508 518.
Ramer, U. (1972) "An iterative procedure for the polygonal
approximation of plane curves", Computer Graphics and Image
Processing 1, 244-256.
Raper, J. and Green, N. (1989) "GIST! The Geographical Information
Systems Tutor", Burisa 89 (July) 16.
Visvalingam, M. and Whyatt, J.D. (1990) "The Douglas Peucker algorithm
for line simplification: re evaluation through visualisation",
Computer Graphics Forum 9 (3) 213 228.
Waugh, T.C. and McCalden, J. (1983) GIMMS Reference Manual (4.5),
Gimms Ltd, 30 Keir Street, Edinburgh EH3 9E4.
White, E.R. (1983) "Perceptual evaluation of line generalization
algorithms", Unpublished Masters Thesis, University of Oklahoma.
Whyatt, J.D. and Wade, P.R. (1988) "The Douglas Peucker line
simplification algorithm", Society of University Cartographers
Bulletin 22 (1) 17-25.