Evolutionary algorithm-based affine-invariant matching of object shapes from broken boundaries
基於進化算法之破碎物件輪廓綫仿射不變性匹配
Student thesis: Doctoral Thesis
Author(s)
Related Research Unit(s)
Detail(s)
Awarding Institution | |
---|---|
Supervisors/Advisors |
|
Award date | 3 Oct 2011 |
Link(s)
Permanent Link | https://scholars.cityu.edu.hk/en/theses/theses(e6e0645e-f97a-4281-ba0b-15a6f415588b).html |
---|---|
Other link(s) | Links |
Abstract
Viewpoint invariant matching of object shapes is a fundamental but difficult problem
in computer vision. It pertains to the recognition or alignment of shapes that are
subject to certain geometric transformations caused by different viewing positions. In
practice, the images of objects captured by optical means are affected by the
illumination, as well as various kinds of artifacts; the objects often present fragmented,
and more severely, incomplete contours in shape. This imposes further complications
on the problem. This thesis reports the developments of several shape matching
schemes, which can work independently or corporately in identifying object images
that are captured under poor lighting conditions. Specifically, the proposed methods
can be applied in the matching of near-planar object shapes from broken boundaries,
and moreover, under noise contamination. One of the main emphases of my works, is
to maintain a high success rate in shape matching, and at the same time minimizing
the computation time.
For near-planar objects, the matching process can be posed as an optimization
problem in realising an affine transform that yields a best matching score between a
pair of contours. Along this direction, simple genetic algorithms (SGA) and particle
swarm optimization (PSO) have been proven effective. Despite the moderate
successes of these approaches, however, they present erratic performance amongst
different objects, with reduced success rates and prolonged computation times in
some cases. These shortcomings can be attributed to the lack of an initial
population/swarm community that can realise global solutions.
In this thesis, a solution to this problem is presented by integrating PSO with the
migrant principle (MP). The latter is analogous to immigrant policy in real-world
situations; it introduces a continuous influx of foreign candidates to the swarm
community to promote diversity, and therefore, exploration power in the population.
Evaluations show that the method is less sensitive to swarm size and can achieve high
success rates for all test samples based on a small swarm community.
To further enhance the performance, a novel scheme based on contour reconstruction
is also provided. This scheme enables the extraction of a closed outermost boundary
from a set of fragmented object points, and represents this boundary as a
one-dimensional sequence. The similarity between a pair of fragmented boundaries
can then be determined by searching three corresponding point pairs in the
one-dimensional sequences. This reduces the dimensions of the problem to three
(instead of six for the original set of affine parameters). Experimental results show
that the proposed method is considerably faster than previous schemes, and can realise
a high success rate in identifying matched contours. Furthermore, a method known as
labelled chamfer distance transform (LCDT) is proposed to improve computational
efficiency in contour reconstruction. LCDT enables faster generation of the distance
image and correspondence map. Compared with its parent scheme, the proposed
LCDT approach achieves up to an order of magnitude of acceleration in the entire
matching process.
Apart from matching in a noise-free background, an attempt to match shapes under a
noisy setting is made, in which a scheme called successive erosion and distance
accumulation (SEDA) is proposed to alleviate the sensitivity of the distance images to
noise. Experimental evaluation demonstrates the SEDA scheme can achieve a high
success rate in identifying matched contours under moderate amount of noise
contamination.
- Optical pattern recognition, Computer vision, Image processing, Digital techniques