Next: Discussion Up: Controlling Asymmetric Errors in Previous: Adapting the Pruning Techniques

Experimental Evaluation

Figure 3: (a) 37,659 lines extracted from SAR image by Burns' edge detector (b) runway found by production net (c) the 20 lines for the construction of the runway (d) 3,281 lines classified as positive by modified NEFCLASS

For our studies we used a set of 17 SAR images depicting 5 different airports. Each of the images was analyzed by the production net to detect the runway(s) and the lines were labeled as positive or negative. 4 of the 17 images form the training dataset used to train NEFCLASS. The training set contains 253 runway lines and 31,330 negatives.

The lines from the remaining 13 images were used as test data. The assessment of the classification results takes into account the specific requirements of the application. The ideal classifier for this problem should find all edges which were used to assemble a runway. Because missing edges can interrupt the construction process and by this, essential intermediate results are missed and the analysis will fail. On the other hand the classifier should assign as few negatives as possible to the positive class to reduce the processing time. The behavior of a classifier could be characterized by a detection and a reduction rate:

In this application the misclassification costs cannot be specified exactly, as the costs of false positives and negatives are barely comparable. We empirically set the costs of false negatives to some hundred times the costs of false positives. This results in a detection rate of on the training set after learning.

The average of the detection and reduction rate on the unseen 13 images is and , respectively. The detection rates on the single images vary from to . Fig. 3d shows the lines NEFCLASS classified as positives in this image (which was one of the unseen images). Apparently this image allows rather high detection and extremely good reduction rates. But also for most of the lower detection rates the higher levels of image analysis were successful, as the missed lines are mainly shorter and less important.

By the use of the modified pruning techniques, the number of rules could be decreased from initially over 500 to under 20. We obtained the best result with 16 rules, but even with 7 rules the error increased only slightly. With a reduced number of rules, the difference between performance on training and test data decreased. This shows that generalization ability increased through pruning.

Next: Discussion Up: Controlling Asymmetric Errors in Previous: Adapting the Pruning Techniques

Aljoscha Klose
Mon Nov 29 17:03:10 MET 1999