Tooth Detection for Missing Tooth Identification (Paper Layout)

Table 1. Test Set Evaluation

Metric YOLOv11n YOLOv11n-aug YOLOv11s YOLOv11s-aug YOLOv8n YOLOv8n-aug YOLOv8s YOLOv8s-aug
TP / FN / FP / TN32/0/2/1631/1/2/1631/1/1/1731/1/1/1732/0/3/1531/1/2/1630/2/3/1531/1/2/16
Accuracy0.96000.94000.96000.96000.94000.94000.90000.9400
Sensitivity (Recall)1.00000.96880.96880.96881.00000.96880.93750.9688
Specificity0.88890.88890.94440.94440.83330.88890.83330.8889
Precision (PPV)0.94120.93940.96880.96880.91430.93940.90910.9394
F1-Score0.96970.95380.96880.96880.95520.95380.92310.9538
NPV1.00000.94120.94440.94441.00000.94120.88240.9412
AUC-ROC0.98440.98780.99310.99130.94100.98960.92530.9635

Table 2. Training Validation Metrics

Metric YOLOv11n YOLOv11n-aug YOLOv11s YOLOv11s-aug YOLOv8n YOLOv8n-aug YOLOv8s YOLOv8s-aug
Precision0.9330.9510.9120.9950.9400.9960.9661.000
Recall0.9510.9550.9550.8640.9550.9090.8640.908
mAP500.9550.9610.9580.9600.9450.9570.9490.971
mAP50-950.6870.6780.7510.7350.6570.6840.6940.716

Figures

Figure 1. Classification metrics comparison across 4 architectures with offline augmentation (higher is better).
Figure 2. Classification metrics comparison across 4 architectures without offline augmentation (higher is better).
Figure 3a. Validation metrics — with augmentation.
Figure 3b. Validation metrics — without augmentation.
Figure 4a. Radar plot — test metrics with augmentation.
Figure 4b. Radar plot — test metrics without augmentation.