Angle Estimation Performance(Paper Layout)

Table 1. Angle Error: RAW vs USED (Canny post-processing)

Model Params Type Mean Error Robust Statistics
avg err 012 (°) avg err 123 (°) p1 err (px) Median (°) MAD (°) p95 (°)
YOLOv8n-pose 3.30 M RAW 9.6711.278.377.795.4432.85
USED 7.868.937.375.773.9324.78
YOLOv8s-pose 11.63 M RAW 11.0214.2310.366.964.8436.61
USED 11.0414.549.686.964.1743.18
YOLO11n-pose 2.91 M RAW 10.4510.109.376.144.5130.85
USED 11.0910.989.756.134.5542.30
YOLO11s-pose 9.95 M RAW 17.9916.7214.3914.409.2241.68
USED 16.9916.6114.2013.798.8844.90

Table 2. Detection Quality Metrics (best_angle.pt)

Model Box Metrics Pose Metrics
PrecisionRecallmAP50mAP50-95 PrecisionRecallmAP50mAP50-95
YOLOv8n-pose 0.8850.8850.9210.623 1.0001.0000.9950.891
YOLOv8s-pose 0.9960.8460.9340.604 1.0000.9620.9810.868
YOLO11n-pose 0.9560.8420.9120.617 1.0000.9230.9610.866
YOLO11s-pose 0.9620.9620.9780.421 1.0001.0000.9950.812

Figures

Figure 1. Mean angle error comparison between RAW and USED outputs (lower is better).
Figure 2. Pose detection quality metrics (higher is better).
Figure 3. Robust error statistics based on RAW outputs.
Figure 4. Radar plot of normalized USED error metrics (outer ring indicates higher error; inner ring indicates lower error).
Figure 5. Radar plot of normalized detection quality metrics (outer ring indicates higher score).

USED = post-processed with Canny edge snap. Figure 4 uses USED error metrics with intuitive direction (inner=lower error, outer=higher error). Figure 5 uses box/pose metrics from best_angle.pt. All reported metrics are based on models trained with offline augmentation.