CARIL: Confidence-Aware Regression in Imitation Learning for Autonomous Driving

Elahe Delavari1 ORCID Aws Khalil1 ORCID , Jaerock Kwon1 ORCID
1University of Michigan-Dearborn

🎥 Demo Video

📄 Abstract

End-to-end vision-based imitation learning has demonstrated promising results in autonomous driving by learning control commands directly from expert demonstrations. However, traditional approaches rely on either regressionbased models, which provide precise control but lack confidence estimation, or classification-based models, which offer confidence scores but suffer from reduced precision due to discretization. This limitation makes it challenging to quantify the reliability of predicted actions and apply corrections when necessary. In this work, we introduce a dual-head neural network architecture that integrates both regression and classification heads to improve decision reliability in imitation learning. The regression head predicts continuous driving actions, while the classification head estimates confidence, enabling a correction mechanism that adjusts actions in low-confidence scenarios, enhancing driving stability. We evaluate our approach in a closed-loop setting within the CARLA simulator, demonstrating its ability to detect uncertain actions, estimate confidence, and apply real-time corrections. Experimental results show that our method reduces lane deviation and improves trajectory accuracy by up to 50%, outperforming conventional regression-only models. These findings highlight the potential of classification-guided confidence estimation in enhancing the robustness of vision-based imitation learning for autonomous driving.

🖼️ Method Overview

CARIL architecture

📊 Evaluation Results

Classification Performance

The classification head achieved an overall accuracy of 76.33%. The highest precision was observed in class 5 (straight driving), reflecting its dominance in the dataset. However, the model struggled more with underrepresented classes (e.g., sharp turns). Misclassifications primarily occurred between similar steering classes.

Confusion Matrix

Real-Time Confidence Estimation

The model estimates the confidence of its predictions using classification entropy and maximum softmax probability. When the model is confident (left image), the regression prediction is retained. In low-confidence situations (right image), a correction is applied. This enhances safety by dynamically adapting predictions in uncertain scenarios.

Real-time confidence estimation

Route Similarity Analysis

Confidence-guided correction significantly improved trajectory accuracy, particularly in complex scenarios (e.g., two-turn routes). Results show a reduction in deviation of up to 50% across multiple metrics including Fréchet distance, DTW, Area Between Curves (ABC), and Curve Length (CL).

Fréchet & DTW Metrics
Route Type Fréchet ↓ (W/ Corr.) Fréchet ↓ (W/O Corr.) DTW ↓ (W/ Corr.) DTW ↓ (W/O Corr.)
One-Turn6.667.111108.421148.90
Straight1.501.56145.27160.82
Two-Turn8.9325.995417.386605.88
ABC & Curve Length
Route Type ABC ↓ (W/ Corr.) ABC ↓ (W/O Corr.) CL ↓ (W/ Corr.) CL ↓ (W/O Corr.)
One-Turn635.16655.990.2260.236
Straight32.2641.560.3510.503
Two-Turn3267.103726.180.5961.479
Overall Route Similarity
Metric Mean ↓ (W/ Corr.) Mean ↓ (W/O Corr.) Std ↓ (W/ Corr.) Std ↓ (W/O Corr.)
Fréchet5.7011.563.3422.82
DTW2223.692638.542443.113401.99
ABC1311.511474.581447.181749.69
Curve Length0.3910.7390.1771.219

📖 Citation


  @article{delavari2025carilconfidenceawareregressionimitation,
    title={CARIL: Confidence-Aware Regression in Imitation Learning for Autonomous Driving}, 
    author={Elahe Delavari and Aws Khalil and Jaerock Kwon},
    year={2025},
    eprint={2503.00783},
    archivePrefix={arXiv},
    primaryClass={cs.RO},
    url={https://arxiv.org/abs/2503.00783}, 
}