File size: 7,623 Bytes
19e6678 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 | ---
license: mit
pipeline_tag: object-detection
library_name: ultralytics
base_model: yolo11n
datasets:
- lisa-traffic-sign-dataset
tags:
- computer-vision
- object-detection
- yolov11
- traffic-sign-detection
metrics:
- precision
- recall
- map50
- map50-95
---
# Traffic Sign Detection Model (YOLOv11)
## Model Description
This model is an object detection model trained using the Ultralytics YOLOv11 framework. The model detects and classifies multiple types of traffic signs in road images by predicting bounding boxes and class labels.
The model was trained by fine-tuning the pretrained **YOLOv11n** architecture on a dataset of annotated traffic signs derived from the **LISA Traffic Sign Dataset**.
### Training Approach
- Base model: YOLOv11n
- Framework: Ultralytics YOLO
- Training method: transfer learning / fine-tuning
- Task: object detection
Transfer learning allows the model to start with pretrained visual features learned from large datasets and then specialize those features for traffic sign detection.
---
# Intended Use
This model is designed for:
- traffic sign detection research
- computer vision experimentation
- academic coursework projects
- demonstrations of object detection systems
Possible applications include:
- driver assistance research
- automated traffic sign recognition
- road scene analysis
This model **should not be used in safety-critical systems such as autonomous vehicles without extensive additional testing and validation**.
---
# Training Data
## Dataset Source
The model was trained using images derived from the **LISA Traffic Sign Dataset**.
The LISA Traffic Sign Dataset is a publicly available dataset created for traffic sign detection and classification research. The dataset contains traffic sign images captured from real driving environments in the United States.
Dataset link:
https://cvrr.ucsd.edu/LISA/lisa-traffic-sign-dataset.html
### Dataset Citation
Mogelmose, A., Trivedi, M. M., & Moeslund, T. B. (2012).
**Vision-based traffic sign detection and analysis for intelligent driver assistance systems: Perspectives and survey.**
IEEE Transactions on Intelligent Transportation Systems.
---
## Classes
The model detects the following traffic sign classes:
- doNotEnter
- pedestrianCrossing
- speedLimit15
- speedLimit25
- speedLimit30
- speedLimit35
- speedLimit40
- speedLimit45
- speedLimit50
- speedLimit65
- stop
- yield
Each object instance is annotated with bounding boxes.
---
## Data Collection Methodology
Images in the dataset were collected using vehicle-mounted cameras capturing real road scenes. These images include a variety of:
- lighting conditions
- road environments
- viewing angles
- traffic sign scales
This diversity helps improve the model’s ability to generalize to new images.
---
## Annotation Process
Bounding box annotations were created using **Roboflow annotation tools**.
Annotation workflow:
1. Images were uploaded to Roboflow
2. Bounding boxes were drawn around each traffic sign
3. Each object was labeled with its correct class
4. Annotations were reviewed and corrected
5. Dataset exported in YOLO format for training
---
## Train / Validation / Test Split
The dataset was divided into three sets:
| Dataset Split | Percentage |
|---------------|-----------|
| Training | ~70% |
| Validation | ~20% |
| Test | ~10% |
---
## Data Augmentation
During training, several augmentation techniques were applied to improve generalization:
- horizontal flipping
- mosaic augmentation
- image scaling
- color adjustments (HSV)
These augmentations help the model learn to detect objects under different visual conditions.
---
# Training Procedure
## Framework
Training was performed using the **Ultralytics YOLO training framework**.
---
## Hardware
Training environment:
- GPU: Tesla T4
- Platform: Google Colab
- Training time: ~1 hour
---
## Hyperparameters
| Parameter | Value |
|-----------|------|
| Epochs | 300 |
| Batch Size | 16 |
| Image Size | 640 |
| Learning Rate | 0.01 |
| Weight Decay | 0.0005 |
---
# Evaluation Results
## Overall Model Performance
| Metric | Score |
|------|------|
| Precision | 0.99 |
| Recall | ~0.99 |
| mAP@0.5 | 0.994 |
| mAP@0.5–0.95 | ~0.89 |
| Best F1 Score | 0.99 |
These results indicate that the model performs extremely well on the validation dataset, detecting traffic signs with high accuracy and minimal false positives.
---
## Per-Class Performance
| Class | Average Precision |
|------|------|
| doNotEnter | 0.995 |
| pedestrianCrossing | 0.985 |
| speedLimit15 | 0.995 |
| speedLimit25 | 0.995 |
| speedLimit30 | 0.995 |
| speedLimit35 | 0.994 |
| speedLimit40 | 0.995 |
| speedLimit45 | 0.995 |
| speedLimit50 | 0.995 |
| speedLimit65 | 0.995 |
| stop | 0.995 |
| yield | 0.995 |
Most classes achieved extremely high detection accuracy.
The slightly lower performance for **pedestrianCrossing** may be due to higher variation in appearance and background conditions.
---
# Confusion Matrix Analysis
The confusion matrix shows that most predictions fall along the diagonal, indicating that the model correctly classifies the majority of traffic sign instances.
Examples of strong performance include:
- pedestrianCrossing: 146 correct detections
- speedLimit35: 76 correct detections
- speedLimit25: 57 correct detections
- stop: 118 correct detections
- yield: 41 correct detections
Misclassifications are rare and usually occur between visually similar traffic signs.
---
# Key Visualizations
## Precision-Recall Curve
The precision-recall curve demonstrates that the model maintains high precision across most recall values. This indicates that the model produces very few false positives while still detecting most objects.
---
## F1-Confidence Curve
The F1-confidence curve shows that the optimal detection confidence threshold is approximately **0.73**, where the model achieves an F1 score of about **0.99**.
This threshold provides the best balance between precision and recall.
---
# Performance Analysis
The model performs extremely well on the validation dataset due to several factors:
1. Transfer learning from a pretrained YOLO model.
2. Consistent visual characteristics of traffic signs.
3. Data augmentation during training.
4. Clear visual differences between most traffic sign classes.
However, these results reflect performance on the validation dataset and may not fully represent real-world performance in different environments.
---
# Limitations and Biases
## Visually Similar Classes
Speed limit signs such as **25 mph, 30 mph, and 35 mph** have similar shapes and layouts. If the number on the sign is partially obscured or blurred, the model may confuse these classes.
---
## Environmental Limitations
Model performance may degrade under certain conditions:
- poor lighting
- nighttime driving scenes
- motion blur
- heavy shadows
- extreme viewing angles
---
## Dataset Bias
The dataset primarily contains traffic signs captured in specific geographic and environmental conditions. This may introduce bias related to:
- geographic location
- road environment
- weather conditions
Performance may vary in unfamiliar environments.
---
# Ethical Considerations
This model should be used responsibly and should not be deployed in safety-critical systems without rigorous real-world testing and validation.
---
# Reproducibility
Training command used:
```python
from ultralytics import YOLO
model = YOLO("yolo11n.pt")
model.train(
data="/content/dataset/data.yaml",
epochs=300,
imgsz=640,
batch=16,
device=0
) |