| Author(s) | Collection number | Pages | Download abstract | Download full text |
|---|---|---|---|---|
| Hileta I. V., Коминар Т. Н. | № 2 (71) | 155-165 |
|
|
This article is devoted to the analysis of modern methods for optimizing computer vision models for deployment on edge devices with limited computational resources. The key approaches to adapting YOLO (You Only Look Once) family models for edge platforms such as Orange Pi 5 and Raspberry Pi 5 using hardware neural network accelerators (NPU, TPU, VPU) are examined. Special attention is paid to model compression methods, including quantization (INT8, FP16), structural and non-structural pruning, knowledge distillation, and computational graph optimization. Inference delegation systems are analyzed, including TensorRT, RKNN Toolkit, and Edge TPU Delegate, which ensure efficient use of specialized accelerators for real-time object detection.
The study demonstrates that the application of combined optimization methods allows achieving model size reduction up to 70% while maintaining detection accuracy above 98% of the baseline level. A comprehensive optimization pipeline is proposed, covering stages from dataset preparation to model deployment on the target platform, taking into account energy consumption and computational power constraints. The pipeline includes data collection and annotation, augmentation, stratified sampling, baseline model training, accuracy evaluation, quantization (QAT/PTQ), export to intermediate formats (ONNX, TFLite), inference delegation through specialized toolkits (RKNN, TensorRT, Edge TPU Compiler), edge deployment testing, and performance profiling.
Comparative analysis of hardware accelerators reveals significant differences in performance, supported precision formats, and application areas. Google Coral Edge TPU provides up to 4 TOPS with mandatory INT8 quantization, making it optimal for IoT and smart home systems. NVIDIA Jetson Xavier NX delivers 21 TOPS with FP16/INT8 support, suitable for drones, robotics, and video surveillance. Hailo-8 offers 26 TOPS for detection tasks in smart cameras and UAVs. Intel Movidius Myriad X (1 TOPS) is designed for low-budget CV applications, while Kendryte K210 (0.3 TOPS) targets mini-drones and IoT sensors.
The research findings create a foundation for practical implementation of object detectors in embedded systems, autonomous unmanned aerial vehicles, airspace video monitoring systems, and other edge-AI applications where performance, autonomy, and energy efficiency are critical. Future research directions include refining optimization pipelines, adapting models for diverse hardware platforms, and enhancing energy efficiency to enable high-performance and accurate detection solutions on edge devices.
Keywords: computer vision, YOLO, edge AI, neural network optimization, quantization, NPU/TPU, object detection, embedded systems, SBC, inference delegation.
doi: 10.32403/1998-6912-2025-2-71-140-154