Linux and interrupt latency (Axel)
Introduction[edit | edit source]
As known, Linux is not a real-time operating system. As such it can't guarantee that interrupt latency is always smaller than a determined limit.
When developing applications based on embedded platforms, this can cause unacceptable behaviors. For this reason system integrators have to be aware of it and, if necessary, have to implement specific strategies to prevent or limit them.
Numerous solutions exist (see for example AN-XELK-001:_Asymmetric_Multiprocessing_.28AMP.29_on_Axel_.E2.80.93_Linux_.2B_FreeRTOS, BRX-WP001:_Real-timeness,_system_integrity_and_TrustZone®_technology_on_AMP_configuration, https://rt.wiki.kernel.org/index.php/Main_Page, http://elinux.org/CPU_Shielding_capability), however an exhaustive discussion of these is beyond the scope of this document. Some practical considerations are illustrated instead, based on real-world cases involving Axel platforms and [Axel_Embedded_Linux_Kit_(XELK) XELK].
Disabling interrupts at kernel level[edit | edit source]
A lot of device drivers exist in the kernel space. Many of them need to temporarily disable interrupts to implement specific hardware-related operations. During the interrupt-disabled time windows, any peripheral could issue an interrupt request anyway. If this happens, associated interrupt service routine execution is delayed until (at least) interrupts are enabled again. As a consequence, in this particular case the observed interrupt latency is greater than the average one. It is virtually impossible to estimate the upper bound because it is not feasible to explore all the possible execution paths at kernel level. Generally speaking, maximum interrupt latency can be orders of magnitude greater that the average one.
A real-word case: UART RX FIFO overrun[edit | edit source]
[Power_management_(Axel)#Interrupt_latency This case] is a typical example that is pretty hard to analyze because a lot of variables are involved and therefore it is extremely difficult to isolate cause-effect relationships. In other words, with the help of software-based tools only, it is not easy to find out which are the drivers that cause the UART RX interrupt latency to explode. Advanced debugging tools providing tracing capabilities such as Lauterbach TRACE32/PowerDebug are extremely useful to perform such investigations. In this specific case, this tool allowed to find out that the SD controller driver affects interrupt latencies dramatically. This is due to 1ms delay that is implemented in the driver after disabling the interrupts. In case the physical SD card interface is not provided with the card detect signal, this conditions occurs on a regular basis because the polling mechanism used to detect the presence of the card. Thus it is likely that RX FIFO interrupts is triggered in the middle of such delay, causing the overrun issue.