Changes

Jump to: navigation, search
no edit summary
In conclusion, to maximize the performance in terms of execution time, the model has to be fully-quantized and the number of threads has to be specified explicitly.
=== Version 2 2A and 3 ===
In this case, only the fully-quantized model could be tested and the thread number has no effect.
|'''Fully-quantized'''
|1.5
|}
 
=== Version 2B ===
 
{| class="wikitable" style="margin: auto;"
|+
Prediction times
!Model
!Prediction time
[ms]
|-
|'''Fully-quantized'''
|TBD
|}
89
edits

Navigation menu