DAVE Developer's Wiki β

MISC-TN-008: Running Debian Buster (armbian) on Mito8M

2,727 bytes added, 09:22, 15 January 2020
|
|}
 
==Test programs==
===STREAM===
 
====Building====
<pre class="board-terminal">
git clone https://github.com/jeffhammond/STREAM.git
cd STREAM
make
</pre>
 
<syntaxhighlight lang="makefile" line='line'>
armbian@Mito8M:~/devel/STREAM$ cat Makefile
CC = gcc
CFLAGS = -O2 -fopenmp

FC = gfortran-4.9
FFLAGS = -O2 -fopenmp

all: stream_c.exe

stream_f.exe: stream.f mysecond.o
	$(CC) $(CFLAGS) -c mysecond.c
	$(FC) $(FFLAGS) -c stream.f
	$(FC) $(FFLAGS) stream.o mysecond.o -o stream_f.exe

stream_c.exe: stream.c
	$(CC) $(CFLAGS) stream.c -o stream_c.exe

clean:
	rm -f stream_f.exe stream_c.exe *.o

# an example of a more complex build line for the Intel icc compiler
stream.icc: stream.c
	icc -O3 -xCORE-AVX2 -ffreestanding -qopenmp -DSTREAM_ARRAY_SIZE=80000000 -DNTIMES=20 stream.c -o stream.omp.AVX2.80M.20x.icc
</syntaxhighlight>
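The array length is fixed at compile time through the <code>STREAM_ARRAY_SIZE</code> macro, as the <code>icc</code> example line in the Makefile shows, and the comments in <code>stream.c</code> recommend that each array be at least four times the size of the last-level cache. The following sketch is not part of the original test procedure: it estimates the minimum element count and the resulting memory footprint. The helper names and the 1 MiB cache size are illustrative assumptions; check the actual figure for your SoC.

```shell
#!/bin/sh
# Hypothetical helpers (not part of STREAM) for choosing STREAM_ARRAY_SIZE.

# Minimum elements per array so that one array is >= 4x the cache size.
# $1 = last-level cache size in bytes, $2 = bytes per element (default 8).
min_stream_elements() {
    cache_bytes=$1
    elem_bytes=${2:-8}
    echo $(( 4 * cache_bytes / elem_bytes ))
}

# Total footprint in MiB of the three STREAM arrays, rounded down.
# $1 = elements per array, $2 = bytes per element (default 8).
stream_footprint_mib() {
    elements=$1
    elem_bytes=${2:-8}
    echo $(( 3 * elements * elem_bytes / 1048576 ))
}

# Assumption: 1 MiB of last-level cache; replace with the real value.
min_stream_elements 1048576
# Footprint for 10000000 elements of 8 bytes each.
stream_footprint_mib 10000000

# A larger run could then be built by hand, for example:
#   gcc -O2 -fopenmp -DSTREAM_ARRAY_SIZE=20000000 -DNTIMES=20 stream.c -o stream_c.exe
```

With 10000000 elements the three arrays need roughly 229 MiB, so any increase of <code>STREAM_ARRAY_SIZE</code> should be checked against the RAM available on the board.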
 
====Running====
<pre class="board-terminal">
armbian@Mito8M:~/devel/STREAM$ ./stream_c.exe
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 10000000 (elements), Offset = 0 (elements)
Memory per array = 76.3 MiB (= 0.1 GiB).
Total memory required = 228.9 MiB (= 0.2 GiB).
Each kernel will be executed 10 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 4
Number of Threads counted = 4
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 46427 microseconds.
(= 46427 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function    Best Rate MB/s  Avg time     Min time     Max time
Copy:            6770.5     0.024010     0.023632     0.025117
Scale:           6093.2     0.027474     0.026259     0.029142
Add:             5263.5     0.046008     0.045597     0.046230
Triad:           4820.0     0.050297     0.049793     0.050723
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
</pre>
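The "Best Rate" column can be cross-checked against "Min time". STREAM counts the bytes each kernel moves per iteration (two arrays for Copy and Scale, three for Add and Triad) and divides by the best time, with 1 MB taken as 10<sup>6</sup> bytes. A minimal sketch of that arithmetic, using a hypothetical <code>stream_rate</code> helper and the values from the run above:

```shell
#!/bin/sh
# Hypothetical helper (not part of STREAM): recompute a kernel's best rate in MB/s.
# $1 = arrays touched per iteration (2 for Copy/Scale, 3 for Add/Triad)
# $2 = elements per array, $3 = bytes per element, $4 = min time in seconds
stream_rate() {
    LC_ALL=C awk -v n="$1" -v elems="$2" -v bytes="$3" -v t="$4" \
        'BEGIN { printf "%.1f\n", n * elems * bytes / t / 1e6 }'
}

stream_rate 2 10000000 8 0.023632   # Copy
stream_rate 2 10000000 8 0.026259   # Scale
stream_rate 3 10000000 8 0.045597   # Add
stream_rate 3 10000000 8 0.049793   # Triad
```

The recomputed values should agree with the reported rates; a difference in the last digit can appear because the printed times are rounded to six decimal places.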