Changes

MISC-TN-009: Characterizing the RAM bandwidth of Mito8M SoM

11,115 bytes added, 11:11, 15 January 2020

→‎LM Bench

Solution Validates: avg error less than 1.000000e-13 on all three arrays

-------------------------------------------------------------

</pre>

===LMbench===

armbian@Mito8M:~/devel/lmbench$ sudo lmbench-run

[sudo] password for armbian:

/usr/lib/lmbench/scripts/gnu-os: unable to guess system type

This script, last modified 2004-08-18, has failed to recognize

the operating system you are using. It is advised that you

download the most up to date version of the config scripts from

ftp://ftp.gnu.org/pub/gnu/config/

If the version you run (/usr/lib/lmbench/scripts/gnu-os) is already up to date, please

send the following data and any information you think might be

pertinent to <config-patches@gnu.org> in order to provide the needed

information to handle your system.

config.guess timestamp = 2004-08-18

uname -m = aarch64

uname -r = 4.14.98-g4c94e1dbaec2

uname -s = Linux

uname -v = #1 SMP PREEMPT Mon Sep 30 14:46:22 CEST 2019

/usr/bin/uname -p =

/bin/uname -X =

hostinfo =

/bin/universe =

/usr/bin/arch -k =

/bin/arch =

/usr/bin/oslevel =

/usr/convex/getsysinfo =

UNAME_MACHINE = aarch64

UNAME_RELEASE = 4.14.98-g4c94e1dbaec2

UNAME_SYSTEM = Linux

UNAME_VERSION = #1 SMP PREEMPT Mon Sep 30 14:46:22 CEST 2019

=====================================================================

L M B E N C H C ON F I G U R A T I O N

----------------------------------------

You need to configure some parameters to lmbench. Once you have configured

these parameters, you may do multiple runs by saying

"make rerun"

in the src subdirectory.

NOTICE: please do not have any other activity on the system if you can

help it. Things like the second hand on your xclock or X perfmeters

are not so good when benchmarking. In fact, X is not so good when

benchmarking.

=====================================================================

If you are running on an MP machine and you want to try running

multiple copies of lmbench in parallel, you can specify how many here.

Using this option will make the benchmark run 100x slower (sorry).

NOTE: WARNING! This feature is experimental and many results are

known to be incorrect or random!

MULTIPLE COPIES [default 1]:

=====================================================================

Options to control job placement

1) Allow scheduler to place jobs

2) Assign each benchmark process with any attendent child processes

to its own processor

3) Assign each benchmark process with any attendent child processes

to its own processor, except that it will be as far as possible

from other processes

4) Assign each benchmark and attendent processes to their own

processors

5) Assign each benchmark and attendent processes to their own

processors, except that they will be as far as possible from

each other and other processes

6) Custom placement: you assign each benchmark process with attendent

child processes to processors

7) Custom placement: you assign each benchmark and attendent

processes to processors

Note: some benchmarks, such as bw_pipe, create attendent child

processes for each benchmark process. For example, bw_pipe

needs a second process to send data down the pipe to be read

by the benchmark process. If you have three copies of the

benchmark process running, then you actually have six processes;

three attendent child processes sending data down the pipes and

three benchmark processes reading data and doing the measurements.

Job placement selection [default 1]:

=====================================================================

Hang on, we are calculating your timing granularity.

OK, it looks like you can time stuff down to 5000 usec resolution.

Hang on, we are calculating your timing overhead.

OK, it looks like your gettimeofday() costs 0 usecs.

Hang on, we are calculating your loop overhead.

OK, it looks like your benchmark loop costs 0.00000136 usecs.

=====================================================================

Several benchmarks operate on a range of memory. This memory should be

sized such that it is at least 4 times as big as the external cache[s]

on your system. It should be no more than 80% of your physical memory.

The bigger the range, the more accurate the results, but larger sizes

take somewhat longer to run the benchmark.

MB [default 2097]: 1024

Checking to see if you have 1024 MB; please wait for a moment...

1024MB OK

Hang on, we are calculating your cache line size.

OK, it looks like your cache line is 64 bytes.

=====================================================================

lmbench measures a wide variety of system performance, and the full suite

of benchmarks can take a long time on some platforms. Consequently, we

offer the capability to run only predefined subsets of benchmarks, one

for operating system specific benchmarks and one for hardware specific

benchmarks. We also offer the option of running only selected benchmarks

which is useful during operating system development.

Please remember that if you intend to publish the results you either need

to do a full run or one of the predefined OS or hardware subsets.

SUBSET (ALL|HARWARE|OS|DEVELOPMENT) [default all]:

=====================================================================

This benchmark measures, by default, memory latency for a number of

different strides. That can take a long time and is most useful if you

are trying to figure out your cache line size or if your cache line size

is greater than 128 bytes.

If you are planning on sending in these results, please don't do a fast

run.

Answering yes means that we measure memory latency with a 128 byte stride.

FASTMEM [default no]:

=====================================================================

This benchmark measures, by default, file system latency. That can

take a long time on systems with old style file systems (i.e., UFS,

FFS, etc.). Linux' ext2fs and Sun's tmpfs are fast enough that this

test is not painful.

If you are planning on sending in these results, please don't do a fast

run.

If you want to skip the file system latency tests, answer "yes" below.

SLOWFS [default no]: yes

=====================================================================

This benchmark can measure disk zone bandwidths and seek times. These can

be turned into whizzy graphs that pretty much tell you everything you might

need to know about the performance of your disk.

This takes a while and requires read access to a disk drive.

Write is not measured, see disk.c to see how if you want to do so.

If you want to skip the disk tests, hit return below.

If you want to include disk tests, then specify the path to the disk

device, such as /dev/sda. For each disk that is readable, you'll be

prompted for a one line description of the drive, i.e.,

Iomega IDE ZIP

or

HP C3725S 2GB on 10MB/sec NCR SCSI bus

DISKS [default none]:

=====================================================================

If you are running on an idle network and there are other, identically

configured systems, on the same wire (no gateway between you and them),

and you have rsh access to them, then you should run the network part

of the benchmarks to them. Please specify any such systems as a space

separated list such as: ether-host fddi-host hippi-host.

REMOTE [default none]:

=====================================================================

Calculating mhz, please wait for a moment...

I think your CPU mhz is

798 MHz, 1.2531 nanosec clock

but I am frequently wrong. If that is the wrong Mhz, type in your

best guess as to your processor speed. It doesn't have to be exact,

but if you know it is around 800, say 800.

Please note that some processors, such as the P4, have a core which

is double-clocked, so on those processors the reported clock speed

will be roughly double the advertised clock rate. For example, a

1.8GHz P4 may be reported as a 3592MHz processor.

Processor mhz [default 798 MHz, 1.2531 nanosec clock]:

=====================================================================

We need a place to store a 1024 Mbyte file as well as create and delete a

large number of small files. We default to /var/tmp. If /var/tmp is a

memory resident file system (i.e., tmpfs), pick a different place.

Please specify a directory that has enough space and is a local file

system.

FSDIR [default /var/tmp/lmbench]: /tmp/lmbench

=====================================================================

lmbench outputs status information as it runs various benchmarks.

By default this output is sent to /dev/tty, but you may redirect

it to any file you wish (such as /dev/null...).

Status output file [default /dev/tty]:

=====================================================================

There is a database of benchmark results that is shipped with new

releases of lmbench. Your results can be included in the database

if you wish. The more results the better, especially if they include

remote networking. If your results are interesting, i.e., for a new

fast box, they may be made available on the lmbench web page, which is

http://www.bitmover.com/lmbench

Mail results [default yes]: no

OK, no results mailed.

=====================================================================

Confguration done, thanks.

There is a mailing list for discussing lmbench hosted at BitMover.

Send mail to majordomo@bitmover.com to join the list.

/usr/lib/lmbench/scripts/gnu-os: unable to guess system type

This script, last modified 2004-08-18, has failed to recognize

the operating system you are using. It is advised that you

download the most up to date version of the config scripts from

ftp://ftp.gnu.org/pub/gnu/config/

If the version you run (/usr/lib/lmbench/scripts/gnu-os) is already up to date, please

send the following data and any information you think might be

pertinent to <config-patches@gnu.org> in order to provide the needed

information to handle your system.

config.guess timestamp = 2004-08-18

uname -m = aarch64

uname -r = 4.14.98-g4c94e1dbaec2

uname -s = Linux

uname -v = #1 SMP PREEMPT Mon Sep 30 14:46:22 CEST 2019

/usr/bin/uname -p =

/bin/uname -X =

hostinfo =

/bin/universe =

/usr/bin/arch -k =

/bin/arch =

/usr/bin/oslevel =

/usr/convex/getsysinfo =

UNAME_MACHINE = aarch64

UNAME_RELEASE = 4.14.98-g4c94e1dbaec2

UNAME_SYSTEM = Linux

UNAME_VERSION = #1 SMP PREEMPT Mon Sep 30 14:46:22 CEST 2019

Using config in CONFIG.Mito8M

Wed Jan 15 10:56:54 CET 2020

Latency measurements

Wed Jan 15 10:57:29 CET 2020

Local networking

Wed Jan 15 10:58:36 CET 2020

Bandwidth measurements

Wed Jan 15 11:03:02 CET 2020

Calculating context switch overhead

Wed Jan 15 11:03:09 CET 2020

Calculating effective TLB size

Wed Jan 15 11:03:10 CET 2020

Calculating memory load parallelism

Wed Jan 15 11:14:34 CET 2020

McCalpin's STREAM benchmark

Wed Jan 15 11:15:30 CET 2020

Calculating memory load latency

Wed Jan 15 11:35:54 CET 2020

Benchmark run finished....

Remember you can find the results of the benchmark

under /var/lib/lmbench/results

</pre>

U0001

Bureaucrats, dave_user, Administrators

4,650

edits

Changes

MISC-TN-009: Characterizing the RAM bandwidth of Mito8M SoM

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Quick Links

Contact us

How to use wiki

Advanced Search

Tools