In the previous article we took a look at the Texas Instruments OMAP5432 board. This time we will focus on the performance of those A15 cores, the SATA interface, internal flash, and how much difference the hardware video decode makes.
Video Decode Hardware
H264 1080p video decode using the video decode hardware on the OMAP5432 required only about 17 percent of one CPU core to decode Big Buck Bunny. Video playback remained fine when the window was resized and other windows overlayed on top of the video playback window. Using Totem to playback instead of a raw gst-launch command line increased the CPU from 17 to 35 percent spikes on one CPU core. Trying to do H264 decode purely in software using mplayer and two cores didn't work at real-time playback speeds. Hopefully ffmpeg/mplayer will gain support for the video decode hardware of the OMAP5 in the future.
A closer comparison of the Octane performance of the OMAP5432 at 800Mhz to the Intel 2600K at it's stock speed settings is shown below.
The OMAP5432 has 2Gb of RAM and 4Gb of internal flash. Normally bonnie++ wants to create files that are twice the size of the system's RAM so that disk caching doesn't interfere with the benchmark. Because the internal flash was only 4Gb I limited the size of files created by bonnie++ with the command shown below. Sequential output was around 11.5Mb/sec and read came in at 496Mb/sec which was obviously heavily cached due to limited file sizes and not a real-world figure. The internal flash could muster 3270 seeks/sec.
bonnie++ -f -m omap5432flash -s 1000 -r 500 -n 0 -d `pwd`
The SATA port on the OMAP5432 was next to be benchmarked. An ext4 filesystem was created with default parameters and mounted with default parameters. The near default bonnie++ command shown below was used to perform the benchmark. I used the 120Gb model of the SanDisk Extreme range of SSD for testing. These drives can be had for around $100. About 66Mb/sec could be written to the SSD and 131Mb/sec read back from it with 8558 seeks/sec performed.
bonnie++ -f -m omapsandisk120 -d `pwd`
To test 2d graphics performance I used version 1.0.1 of the Cairo Performance Demos. There are many individual tests in the Cairo demos: the gears test runs three turning gears, the chart runs four line graphs, the fish is a simulated fish tank with many fish swimming around, gradient is a filled curved edged path what moves around the screen and flowers renders rotating flowers that move up and down the screen. For comparison I used a desktop machine running an Intel 2600K CPU with an NVidia GTX 570 card which drives two screens, one at 2560x1440 and the other at 1080p. I've also included the performance of the GK802 running at 1080p and BeagleBone Black running at 720p. Clearly the 2D performance in cairo is currently not seeing any special optimizations on the OMAP5432.
OpenSSL Speed Tests
Compiling openssl took around 7.5 minutes on the OMAP5432, about 8.5 minutes on the GK802 quad core Freescale i.MX6 machine, and around 1.5 minutes on an Intel 2600K CPU.
The single core openssl speed test showed similar results for ciphers and digests with the OMAP5432 performing at or below the BeagleBone Black for these single core tests. RSA signature performance of the OMAP5432 was significantly above the Beagle but still well below the ODroid-U2. Even though the OMAP5432 was clocked at 800Mhz it could still perform more than 1/3 more RSA signatures than the 1Ghz BeagleBone Black. At slightly under half the clock rate to the ODroid-U2 (1.7Ghz) the OMAP5432 performed about 2/3 the amount for RSA signatures per second. If the performance scales linearly, when the OMAP5432 is clocked at it's native 1.5Ghz it should get about 410 on the RSA graph, which is about 25 percent faster than the ODroid-U2.
The OMAP5432 was able to obtain a higher Octane score than the ODroid-U2 even though it has half the number of CPU cores and ran them at a lower clock rate. Video playback in a mainstream player like Totem wasn't much of a tax on the main cores of the CPU at all. Adding a SATA port to the board allowed seek and read speeds (130+Mb/sec) on a good size disk that can not be achieved through microSD cards. While some expensive microSD cards offer 90Mb/s transfers you might be paying $50 for 16Gb of storage in that form factor and speed. Looking at other A15 boards might provide some insight into the performance of openssl speed testing on that CPU design and how the compile might be optimized for that CPU. We would like to thank SVTronics for the OMAP5432 EVM review hardware.
Read other reviews in this series: