Cloud Computing Mandelbrot Benchmarking Experience
Kuemmel (439) 384 posts |
Hi there, just sharing some cloud computing experience while optimising and testing my good old Mandelbrot benchmark, which ended up at 1500 lines of assembler code for optimisation variant number 4. I updated the recent code and results, especially for the Amazon Graviton series with up to 64 cores, on my website. You can find it at the link here. It’s kinda fun to test stuff on something that’s like 40 times faster than your RPi4 :-)

I’m doing those tests using the Amazon cloud computing service (AWS), uploading my code to a Linux Ubuntu shell via FTP and running it from there in a terminal in text mode. The cost of the service is quite low, as I only use it for some minutes (the latest Graviton 3, which is Cortex X-level, with 64 cores is 2.48 dollars per hour). I’ve got to say that the overall experience with AWS is quite nice. At first the interface is a bit overwhelming, but once you get used to it it’s straightforward. If you’ve got a question or need a service, they answer right away and get things done within a couple of minutes.

Encouraged by that, I also tried Google cloud computing. What a totally lame experience. When I wanted to use more than 4 cores they told me “to contact my sales representative” LMAO… I told them I’m just an enthusiast and they didn’t get that. After asking again, a lady from Google Germany called me by phone to understand the problem, and I told her. She said she’d forward me to some subcontractor. Then just nothing happened… such a loser company… :-)

For the results, some key findings: ARMv8-A vs. ARMv8.2-A atomic instructions actually begin to make a difference at high core counts. I gained about 5 percent at 64 cores when using the atomic add instruction instead of the load/store-exclusive loop, as shown here:
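(The original assembler listing isn’t reproduced here. As a rough sketch of the same idea in C, my reconstruction rather than the post’s code: with C11 atomics, AArch64 compilers emit a single LDADD for the add when the LSE atomics introduced in ARMv8.1-A are enabled, e.g. with `-march=armv8.2-a`, and fall back to an LDXR/STXR retry loop on plain ARMv8.0-A. Under contention from 64 cores the retry loop is what costs the few percent.)

```c
/* Sketch, not Kuemmel's original code: a shared counter updated by many
 * threads. The compiler chooses the AArch64 instruction sequence:
 *   plain ARMv8.0-A : ldxr / add / stxr / cbnz retry loop
 *   ARMv8.1-A+ LSE  : one ldadd, no retry loop
 * Build with e.g. -march=armv8.2-a to get the LSE form on AArch64. */
#include <pthread.h>
#include <stdatomic.h>

static atomic_long counter;

typedef struct { long iters; } job_t;

static void *count_worker(void *arg)
{
    job_t *job = arg;
    for (long i = 0; i < job->iters; i++)
        atomic_fetch_add_explicit(&counter, 1, memory_order_relaxed);
    return NULL;
}

/* Run nthreads threads (up to 64), each adding `iters`; return the total. */
long run_threads(int nthreads, long iters)
{
    pthread_t tid[64];
    job_t job = { iters };
    atomic_store(&counter, 0);
    for (int t = 0; t < nthreads; t++)
        pthread_create(&tid[t], NULL, count_worker, &job);
    for (int t = 0; t < nthreads; t++)
        pthread_join(tid[t], NULL);
    return atomic_load(&counter);
}
```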
Other than that, the benchmark also shows that it’s hard to feed enough work to the 4 NEON execution units of Graviton 3 or the Apple M1. You can only do that with hand-coded assembler, choosing the register usage yourself; no high-level language/compiler will let you do that, as far as I know. But then… who cares about Mandelbrots and assembler ;-)

The parallel efficiency is very close to 100 percent up to 16 cores, then it goes down a bit. It’s still okay at 70 percent for double precision at 64 cores on Graviton 3 for an iterative algorithm. But I guess this is also due to the thread administration overhead, when you consider that each thread/core only gets fewer than 10 lines of the 600 by 600 dots to be calculated, as I assign a complete line to each core at first.

By the way… if you’ve got an Apple M1 running Asahi Linux, I’d still like some test results, as I’ve got some gaps there. |
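The per-scanline work assignment described above can be sketched like this (my reconstruction in C, not the benchmark’s actual assembler; the atomic row counter and the 64-thread cap are assumptions). Each thread grabs the next whole 600-pixel line until none are left, which also illustrates why 64 cores end up with fewer than 10 lines each:

```c
/* Sketch of scanline-based work sharing (a reconstruction, not the
 * benchmark's code): an atomic row counter hands out complete lines,
 * so each of up to 64 threads always takes the next uncomputed row. */
#include <pthread.h>
#include <stdatomic.h>

#define WIDTH   600
#define HEIGHT  600
#define MAXITER 256

static unsigned char image[HEIGHT][WIDTH];
static atomic_int next_row;

/* Escape-time iteration for one complete 600-pixel scanline. */
static void render_row(int y)
{
    for (int x = 0; x < WIDTH; x++) {
        double cr = -2.0 + 3.0 * x / WIDTH;
        double ci = -1.5 + 3.0 * y / HEIGHT;
        double zr = 0.0, zi = 0.0;
        int it = 0;
        while (it < MAXITER && zr * zr + zi * zi < 4.0) {
            double t = zr * zr - zi * zi + cr;
            zi = 2.0 * zr * zi + ci;
            zr = t;
            it++;
        }
        image[y][x] = (unsigned char)it;  /* interior points wrap to 0 */
    }
}

static void *row_worker(void *arg)
{
    (void)arg;
    int y;
    /* Keep taking the next whole scanline until all rows are claimed. */
    while ((y = atomic_fetch_add(&next_row, 1)) < HEIGHT)
        render_row(y);
    return NULL;
}

/* Render with nthreads threads (up to 64); returns the rows completed. */
int render(int nthreads)
{
    pthread_t tid[64];
    atomic_store(&next_row, 0);
    for (int t = 0; t < nthreads; t++)
        pthread_create(&tid[t], NULL, row_worker, NULL);
    for (int t = 0; t < nthreads; t++)
        pthread_join(tid[t], NULL);
    return atomic_load(&next_row) < HEIGHT ? atomic_load(&next_row) : HEIGHT;
}
```

With this scheme the threads self-balance, but at 600 rows and 64 threads each thread still averages under 10 lines, so per-thread startup and join overhead starts to show in the totals.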
David J. Ruck (33) 1635 posts |
Only 600×600? I thought with that amount of power available you’d be generating 4K images at a minimum! |
Kuemmel (439) 384 posts |
Of course, even with realtime zoom at 4K if I do the math :-) …but I kept the 600×600 to make the results comparable to my older versions on RISC OS, and in the end the resolution doesn’t really matter, as it’s a CPU core evaluation. |