tl;dr Raspberry Pi engineers tweaked SDRAM timings and other memory settings on the Pi, resulting in a 10-20% speed boost at the default 2.4 GHz clock. I of course had to test overclocking, which got me a 32% speedup at 3.2 GHz! Changes may roll out in a firmware update for all Pi 5 and Pi 4 users soon.
My quest for the world record Geekbench 6 score on a Pi 5 continues, as a couple months ago Martin Rowan used cooling and NUMA emulation tricks to beat my then-record score.
But Raspberry Pi's engineers are tweaking memory timings even further. They've talked to Micron and implemented a number of small tweaks that—along with NUMA emulation—really add up to a performance improvement for multi-core workloads. And even a little improvement for single-core!
The sdram refresh interval is currently using the default data sheet settings. You can actually monitor the temperature of the sdram and it reports if refresh at half or quarter the rate can be done. That allows the overhead due to refresh to be reduced by a half or a quarter which does improve benchmark results.
We got in contact with Micron, and there is good news. They have said they actually test their 8GB sdram with the 4GB refresh rate timing (rather than the slower jedec timings), and so it was be safe to run the 8GB parts with 4GB timing.
The tweaks can also give the Pi 4 a boost, but the Pi 5 improves more dramatically:
Pi 5 also has faster sdram, better access to sdram (i.e. wider/faster internal buses), so generally the improvements with NUMA are greater.
SDRAM Tweaks
To get the latest RAM speedups for now (this may be default soon):
- Update the Pi's firmware to the latest version:
sudo rpi-update
(confirm withY
) - Edit the bootloader config:
sudo rpi-eeprom-config -e
- Add the configuration
SDRAM_BANKLOW=1
(for Pi 5... for Pi 4, use3
) - Reboot
NUMA Emulation
Since my first post on NUMA emulation on the Pi 5, the patches required have been added to Raspberry Pi's OS kernel.
So to use NUMA, all you have to do is make sure you're on the latest Pi OS (e.g. sudo apt full-upgrade
).
To check if NUMA emulation is working, run dmesg | grep NUMA
and make sure it says something like mempolicy: NUMA default policy overridden to 'interleave:0-7'
. You can tweak the settings if you want by adding numa=fake=[n]
inside /boot/firmware/cmdline.txt
, though the defaults should be appropriate for most use cases.
Overclocking
Following my own guide for overclocking the Pi 5, I set the following inside /boot/firmware/config.txt
:
over_voltage_delta=72000
arm_freq=3200
gpu_freq=1000
After rebooting, I set the fan to 100%, hacked the kernel with my pi-overvolt project to boost the core voltage, and set the scaling governor to performance
:
$ pinctrl FAN_PWM op dl
$ cd pi-overvolt && sudo ./removelimit && vcgencmd cache_flush
$ echo performance | sudo tee /sys/devices/system/cpu/cpu*/cpufreq/scaling_governor
Geekbench World Record, Part 2
With just the default firmware changes, my Geekbench scores already increase quite a bit (+8% single, +18% multi). Adding a 3.2 GHz overclock on top (using my pi-overvolt hack to boost voltages), those increases go to +32% single, +31% multi, resulting in yet another world-record Geekbench 6 score!
Geekbench Result | Pi 5 - defaults | SDRAM + NUMA | 3.2 GHz OC |
---|---|---|---|
Single | 833 | 899 (+8%) | 1153 (+32%) |
Multi | 1805 | 2169 (+32%) | 2468 (+31%) |
I also ran these tests with just an Argon THRML 30-AC Active Cooler. To try to keep temps under control, I boosted the fan speed to 100%: pinctrl FAN_PWM op dl
.
A word on overclocking: I've now overclocked around 20 Pi 5s, and found most to be capable of 2.6 or 2.8 GHz, and many (about half) to be capable of 3.0 GHz. But beyond that, very few can hit 3.1 GHz or beyond. More exotic overclocking (to 3.4 or 3.5 GHz) is much more difficult, and I've only had one Pi 5 that even boots reliably at those speeds, with more extensive cooling. RAM timings were already not happy at those speeds, and with the extra SDRAM tweaks, I imagine extreme overclocking will be even more unstable.
Conclusion
These optimizations could become default soon. I started looking into this after someone on Twitter mentioned seeing Pi 500 Geekbench results starting in September—all seemingly with these tweaks in place already!
Memory speed has been a thorn in the Pi 5's side in comparison to many RK3588 boards. It's nice to see the SDRAM tweaks giving it a significant speed boost, over a year post-launch.
And eagle-eyed readers may note I only overclocked to 3.2 GHz instead of 3.4 GHz this time. I'll leave the door open for someone else to combine all the above tricks to hit another new WR score ;)
Comments
How about a benchmark of updated rpi 4 and 5?
For the Pi 5, see the benchmark results table—the first result is a Pi 5 with current Pi OS out of the box (unaltered), the second result is with the firmware update and new NUMA and SDRAM tweaks applied (which is the most significant gain). Third result is with a ridiculous 3.2 GHz overclock that is quite overkill for most people.
Thanks for the optimizations Jeff. I always look forward to reading your latest posts on the RPs and trying them out at home. The RP5 seems indeed a bit snappier now and becoming more viable as a "light' desktop PC for the basics with the applied changes, NVME drive and overclocking. Its not the destination and end result but the journey itself which makes it fun to tinker with the raspberry pi.
Actually, RK3588 has 64-bit DDR bus. What about RPI5?
From Raspberry Pi's BCM2712 Docs, it says:
Real-world I've measured about 14 GB/sec with the default OS that it shipped with, and I've just re-tested it with the latest firmware, and am now getting only 9.3 GB/sec.
Great! But kind of baffling to hear only now their memory supplier tells them how to properly use their product
According to the forum post, they originally followed the spec sheet, but after investigation and after discussing it with Micron, they found out the spec sheet was too conservative with the numbers. So then they pushed it. I'd rather have engineers start from the documentation and follow specs, then later adjust if possible, than YOLOing things for best performance first!
Hello Jeff! We are designing an expansion board based on the CM5, featuring rich functionalities and ease of use—perfect for makers and developers. We’d be happy to send you a sample for unboxing and review to share your experience. Would you be interested? Looking forward to hearing from you! 😊
The stock clock SDRAM + NUMA result in the table doesn't match the body text - 1805 -> 2169 is ~20%.
Hi Jeff!
Thanks so much for that information. Didn't know about RPi FIrmware - updated it the first time (since I bought it '21) and did the SDRAM tweaks. My RPi 4 2GB feels a LOT faster. Going to change the SD card with an external SSD and me and my docker containers will be even happier :)
Hi Jeff!
Is there a specific SDRAM tweaking for PIZero models, too?
No, none that I'm aware of. Only Pi 4 / Pi 5.
Hi Jeff.
Excellent contribution, Thanks a lot!
On overclocking: On vcgencmd pmic_read_adc output only VDD_CORE_A increases significantly depending of arm_freq / gpu_freq values.
(In my case 2400-800=>0.8A; 2600-850=>1.1A; 2800-950=>1.56A)
Can we use this fast indicator for capabilities of exact RPi5?
What value would be risky? (Seems 1.5A is too much...)
Thank you for your work to make the Raspberry Pi a better ecosystem. Between stuff like this and Ansible I can't think of anyone outside Pi engineers who has made a bigger difference. (Tips hat to Debian devs too.)
Questions:
Is the first mod
SDRAM_BANKLOW=1
set on the Pi or on the boot device? IOW if I perform this step and then swap another SD/SSD/NVME into my Pi, will the tweaks still be there? Conversely, if I swap the media to a different Pi, will it benefit from this RAM refresh timing tweak? (I'm supposing that the changes effected byrpi-update
involve both files stored in the SC card (/boot/firmware/...
and the EEPROM on the Pi itself.I forgot my second question. :-/ Probably how to unwind the changes.
At the moment I'm pulling my hair out trying to figure out why I can't install RpiOS/Ubuntu/Debian on an SSD and successfully boot on a Pi 4B (from USB.) I'm going to back out these changes to see if that makes a difference.
I'll answer my own question here. I have confirmed that the EEPROM change (SDRAM_BANKLOW=1) is retained when the boot media is changed. IOW it is part of the Pi.
The other issue I was facing (firstboot processing not completing) was resolved by secure erasing the SSD on a PC with a hot swap SATA bay. This is not possible (or difficult) using a USB adapter.
What type of memory test are you using to measure performance before/after these upgrades?
Mostly
tinymembench
.Hi, pi-overvolt gives an error. Any idea?
daniel@raspberrypi:~/pi-overvolt $ ./compile.sh
/usr/bin/ld: kan inte hitta /usr/src/linux-headers-6.6.62+rpt-common-rpi/include/linux/broadcom/: file format not recognized
collect2: error: ld returned 1 exit status
Please check the GitHub issues; a few of us have run into this issue, and I think we posted a solution there (it's a file path error, could probably be fixed better!).
Thanks, just needed a tiny bit more voltage to beat your record. :) Sorry. Voltage at 1.023V.
Haha, nice! And congrats!
Just applied the SDRAM tweaks, and nothing else:
Single core:901
Multi core: 2156
(I'm running on an old mSATA usb 3 drive, because that's cheap).
Next up is a bit of overclocking, I'm just curious.
Hi Jeff
Will these tweaks work on the new RPi500?
I guess it should do as it is basically a Pi5??
Thanks
Yes, in my testing they're nearly identical in how they perform with and without the tweaks.
It appears that the 6.12.6 kernel may have improved performance, especially for multi-core - based on Geekbench 6 results. At 3.0GHz CPU and with the sdram OC I am getting sc: 1092 and mc: 2382 - https://browser.geekbench.com/v6/cpu/9706889
I tried going to 3.1GHz cpu without the voltage hack and it wouldn't boot.