
DDR4 RDIMM and LRDIMM Performance Comparison

Photo of dual Xeon E5-2600v3 motherboard with DDR4 ECC RDIMM memory

Recently, while carrying out memory testing in our integration lab, Lead Systems Integrator Rick Warner was able to clearly identify when to choose load-reduced DIMMs (LRDIMMs) and when to choose registered DIMMs (RDIMMs) for servers running large amounts of DDR4 RAM (i.e., 256 gigabytes and greater). The critical factors to consider are latency, speed, and capacity, along with your computing objectives with respect to each.

Misconceptions on Load Reduced DIMM Performance

Load-reduced DIMMs were built so that high-speed memory controllers in CPUs could drive larger quantities of memory. Thus, it’s often assumed that LRDIMMs will offer the best performance for memory-dense servers. This impression is strengthened by the fact that Intel’s guide for DDR4 memory population shows LRDIMMs running at a higher frequency than RDIMMs (e.g., 2133MHz vs 1866MHz). However, as we’ll show below, there are greater factors at play.

RDIMM vs LRDIMM Performance Testing

Using the STREAM memory benchmark, Rick tested configurations with 1 DIMM per channel and 2 DIMMs per channel, using DDR4 LRDIMMs and RDIMMs on a Supermicro X10DAi motherboard with two Intel Xeon E5-2687W v3 CPUs. Both our WhisperStation and WhisperStation for R are available in this configuration. We also have several Xeon Rackmount Servers which support this configuration.

For each case, the DIMM speed was forced to 2133MHz in the BIOS. Tests were run with both RDIMMs and LRDIMMs in 256GB and 512GB configurations.

LRDIMM Benchmark Results

Function  Best Rate (MB/s)  Avg. Time (s)  Min. Time (s)  Max. Time (s)
Copy      81,383.5          0.004005       0.003932       0.004151
Scale     95,746.7          0.003409       0.003342       0.003561
Add       109,661.0         0.004505       0.004377       0.004862
Triad     109,315.6         0.004490       0.004391       0.004771
One LRDIMM Per Channel — 256GB RAM @ 2133MHz

 

Function  Best Rate (MB/s)  Avg. Time (s)  Min. Time (s)  Max. Time (s)
Copy      72,499.2          0.004461       0.004414       0.004546
Scale     83,572.7          0.003901       0.003829       0.004036
Add       95,979.5          0.005103       0.005001       0.005220
Triad     96,541.0          0.005105       0.004972       0.005265
Two LRDIMMs Per Channel — 512GB RAM @ 2133MHz*

* for LRDIMMs, the 512GB configuration automatically operates at 2133MHz
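
For readers unfamiliar with how STREAM arrives at its Best Rate column: each rate is the number of bytes a kernel moves divided by that kernel's minimum time. The sketch below reproduces the one-LRDIMM-per-channel rates from the Min. Time column. The array length of 20,000,000 elements is an assumption inferred from the reported numbers (the article does not state it), and the function and variable names are illustrative only.

```python
# STREAM counts 8-byte doubles; Copy and Scale touch 2 arrays,
# Add and Triad touch 3 arrays per iteration.
ARRAY_ELEMENTS = 20_000_000  # assumption: inferred from the results, not stated in the article
BYTES_PER_ELEMENT = 8
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}


def best_rate_mb_s(kernel: str, min_time_s: float) -> float:
    """Bandwidth in MB/s (1 MB = 10**6 bytes) for one kernel run."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_ELEMENT * ARRAY_ELEMENTS
    return bytes_moved / min_time_s / 1e6


# Min. Time values (seconds) from the one-LRDIMM-per-channel table.
lrdimm_1dpc_min_time = {
    "Copy": 0.003932,
    "Scale": 0.003342,
    "Add": 0.004377,
    "Triad": 0.004391,
}

for kernel, min_time in lrdimm_1dpc_min_time.items():
    print(f"{kernel:6s} {best_rate_mb_s(kernel, min_time):10.1f} MB/s")
```

The recomputed values differ slightly from the table because the published minimum times are rounded to six decimal places.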

LRDIMM Performance Summary

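From these tests, we concluded that the additional latency imposed by LRDIMMs results in an approximately 12% reduction in memory bandwidth when going from one to two DIMMs per channel (doubling the RAM from 256GB to 512GB). The per-kernel drop ranges from about 11% (Copy) to about 13% (Scale).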

RDIMM Benchmark Results

Rick then tested RDIMMs on the same system for comparison (with 256GB and 512GB DDR4 memory configurations). Below are the STREAM results.

Function  Best Rate (MB/s)  Avg. Time (s)  Min. Time (s)  Max. Time (s)
Copy      82,707.5          0.003939       0.003869       0.004093
Scale     101,973.7         0.003243       0.003138       0.003471
Add       111,966.3         0.004502       0.004287       0.004978
Triad     110,881.0         0.004468       0.004329       0.004843
One RDIMM Per Channel — 256GB RAM @ 2133MHz

 

Function  Best Rate (MB/s)  Avg. Time (s)  Min. Time (s)  Max. Time (s)
Copy      75,049.1          0.004314       0.004264       0.004405
Scale     93,812.6          0.003460       0.003411       0.003550
Add       103,091.1         0.004729       0.004656       0.004969
Triad     103,493.9         0.004704       0.004638       0.004909
Two RDIMMs Per Channel — 512GB RAM @ 2133MHz*

* for RDIMMs, the 512GB configuration requires the memory speed to be manually increased to 2133MHz

RDIMM Performance Summary

Just as we saw with LRDIMMs, performance drops between 1 DIMM per channel and 2 DIMMs per channel when using RDIMMs. However, the penalty is smaller: roughly 7% to 9% depending on the STREAM kernel (about 8% on average), compared to the approximately 12% penalty suffered by LRDIMMs.
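
The penalties quoted in the two summaries can be checked directly against the Best Rate columns. The following is a minimal sketch rather than the exact method used in the lab; the helper names are hypothetical, and averaging the four kernels equally is an assumption about how an "overall" figure might be formed.

```python
# Best Rate (MB/s) from the four STREAM tables:
# (1 DIMM per channel, 2 DIMMs per channel).
results = {
    "LRDIMM": {
        "Copy": (81383.5, 72499.2),
        "Scale": (95746.7, 83572.7),
        "Add": (109661.0, 95979.5),
        "Triad": (109315.6, 96541.0),
    },
    "RDIMM": {
        "Copy": (82707.5, 75049.1),
        "Scale": (101973.7, 93812.6),
        "Add": (111966.3, 103091.1),
        "Triad": (110881.0, 103493.9),
    },
}


def percent_drop(one_dpc: float, two_dpc: float) -> float:
    """Bandwidth lost, in percent, when moving from 1 to 2 DIMMs per channel."""
    return 100.0 * (one_dpc - two_dpc) / one_dpc


def mean_drop(module_type: str) -> float:
    """Unweighted mean of the per-kernel drops (an assumed way to summarize)."""
    drops = [percent_drop(a, b) for a, b in results[module_type].values()]
    return sum(drops) / len(drops)


for module_type, kernels in results.items():
    for kernel, (one_dpc, two_dpc) in kernels.items():
        print(f"{module_type:6s} {kernel:6s} {percent_drop(one_dpc, two_dpc):5.1f}%")
    print(f"{module_type:6s} mean   {mean_drop(module_type):5.1f}%")
```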

Side-by-Side Comparison of RDIMM and LRDIMM Performance

For clarity, here is a side-by-side table of DDR4 memory performance comparing LRDIMMs to RDIMMs. Note that RDIMM memory bandwidth is higher than LRDIMM bandwidth in every case.

Best Rate (MB/s)  1 DIMM Per Channel       2 DIMMs Per Channel
Function          LRDIMM      RDIMM        LRDIMM      RDIMM
Copy              81,383.5    82,707.5     72,499.2    75,049.1
Scale             95,746.7    101,973.7    83,572.7    93,812.6
Add               109,661.0   111,966.3    95,979.5    103,091.1
Triad             109,315.6   110,881.0    96,541.0    103,493.9
LRDIMMs and RDIMMs Compared
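
To put a number on the RDIMM lead in each column pair, the relative advantage can be computed per kernel. This is an illustrative sketch using only the figures in the side-by-side table; the function names are made up for the example.

```python
# (LRDIMM, RDIMM) Best Rate in MB/s, copied from the side-by-side table.
one_dimm_per_channel = {
    "Copy": (81383.5, 82707.5),
    "Scale": (95746.7, 101973.7),
    "Add": (109661.0, 111966.3),
    "Triad": (109315.6, 110881.0),
}
two_dimms_per_channel = {
    "Copy": (72499.2, 75049.1),
    "Scale": (83572.7, 93812.6),
    "Add": (95979.5, 103091.1),
    "Triad": (96541.0, 103493.9),
}


def rdimm_advantage_percent(lrdimm: float, rdimm: float) -> float:
    """How much faster RDIMM is than LRDIMM, as a percentage of the LRDIMM rate."""
    return 100.0 * (rdimm - lrdimm) / lrdimm


def advantages(table: dict) -> dict:
    """Per-kernel RDIMM advantage for one DIMMs-per-channel configuration."""
    return {k: rdimm_advantage_percent(l, r) for k, (l, r) in table.items()}


for label, table in (("1 DPC", one_dimm_per_channel),
                     ("2 DPC", two_dimms_per_channel)):
    for kernel, pct in advantages(table).items():
        print(f"{label} {kernel:6s} RDIMM ahead by {pct:4.1f}%")
```

With these figures, the RDIMM lead is larger at 2 DIMMs per channel than at 1 DIMM per channel for every kernel.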

 

When Registered DIMMs (RDIMMs) are Best

Many of our HPC customers are looking for high speed and low latency. In that realm, RDIMMs are the hands-down winner. With a slightly lower cost and the ability to ramp up memory frequency on certain motherboards, they are the right choice for fast memory performance.

When Load-Reduced DIMMs (LRDIMMs) are Best

When very large quantities of RAM are the goal, LRDIMMs are the way to go. In the table below, taken from Intel’s Grantly Platform Memory Configuration Guide, you can see that when packing a system full of RAM you can achieve twice the capacity with LRDIMMs. However, 64GB DDR4 LRDIMMs are still quite costly. There are also specific configurations using 3 DIMMs per channel that require LRDIMMs. Contact one of our experts to discuss the best options when you are considering servers with more than 512GB of memory.

SKU         Max DIMMs in Platform  Number of CPU Sockets  RDIMM Config               LRDIMM Config
E5-1600 v3  12 DIMMs               1                      384GB (12x32GB) @ 1600MHz  768GB (12x64GB) @ 1600MHz
E5-2600 v3  24 DIMMs               2                      768GB (24x32GB) @ 1600MHz  1.5TB (24x64GB) @ 1600MHz
E5-4600 v3  48 DIMMs               4                      1.5TB (48x32GB) @ 1600MHz  3TB (48x64GB) @ 1600MHz
Memory Configuration
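
The capacities in the table are simply the DIMM slot count multiplied by the largest module size for each type (32GB for RDIMMs, 64GB for LRDIMMs, as listed above). A short sketch of that arithmetic follows; the helper names are hypothetical.

```python
# Slot counts and module sizes taken from the memory configuration table.
platforms = [("E5-1600 v3", 12), ("E5-2600 v3", 24), ("E5-4600 v3", 48)]
RDIMM_GB = 32
LRDIMM_GB = 64


def max_capacity_gb(dimm_slots: int, dimm_size_gb: int) -> int:
    """Total capacity when every slot holds a module of the given size."""
    return dimm_slots * dimm_size_gb


def format_capacity(gb: int) -> str:
    """Render as GB below 1024GB, otherwise as TB (1TB = 1024GB)."""
    return f"{gb / 1024:g}TB" if gb >= 1024 else f"{gb}GB"


for sku, slots in platforms:
    rdimm = format_capacity(max_capacity_gb(slots, RDIMM_GB))
    lrdimm = format_capacity(max_capacity_gb(slots, LRDIMM_GB))
    print(f"{sku}: RDIMM {rdimm}, LRDIMM {lrdimm}")
```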


Choosing between LRDIMMs and RDIMMs depends entirely on what performance characteristics meet the needs of your applications. Careful consideration of latency, speed and capacity as applied to your problem will show you the way to go. Our engineering team can help you work your way through this important design choice. Contact us or give us a call for assistance choosing the HPC platform that works best for you.
