40555 Rev. 3.00 June 2006 | Performance Guidelines for AMD Athlon™ 64 and AMD Opteron™ |
| ccNUMA Multiprocessor Systems |
A.5 Why Is 0 Hop-1 Hop Case Slower Than
0
When a 0
•Node 0: 2 foreground threads.
•Node 1: 1 background thread.
•Node 3: 1 background thread.
•Node 2: 1 background thread.
In the 0
•Node 0: 1 foreground thread
•Node 1: 1 foreground and 1 background threads.
•Node 3: 1 background thread.
•Node 2: 1 background thread.
The 0
Each of the background threads, as before, asks for data at a rate of 4GB/s and each of the foreground threads asks for data at a rate of 2.98 GB/s.
Data shows that there is a total memory access rate of 4.78 GB/s on node 1 and several buffer queues on node 1 are saturated and cannot absorb the data provided by the memory controller any faster.
A.6 Support for a
Developers should ensure that the OS is properly configured to support ccNUMA. All versions of Microsoft® Windows® XP for AMD64 and Windows Server for AMD64 support ccNUMA without any configuration changes. The
Appendix A | 43 |