OpenCL™ Optimization Case Study: Diagonal Sparse Matrix Vector Multiplication 
Bryan Catanzaro  5/10/2010 

AMD Phenom II X4 965 CPU Results

As we discussed earlier, a very well optimized implementation of this code for the AMD Phenom II X4 965 CPU might achieve 10.5 SP GFLOP/s on this computation. The upper bound on performance is 10.6 SP GFLOP/s: the processor has 21.3 GB/s of memory bandwidth, and the bound assumes perfect caching of the vector and offset array.
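As a rough check of the bound above, the following sketch recomputes it from the memory bandwidth. It rests on two assumptions that this section does not state: each stored matrix element is a 4-byte single-precision value read from memory once, and each element contributes one multiply and one add. The vector and offset array are treated as perfectly cached, so they add no memory traffic. All names are illustrative.

```python
# Bandwidth-bound estimate for single-precision diagonal SpMV.
# A sketch under stated assumptions, not the article's own derivation.

MEMORY_BANDWIDTH_GBS = 21.3  # GB/s, from the text
BYTES_PER_ELEMENT = 4        # assumption: one 4-byte float streamed per matrix element
FLOPS_PER_ELEMENT = 2        # assumption: one multiply and one add per matrix element


def bandwidth_bound_gflops(bandwidth_gbs, bytes_per_element, flops_per_element):
    """Return the GFLOP/s ceiling when only matrix elements cost memory traffic."""
    giga_elements_per_second = bandwidth_gbs / bytes_per_element
    return giga_elements_per_second * flops_per_element


bound = bandwidth_bound_gflops(
    MEMORY_BANDWIDTH_GBS, BYTES_PER_ELEMENT, FLOPS_PER_ELEMENT
)
print(f"Bandwidth bound: {bound:.2f} SP GFLOP/s")
```

Under these assumptions the estimate comes to about 10.65 SP GFLOP/s, which is consistent with the 10.6 SP GFLOP/s bound quoted above.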

After applying all of our optimizations except one, we reach 2.9 SP GFLOP/s, as shown in Figure 12. The exception is the use of OpenCL™ images to cache the vector, because the ATI Stream SDK v2.1 does not currently support images on x86 CPUs. Although we achieved only 27% of the bound, OpenCL™ still enabled us to run the same code and utilize all of our cores.
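The fraction of the bound quoted above can be checked with a short calculation. The sketch below uses only the two figures from the text (2.9 and 10.6 SP GFLOP/s). The effective-bandwidth estimate at the end goes further and assumes 4 bytes of matrix data moved per 2 floating-point operations; that ratio is our assumption, not a measurement from the article.

```python
# Fraction of the bandwidth bound achieved on the CPU.
# A sketch; the bytes-per-flop ratio below is an assumption, not a measured value.

ACHIEVED_GFLOPS = 2.9  # SP GFLOP/s, from the text
BOUND_GFLOPS = 10.6    # SP GFLOP/s, from the text

fraction_of_bound = ACHIEVED_GFLOPS / BOUND_GFLOPS
print(f"Fraction of bound: {fraction_of_bound:.1%}")

# Assumption: 4 bytes streamed per matrix element, 2 flops per element.
BYTES_PER_FLOP = 4 / 2
effective_bandwidth_gbs = ACHIEVED_GFLOPS * BYTES_PER_FLOP
print(f"Implied matrix traffic: {effective_bandwidth_gbs:.1f} GB/s")
```

The ratio works out to a little over 27%, matching the figure in the text. Under the stated traffic assumption, the achieved rate corresponds to roughly 5.8 GB/s of matrix data, well below the 21.3 GB/s the processor can supply.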


Figure 12: Vectorized AMD Phenom II X4 965 CPU Results

OpenCL™ and the OpenCL™ logo are trademarks of Apple Inc. used by permission by Khronos.

© 2010 Advanced Micro Devices, Inc. AMD, the AMD Arrow logo, AMD Opteron, AMD Athlon, AMD Turion, AMD Sempron, AMD Phenom, ATI Radeon, Catalyst, AMD LIVE!, and combinations thereof, are trademarks of Advanced Micro Devices, Inc. Microsoft and Windows are registered trademarks of Microsoft Corporation in the United States and/or other jurisdictions. Linux is a registered trademark of Linus Torvalds. Other names are for informational purposes only and may be trademarks of their respective owners.
