AMD Developer Central

ACML Product Features


Features
Announcing ACML Version 4.3.0!
  • New features in Version 4.3.0 include:
    • Support for the Open64 compilers.
    • Performance of DGEMM and SGEMM has been further improved. The improvement carries through to the other Level 3 BLAS and LAPACK routines that call DGEMM and SGEMM.
    • Performance of DGEMM and SGEMM has been significantly improved on 64-bit Intel platforms.
    • An experimental "fast memory allocation" scheme has been introduced that may improve the performance of the matrix-matrix multiply routine DGEMM and of any other routines (such as those in LAPACK) that make heavy use of DGEMM.
    • Level 1 BLAS routines have been tuned for AMD Istanbul processors. Affected routines include xDOT, xCOPY, xAXPY, and xSCAL.
    • The assembly language kernels used by the real-complex FFT routines csfft, dzfft, scfft and zdfft have been re-tuned for AMD Family 10h processors, providing significant performance increases.
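For readers unfamiliar with the routine names above, the following is a minimal plain-Python sketch of what the standard BLAS operations compute. It is not ACML code and does not use the ACML interface: the function names (dgemm_ref, daxpy_ref, ddot_ref, dscal_ref) and the list-of-rows matrix layout are assumptions made for illustration, and transpose options and strides are left out.

```python
def dgemm_ref(alpha, a, b, beta, c):
    """Reference for the GEMM operation: return alpha * (a * b) + beta * c.

    Matrices are lists of rows: a is m x k, b is k x n, c is m x n.
    """
    m, k, n = len(a), len(b), len(b[0])
    if any(len(row) != k for row in a):
        raise ValueError("a must have as many columns as b has rows")
    if len(c) != m or any(len(row) != n for row in c):
        raise ValueError("c must be m x n")
    return [
        [alpha * sum(a[i][p] * b[p][j] for p in range(k)) + beta * c[i][j]
         for j in range(n)]
        for i in range(m)
    ]


def daxpy_ref(alpha, x, y):
    """Reference for xAXPY: return alpha * x + y, element by element."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    return [alpha * xi + yi for xi, yi in zip(x, y)]


def ddot_ref(x, y):
    """Reference for xDOT: return the dot product of x and y."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    return sum(xi * yi for xi, yi in zip(x, y))


def dscal_ref(alpha, x):
    """Reference for xSCAL: return alpha * x."""
    return [alpha * xi for xi in x]
```

GEMM performs on the order of m * n * k multiply-adds on roughly m * k + k * n + m * n matrix elements, which is why improvements to DGEMM and SGEMM carry through to the Level 3 BLAS and LAPACK routines built on them.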

 
Benefits
Simple interface to take advantage of latest hardware innovations
ACML tunes for the latest hardware so you can easily tap into new processor features, including:
  • SSE, SSE2, SSE3
  • Multi-core processors
Faster development of scientific and High Performance Computing projects
With tuned implementations of industry-standard math libraries and other frequently used scientific subroutines, ACML enables you to accelerate projects such as:
  • Weather modeling
  • Finite element analysis
  • Computational fluid dynamics
  • Financial analysis
  • Oil and gas applications
  • and many more...
Easy path to multi-threading
ACML's aggressively tuned OpenMP versions mean that you don't have to worry about managing sophisticated threading models or complex debugging. Whether you use dynamic or static linking, on 32- or 64-bit Windows, Linux or Solaris, multithreading just works. Multithreaded routines are available for the Level 3 BLAS, many LAPACK routines, and the 2D and 3D FFTs.

New! Please visit the ACML forum for FAQs, tips and tricks, and peer support.
New! Please visit the AMD Libraries blog for the latest information and updates on the AMD Core Math Library.

Send all ACML feedback to ACML Support.


Support for Third-Generation AMD Opteron™ Processors
The AMD Core Math Library (ACML) is specifically designed to support multithreading and other key features of AMD’s next-generation processors. ACML currently supports OpenMP, and future releases will expand its support for multi-platform, shared-memory multiprocessing. Beginning with the 4.0 release, ACML also features hand-tuned "Barcelona" support for the SGEMM and DGEMM matrix multiplication routines, the CFFT complex-complex Fast Fourier Transforms, and more. The 4.3.0 release adds Istanbul optimizations, including for the real-complex FFTs and the Level 1 BLAS routines.
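The distinction between real-complex and complex-complex transforms can be sketched as follows. For real input, the discrete Fourier transform satisfies X[n-k] = conj(X[k]), so only n/2 + 1 coefficients need to be computed and the rest can be reconstructed. The code below is a plain-Python illustration of that property using a naive O(n²) transform. The function names are hypothetical, and it does not reflect ACML's algorithms, interfaces, or storage format.

```python
import cmath


def dft(x):
    """Naive complex-complex DFT: all n output coefficients."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def real_dft_half(x):
    """Naive real-complex DFT: only the n // 2 + 1 non-redundant coefficients."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n // 2 + 1)
    ]


def expand_half_spectrum(half, n):
    """Rebuild the full spectrum of a real signal using X[n-k] = conj(X[k])."""
    full = list(half)
    for k in range(n // 2 + 1, n):
        full.append(half[n - k].conjugate())
    return full
```

For a real sequence of length 8, real_dft_half returns 5 coefficients, and expand_half_spectrum recovers the same 8 values that dft produces, up to rounding.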

Benchmarks
A simple set of benchmarks for a few key routines is included with ACML.
  • Download and install ACML.
  • Look in the performance directory under examples to find these benchmarks.
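As a rough guide to reading matrix-multiply benchmark results, performance is conventionally reported in GFLOPS using a count of 2 * m * n * k floating-point operations for an m x k by k x n product. The sketch below shows that arithmetic with a naive pure-Python multiply standing in for the routine under test. It is not one of the benchmarks shipped with ACML, and all function names in it are assumptions made for illustration.

```python
import time


def gemm_flops(m, n, k):
    """Conventional operation count for an (m x k) * (k x n) product."""
    return 2 * m * n * k


def gflops(m, n, k, seconds):
    """Convert a measured run time into GFLOPS."""
    return gemm_flops(m, n, k) / seconds / 1e9


def naive_matmul(a, b):
    """Stand-in for the routine being timed (matrices are lists of rows)."""
    k, n = len(b), len(b[0])
    return [
        [sum(row[p] * b[p][j] for p in range(k)) for j in range(n)]
        for row in a
    ]


def time_matmul(n, repeats=3):
    """Time an n x n multiply several times and report the best GFLOPS."""
    a = [[float(i + j) for j in range(n)] for i in range(n)]
    b = [[float(i - j) for j in range(n)] for i in range(n)]
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        naive_matmul(a, b)
        best = min(best, time.perf_counter() - start)
    return gflops(n, n, n, best)


if __name__ == "__main__":
    print(f"naive 64 x 64 multiply: {time_matmul(64):.4f} GFLOPS")
```

Taking the best of several runs reduces the influence of one-off effects such as cold caches. A tuned DGEMM should be expected to run orders of magnitude faster than this pure-Python stand-in.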