NEW! AOCL 2.1 is now available
AOCL are a set of numerical libraries tuned specifically for AMD EPYC™ processor family. They have a simple interface to take advantage of the latest hardware innovations. The tuned implementations of industry standard math libraries enable fast development of scientific and high-performance computing projects.
Highlights of AOCL 2.1 optimized for AMD EPYC™ 7002 Series Processors
- AMD BLIS
- Improved performance of SGEMM and DGEMM for small and skinny size matrices
- Improved TRSM single thread performance for small and skinny size matrices
- BLIS build now supports both AMD “Zen” and “Zen 2” configurations with auto config option
- Support for C++ Template APIs for all BLAS functions
- AMD FFTW
- Improved performance of the FFT kernels for AVX and AVX2
- Improved performance of copy function used in rank-0 transform and buffering plans.
- Several build configuration updates that work with –enable-amd-opt option including long double and quad precision support, CFLAGS, AOCC/clang compiler support
- AMD LibM
- Improved performance of powf(), logf() and expf()
- Minor bug fixes in fabs(), fmod() and remainder() for compliance with IEEE
- AMD libFLAME
- Support for C++ Template APIs for all LAPACK functions
- Includes LAPACKE source in the libFLAME source directory
- AMD ScaLAPACK – NEW
- New library in AOCL suite
- ScaLAPACK library for AMD integrated with optimized version of BLIS and libFLAME libraries that enables high performance dense linear algebra operations
- AMD’s version of ScaLAPACK source enables easy linking of optimized BLIS and libFLAME libraries
- Bug fixes in ScaLAPACK test suite for few of the complex and double complex types test cases
For any issues or queries regarding the libraries, please contact email@example.com
BLIS is a portable open-source software framework for instantiating high-performance Basic Linear Algebra Subprograms (BLAS) – like dense linear algebra libraries. The framework was designed to isolate essential kernels of computation that, when optimized, immediately enable optimized implementations of most of its commonly used and computationally intensive operations. Select kernels have been optimized for the AMD EPYCTM processor family by AMD and others.
Source code is available on GitHub https://github.com/amd/blis.
FFTW is a comprehensive collection of fast C routines for computing the Discrete Fourier Transform (DFT) and various special cases thereof. It is an open-source implementation of the Fast Fourier transform algorithm. It can compute transforms of real and complex-values arrays of arbitrary size and dimension. An AMD optimized FFTW that includes selective kernels and routines optimized for the AMD EPYC™ processor family is available.
Source code is available on GitHub https://github.com/amd/amd-fftw
AMD LibM is a software library containing a collection of basic math functions optimized for x86-64 processor-based machines. It provides many routines from the list of standard C99 math functions. Applications can link into AMD LibM library and invoke math functions instead of compiler’s math functions for better accuracy and performance.
libFLAME is a portable library for dense matrix computations, providing much of the functionality present in Linear Algebra Package (LAPACK). It includes a compatibility layer, FLAPACK, the FORTRAN interface, which includes complete LAPACK implementation. The library provides scientific and numerical computing communities with a modern, high-performance dense linear algebra library that is extensible, easy to use, and available under an open source license. In combination with the BLIS library which includes optimizations for the AMD EPYCTM processor family, libFLAME enables running high performing LAPACK functionalities on AMD platforms.
Source code is available on GitHub https://github.com/amd/libflame.
ScaLAPACK is a library of high-performance linear algebra routines for parallel distributed memory machines. It depends on external libraries including BLAS and LAPACK for Linear Algebra computations. AMD’s optimized version of ScaLAPACK enables using BLIS and libFLAME library that have optimized dense matrix functions and solvers for AMD EPYCTMprocessor family CPUs.
ScaLAPACK can be installed either from source or pre-built binaries.
ScaLAPACK for AMD source can be cloned from Github link https://github.com/amd/scalapack. Pre-built AMD optimized ScaLAPACK can be installed from the AOCL master installer tar file available under the Download section below.
AMD Random Number Generator Library is a pseudorandom number generator library. It provides a comprehensive set of statistical distribution functions and various uniform distribution generators (base generators) including Wichmann-Hill and Mersenne Twister. The library contains five base generators and twenty-three distribution generators, in addition to hooks for user provided base generator.
The AMD Secure Random Number Generator (RNG) is a library that provides APIs to access the cryptographically secure random numbers generated by AMD’s hardware-based random number generator implementation. These are high quality, robust random numbers that are designed to be suitable for cryptographic applications.
AMD Libraries Forum
The following installer extracts and installs all the AMD CPU Libraries on to the developer machine.