Developer Central
China  |  India
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support

Tools & SDKs

  • Heterogeneous Computing
    • Aparapi
    • APP Kernel Analyzer
    • APP Profiler
      • Getting Started
        • System Requirements
        • Installation
        • Successful Installation
        • Using The Visual Studio Plugin
        • Using the Command Line Interface
        • Known Issues
        • Support
      • User Guide
        • APP Profiler Session Explorer
        • APP Profiler Session
        • APP Profiler Timeline
        • APP Profiler Summary Pages
        • APP Profiler Code Viewer
        • APP Profiler Kernel Occupancy Viewer
        • APP Profiler Settings
        • APP Profiler Session Parameters
        • Description of Configuration Files
        • APP Profiler Kernel Occupancy
        • CLPerfMarkerAMD Library
    • Accelerated Parallel Processing (APP) SDK
      • Downloads
        • Download Archive
      • System Requirements & Driver Compatibility
      • Documentation
      • Bolt C++ Template Library
        • AMD Bolt Sample Code Contest Official Rules
      • Samples & Demos
      • Compute System Recommendations
      • Intro OpenCL Tutorial
      • Open Source Libraries
      • Support
    • Accelerated Parallel Processing Math Libraries (APPML)
    • CodeAnalyst Performance Analyzer
      • CodeAnalyst for Linux®
      • CodeAnalyst for Windows®
      • PMU Extension Driver
      • CodeAnalyst Archive
    • CodeXL
    • gDEBugger
      • gDEBugger User Guide
        • Getting started with gDEBugger
        • Toolbars
        • Views
        • Dialogs
        • How Do I…
        • FAQ
        • Supported Extensions
        • System Requirements & Installation
        • Legal Information
        • Contact Information
  • CPU Development
    • Advanced Platform Management Link (APML)
    • Core Math Library (ACML)
      • Downloads & Resources
        • Building with ACML
        • How to use ACML with different versions of GCC/GFORTRAN
      • Archive Downloads
      • Product Features
      • Redistribution Agreements
    • LibM
      • ACML_MV
      • LibM Functions
    • Open64 SDK
      • Building with ACML
    • SimNow™ Simulator
    • Tools for DMTF DASH
    • x86 Open64 Compiler Suite
      • Developer Guide
      • Apps & Libraries Built by x86 Open64
      • Inactive Files
  • Graphics Development
    • Display Library (ADL) SDK
    • ATI GPU Services (AGS) Library
    • GPU PerfStudio 2
      • API Trace
      • Frame Debugger
      • Frame Profiler
      • Shader Debugger
      • Shader Editor
      • Previous versions
    • GPU ShaderAnalyzer
    • GPUPerfAPI
    • OpenGL ES SDK
    • Quad-Buffer SDK
    • FirePro SDK
      • FirePro News & Events
    • Radeon™ SDK
  • Open Source
    • GCC & GNU Toolchains
    • OpenCL™ Emulator-Debugger
    • Tapper

Home > Tools & SDKs > CPU Development > Core Math Library (ACML) > Archive Downloads

Archive Downloads

This page contains previous versions of ACML. Click the browser’s back button to see the current ACML versions. Note that documentation for previous releases can be found in the corresponding installation files. For Windows®, the package must be installed to access the documentation.

Go to Downloads

Release Notes

Features introduced with previous ACML releases:

  • Version 5.3.0:
    • Added FMA3 code paths for many BLAS and FFT routines.
    • Updated the LAPACK code to version 3.4.0
    • Improved performance for complex-complex out-of-place FFTs
    • Added Fast Malloc to more BLAS routines.
      • Fast Malloc is now enabled by default, with no need to set an environment variable.
      • routines affected include *GEMM (except CGEMM), *SYR2K, *GEMV, *GER, *TRMV, and *TRSV.
      •     - Fast Malloc is only enabled for Linux
    • Expanded coverage of the FFTW Wrapper examples to include double precision routines
    • Added a set of FFTW Wrappers to assist in using ACML FFTs in applications written to use FFTW. These wrappers are provided as source code and are found in the ACML example directories. Examples are provided for a useful subset of the FFTW3 and FFTW2 routines
  • Version 5.2.0:
    • Improved performance for some common applications, using various optimizations
      • Improved dgemm performance for small to medium problem sizes
      • Fast malloc enabled in the single threaded library
      • Fast malloc added to dtrsm. Note that the ACML_FAST_MALLOC environment variable must be set by the user to enable these optimizations.
      • Changed openmp threading behavior in dtrsm, dpotrf, and dgetrf to limit the number of threads for small problems.
    • Enabled better default FFT radix plans for specific HPCC problem sizes. When using HPCC with these problem sizes, the resulting FFT problem size will choose an optimal FFT radix plan, eliminating the need to use Mode100 planning. This dramatically improves performance
    • “Run-anywhere” builds use fma4 instructions for key routines, based on CPUID feature bits. This allows these libraries to get reasonable performance on AMD Bulldozer CPUs, while still running properly on other processors
    • Added a set of FFTW Wrappers to assist in using ACML FFTs in applications written to use FFTW. These wrappers are provided as source code and are found in the ACML example directories. Examples are provided for a useful subset of the FFTW3 and FFTW2 routines
  • Version 5.1.0:
    • In addition to SGEMM and DGEMM, CGEMM and ZGEMM have been tuned for AMD Family 15h processors.
    • Real to Complex and Complex to Real FFTs (single and double precision) have been tuned for AMD Family 15 processors.
  • Version 4.4.0:
    • Performance of ZGEMM has been further improved. This performance improvement carries through to other Level 3 BLAS and LAPACK routines that call ZGEMM.
    • Assembly language kernels used by the real-complex FFT routines csfft, dzfft, scfft and zdfft have been re-tuned for AMD Family 10h processors, providing significant performance increases.
  • Version 3.6:
    • LAPACK code update
    • New OpenMP multithreading capability for many LAPACK routines
    • Intel FORTRAN compatible Windows® 64 and Linux® 64 libraries
  • Download current versions of ACML.

By checking this box, you agree to abide by the terms and conditions set forth in the end-user license agreement, above. If you do not agree to abide by these terms and conditions, you are not permitted to use the site or download materials from the site.

Note that the new End User License agreement supersedes the agreement found in previous ACML releases.

*Please note: Read EULA Agreement before downloading. If you are considering bundling ACML along with your products, you need a separate redistribution agreement. Refer to the ACML redistribution agreement page for more information.

Downloads

File Name Version Size Launch Date OS Bitness Description
GFORTRAN
acml-4-4-0-gfortran-32bit.tgz 4.4.0 20 MB 02/22/2010 Linux® 32-Bit AMD Core Math Library (ACML) for Linux built with GFORTRAN
acml-4-4-0-gfortran-64bit.tgz 4.4.0 26 MB 02/22/2010 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with GFORTRAN
acml-4-4-0-gfortran-64bit-int64.tgz 4.4.0 25 MB 02/22/2010 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with GFORTRAN
acml-5-3-0-gfortran-64bit.tgz 5.3.0 71 MB 12/12/2012 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with GFORTRAN
acml-5-3-0-gfortran-64bit-int64.tgz 5.3.0 70 MB 12/12/2012 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with GFORTRAN
Gnu
acml-3.6.0-32-g77.exe 3.6.0 11 MB Windows® 64-Bit AMD Core Math Library (ACML) for Windows®. Built with G77.
acml-3-6-0-gnu-32bit.tgz 3.6.0 10 MB Linux® 32-Bit AMD Core Math Library (ACML) for Linux built with G77
acml-3-6-0-gnu-64bit.tgz 3.6.0 11 MB Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with G77
Intel Fortran
acml-5-3-0-ifort-64bit.tgz 5.3.0 106 MB 12/12/2012 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with Intel® FORTRAN
acml-5-3-0-ifort-64bit-int64.tgz 5.3.0 104 MB 12/12/2012 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with Intel® FORTRAN
NAG
acml-4-4-0-nag-32bit.tgz 4.4.0 13 MB 03/01/2010 Linux® 32-Bit AMD Core Math Library (ACML) for Linux® built with NAGWare F95
acml-4-4-0-nag-64bit.tgz 4.4.0 19 MB 03/01/2010 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with NAGWare F95
acml-4-4-0-nag-64bit-int64.tgz 4.4.0 19 MB 03/01/2010 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with NAGWare F95
Open64
acml-5-1-0-open64-64bit.tgz 5.1.0 85 MB 12/23/2011 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with Open64
acml-5-1-0-open64-64bit-int64.tgz 5.1.0 84 MB 12/23/2011 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with Open64
PGI
acml-5.3.0-win64.exe 5.3.0 81 MB 12/12/2012 Windows® 64-Bit AMD Core Math Library (ACML) for Windows®. Built with PGI.
acml-5.3.0-win64-int64.exe 5.3.0 80 MB 12/12/2012 Windows® 64-Bit AMD Core Math Library (ACML) for Windows®. Built with PGI.
acml-5-3-0-pgi-64bit-int64.tgz 5.3.0 143 MB 12/12/2012 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with PGI
acml-5-3-0-pgi-64bit.tgz 5.3.0 144 MB 12/12/2012 Linux® 64-Bit AMD Core Math Library (ACML) for Linux built with PGI

Get the hcNewsFlash.

Your email address:

No SPAM.
Easy unsubscribe.

HSA is going to rock your world.

Learn more about Heterogeneous System Architecture.

Got Questions?

Ask the Developer Forums Community. They’ve got answers.

Tools & SDKs

  • Heterogeneous Computing
    • Aparapi
    • APP Kernel Analyzer
    • APP Profiler
      • Getting Started
        • System Requirements
        • Installation
        • Successful Installation
        • Using The Visual Studio Plugin
        • Using the Command Line Interface
        • Known Issues
        • Support
      • User Guide
        • APP Profiler Session Explorer
        • APP Profiler Session
        • APP Profiler Timeline
        • APP Profiler Summary Pages
        • APP Profiler Code Viewer
        • APP Profiler Kernel Occupancy Viewer
        • APP Profiler Settings
        • APP Profiler Session Parameters
        • Description of Configuration Files
        • APP Profiler Kernel Occupancy
        • CLPerfMarkerAMD Library
    • Accelerated Parallel Processing (APP) SDK
      • Downloads
        • Download Archive
      • System Requirements & Driver Compatibility
      • Documentation
      • Bolt C++ Template Library
        • AMD Bolt Sample Code Contest Official Rules
      • Samples & Demos
      • Compute System Recommendations
      • Intro OpenCL Tutorial
      • Open Source Libraries
      • Support
    • Accelerated Parallel Processing Math Libraries (APPML)
    • CodeAnalyst Performance Analyzer
      • CodeAnalyst for Linux®
      • CodeAnalyst for Windows®
      • PMU Extension Driver
      • CodeAnalyst Archive
    • CodeXL
    • gDEBugger
      • gDEBugger User Guide
        • Getting started with gDEBugger
        • Toolbars
        • Views
        • Dialogs
        • How Do I…
        • FAQ
        • Supported Extensions
        • System Requirements & Installation
        • Legal Information
        • Contact Information
  • CPU Development
    • Advanced Platform Management Link (APML)
    • Core Math Library (ACML)
      • Downloads & Resources
        • Building with ACML
        • How to use ACML with different versions of GCC/GFORTRAN
      • Archive Downloads
      • Product Features
      • Redistribution Agreements
    • LibM
      • ACML_MV
      • LibM Functions
    • Open64 SDK
      • Building with ACML
    • SimNow™ Simulator
    • Tools for DMTF DASH
    • x86 Open64 Compiler Suite
      • Developer Guide
      • Apps & Libraries Built by x86 Open64
      • Inactive Files
  • Graphics Development
    • Display Library (ADL) SDK
    • ATI GPU Services (AGS) Library
    • GPU PerfStudio 2
      • API Trace
      • Frame Debugger
      • Frame Profiler
      • Shader Debugger
      • Shader Editor
      • Previous versions
    • GPU ShaderAnalyzer
    • GPUPerfAPI
    • OpenGL ES SDK
    • Quad-Buffer SDK
    • FirePro SDK
      • FirePro News & Events
    • Radeon™ SDK
  • Open Source
    • GCC & GNU Toolchains
    • OpenCL™ Emulator-Debugger
    • Tapper

©2013 Advanced Micro Devices, Inc. OpenCL and the OpenCL logo are trademarks of Apple, Inc., used with permission by Khronos.

  • Contact Us
  • |
  • Careers
  • |
  • Site Map
  • |
  • Terms and Conditions
  • |
  • Privacy
  • |
  • Trademarks