Developer Central
China
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support

Tools & SDKs

  • Heterogeneous Computing
    • Aparapi
    • APP Kernel Analyzer
    • APP Profiler
      • Getting Started
        • System Requirements
        • Installation
        • Successful Installation
        • Using The Visual Studio Plugin
        • Using the Command Line Interface
        • Known Issues
        • Support
      • User Guide
        • APP Profiler Session Explorer
        • APP Profiler Session
        • APP Profiler Timeline
        • APP Profiler Summary Pages
        • APP Profiler Code Viewer
        • APP Profiler Kernel Occupancy Viewer
        • APP Profiler Settings
        • APP Profiler Session Parameters
        • Description of Configuration Files
        • APP Profiler Kernel Occupancy
        • CLPerfMarkerAMD Library
    • Accelerated Parallel Processing (APP) SDK
      • Downloads
        • Download Archive
      • System Requirements & Driver Compatibility
      • Documentation
      • Bolt C++ Template Library
        • AMD Bolt Sample Code Contest Official Rules
      • Samples & Demos
      • Compute System Recommendations
      • Intro OpenCL Tutorial
      • Open Source Libraries
      • Support
    • Accelerated Parallel Processing Math Libraries (APPML)
    • CodeAnalyst Performance Analyzer
      • CodeAnalyst for Linux®
      • CodeAnalyst for Windows®
      • PMU Extension Driver
      • CodeAnalyst Archive
    • CodeXL
    • gDEBugger
      • gDEBugger User Guide
        • Getting started with gDEBugger
        • Toolbars
        • Views
        • Dialogs
        • How Do I…
        • FAQ
        • Supported Extensions
        • System Requirements & Installation
        • Legal Information
        • Contact Information
  • CPU Development
    • Advanced Platform Management Link (APML)
    • Core Math Library (ACML)
      • Downloads & Resources
        • Building with ACML
        • How to use ACML with different versions of GCC/GFORTRAN
      • Archive Downloads
      • Product Features
      • Redistribution Agreements
    • LibM
      • ACML_MV
      • LibM Functions
    • Open64 SDK
      • Building with ACML
    • SimNow™ Simulator
    • Tools for DMTF DASH
    • x86 Open64 Compiler Suite
      • Developer Guide
      • Apps & Libraries Built by x86 Open64
      • Inactive Files
  • Graphics Development
    • Display Library (ADL) SDK
    • ATI GPU Services (AGS) Library
    • GPU PerfStudio 2
      • API Trace
      • Frame Debugger
      • Frame Profiler
      • Shader Debugger
      • Shader Editor
      • Previous versions
    • GPU ShaderAnalyzer
    • GPUPerfAPI
    • OpenGL ES SDK
    • Quad-Buffer SDK
    • FirePro SDK
      • FirePro News & Events
    • Radeon™ SDK
  • Open Source
    • GCC & GNU Toolchains
    • OpenCL™ Emulator-Debugger
    • Tapper

Home > Tools & SDKs > Heterogeneous Computing > APP Profiler

APP Profiler

END-OF-LIFE NOTICE

AMD APP Profiler has been succeeded by AMD’s CodeXL. With the exception of critical bug fixes, there will be no new versions of APP Profiler. For new features, continuous support and a unified developer tools suite, please visit AMD’s CodeXL web page and check out the CodeXL Forum.

Overview

The AMD APP Profiler is a performance analysis tool that gathers data from the OpenCL™ run-time and AMD Radeon™ GPUs during the execution of an OpenCL™ application. We can then use this information to discover bottlenecks in an application and find ways to optimize the application’s performance for AMD platforms.

Go to Downloads

OpenCL and the OpenCL logo are trademarks of Apple Inc. used by permission by Khronos.

What’s New

What’s New in Version 2.5

  • Support for AMD APP SDK v2.7.
  • Support for OpenCL™ 1.2.
  • Support for collecting performance counters on APU devices.
  • Full support for profiling with AMD Radeon™ HD7000 series GPUs based on Graphics Core Next/Southern Islands:
    • Added support for kernel occupancy analysis.
    • Added support for collecting performance counters for DirectCompute (DirectX 11) applications.
    • Addition of SALUBusy counter.
    • Fixed value reported for VALUBusy counter.
    • The values reported for LDSFetchInsts and LDSWriteInsts counters were inaccurate on AMD Radeon™ HD7000 series GPUs; thus for those GPUs, those two counters have been replaced by a single LDSInsts counter.
    • Fixed display of kernel ISA.
  • Improved OpenCL™ analysis module:
    • Added detection of deprecated OpenCL™ APIs.
  • Added support for showing source and destination location, as well as zero-copy status for memory transfers initiated using clEnqueueMapBuffer or clEnqueueMapImage. This information is displayed in the API Trace view.
  • Added support for Microsoft® Visual Studio® projects that use User-defined Macros in the project settings.
  • Fixed the –workingdirectory (-w) command line switch (set current directory) on Linux.
  • Fixed some problems with importing previously-generated profile results into Microsoft® Visual Studio®.
  • Changed the default installation directory on Windows to %PROGRAMFILES(X86)%\AMD\AMD APP Profiler to make it more consistent with other AMD tools (i.e gDEBugger and CodeAnalyst)

Features

  • Collect OpenCL™ Application Trace
    • View and debug the input parameters and output results for all OpenCL™ API calls
    • Search the API calls
    • Navigate to the source code that called an OpenCL™ API
    • Specify which OpenCL™ APIs will be traced
  • Collect GPU Performance Counters of AMD Radeon™ graphics cards
    • Show kernel resource usages
    • Show the number of instructions executed by the GPU
    • Show the GPU utilization
    • Show the GPU memory access characteristics
    • Measure kernel execution time
  • OpenCL™ Timeline visualization
    • Visualize the application high level structure
    • Visualize kernel execution and data transfer operations
    • Visualize host code execution
    • Annotate host code in the timeline with performance markers using the included CLPerfMarkerAMD library
  • OpenCL™ Application Summary pages
    • Find incorrect or inefficient usage of the OpenCL™ API using the OpenCL™ analysis module
    • Find the API hotspots
    • Find the bottleneck between kernel execution and data transfer operations
    • Find the top 10 data transfer and kernel execution operations
  • OpenCL™ Kernel Occupancy Viewer
    • Calculates and displays a kernel occupancy number, which estimates the number of in-flight wavefronts on a compute unit as a percentage of the theoretical maximum number of wavefronts that the compute unit can support
    • Find out which kernel resource (GPR usage, LDS size, or Work-group size) is currently limiting the number of in-flight wavefronts
    • Displays graphs showing how kernel occupancy would be affected by changes in each kernel resource
  • Display the AMD IL and ISA (hardware disassembly) code of the kernel for OpenCL™ kernels and DXASM code for DirectCompute kernels.

Getting Started

Requirements

  • Microsoft Windows Vista or 7 (32 bit/64 bit) or Linux 32-bit/64-bit
  • [Optional] Microsoft Visual Studio 2008 and 2010 (Standard/Professional/Team System Edition)
  • To profile OpenCL™ applications:
    • AMD APP SDK v2.7 or later
    • [GPU device] AMD Catalyst with OpenCL™ GPU support (12.4 or newer)
    • [GPU device] AMD Radeon™ HD 4000 series or newer
  • To profile DirectCompute applications:
    • Microsoft DirectX run-time (June 2010 or later)
    • AMD Radeon™ HD 5000 series or newer

AMD APP Profiler Screenshot

Support

  • AMD APP Profiler Knowledge Base
  • GPU Developer Tool Forum

Resources

  • AMD APP Profiler User Guide (coming soon!)
  • AMD_Accelerated_Parallel_Processing_OpenCL_Programming_Guide.pdf
  • AMD APP Profiler’s article at Siggraph 2010
  • OpenCL™ Zone

Download

File Name Version Size Launch Date OS Bitness Description
Linux®
AMDAPPProfiler-v2.5.1804-lnx.tgz 2.5 3 MB 05/16/2012 32-bit/64-bit AMD APP Profiler for OpenCL on Linux platforms
Windows®
AMDAPPProfiler-v2.5.1777.msi 2.5 8 MB 05/16/2012 32-/64-bit AMD APP Profiler for OpenCL on Windows platforms

Get the hcNewsFlash.

Your email address:

No SPAM.
Easy unsubscribe.

HSA is going to rock your world.

Learn more about Heterogeneous System Architecture.

Got Questions?

Ask the Developer Forums Community. They’ve got answers.

Tools & SDKs

  • Heterogeneous Computing
    • Aparapi
    • APP Kernel Analyzer
    • APP Profiler
      • Getting Started
        • System Requirements
        • Installation
        • Successful Installation
        • Using The Visual Studio Plugin
        • Using the Command Line Interface
        • Known Issues
        • Support
      • User Guide
        • APP Profiler Session Explorer
        • APP Profiler Session
        • APP Profiler Timeline
        • APP Profiler Summary Pages
        • APP Profiler Code Viewer
        • APP Profiler Kernel Occupancy Viewer
        • APP Profiler Settings
        • APP Profiler Session Parameters
        • Description of Configuration Files
        • APP Profiler Kernel Occupancy
        • CLPerfMarkerAMD Library
    • Accelerated Parallel Processing (APP) SDK
      • Downloads
        • Download Archive
      • System Requirements & Driver Compatibility
      • Documentation
      • Bolt C++ Template Library
        • AMD Bolt Sample Code Contest Official Rules
      • Samples & Demos
      • Compute System Recommendations
      • Intro OpenCL Tutorial
      • Open Source Libraries
      • Support
    • Accelerated Parallel Processing Math Libraries (APPML)
    • CodeAnalyst Performance Analyzer
      • CodeAnalyst for Linux®
      • CodeAnalyst for Windows®
      • PMU Extension Driver
      • CodeAnalyst Archive
    • CodeXL
    • gDEBugger
      • gDEBugger User Guide
        • Getting started with gDEBugger
        • Toolbars
        • Views
        • Dialogs
        • How Do I…
        • FAQ
        • Supported Extensions
        • System Requirements & Installation
        • Legal Information
        • Contact Information
  • CPU Development
    • Advanced Platform Management Link (APML)
    • Core Math Library (ACML)
      • Downloads & Resources
        • Building with ACML
        • How to use ACML with different versions of GCC/GFORTRAN
      • Archive Downloads
      • Product Features
      • Redistribution Agreements
    • LibM
      • ACML_MV
      • LibM Functions
    • Open64 SDK
      • Building with ACML
    • SimNow™ Simulator
    • Tools for DMTF DASH
    • x86 Open64 Compiler Suite
      • Developer Guide
      • Apps & Libraries Built by x86 Open64
      • Inactive Files
  • Graphics Development
    • Display Library (ADL) SDK
    • ATI GPU Services (AGS) Library
    • GPU PerfStudio 2
      • API Trace
      • Frame Debugger
      • Frame Profiler
      • Shader Debugger
      • Shader Editor
      • Previous versions
    • GPU ShaderAnalyzer
    • GPUPerfAPI
    • OpenGL ES SDK
    • Quad-Buffer SDK
    • FirePro SDK
      • FirePro News & Events
    • Radeon™ SDK
  • Open Source
    • GCC & GNU Toolchains
    • OpenCL™ Emulator-Debugger
    • Tapper

©2013 Advanced Micro Devices, Inc. OpenCL and the OpenCL logo are trademarks of Apple, Inc., used with permission by Khronos.

  • Contact Us
  • |
  • Careers
  • |
  • Site Map
  • |
  • Terms and Conditions
  • |
  • Privacy
  • |
  • Trademarks