Developer Central
China  |  India
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support

Resources

  • Heterogeneous Computing
    • OpenCL™ Zone
      • Getting Started with OpenCL
      • Tools and Libraries
      • Programming in OpenCL™
        • Introductory Exercises and Tutorials
        • Debugging Applications
        • Optimizing Applications
        • Benchmarking Performance
        • Porting CUDA Applications to OpenCL™
        • Image Convolution Using OpenCL™
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 2
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 3
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 4
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 5
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 6
      • Training & Events
        • OpenCL™ Course: Introduction to OpenCL™ Programming
        • OpenCL™ Course: Introductory Tutorial to OpenCL™ for HPC at SAAHPC’10
        • OpenCL Programming Webinar Series
        • OpenCL™ On-Demand Webinars
      • Articles & Papers
      • Getting Started – Software & Hardware
    • What is Heterogeneous Computing?
    • What is Heterogeneous System Architecture (HSA)?
      • A Heterogenius Architecture
  • Documentation Library
  • Hardware & Drivers
    • CCC Driver Details
    • “Magny-Cours” Zone
    • ATI Catalyst™ PC Vendor ID (1002) LI
  • AFDS Videos
  • Documentation & Articles
    • Develop Blazing Fast Code with Microsoft Visual Studio® 2008 and AMD Tools
    • Exploiting Multi-Core Processors in Windows Vista
    • Performance Optimization of Windows Applications on AMD Processors, Part I
    • Performance Optimization of Windows Applications on AMD Processors, Part II
    • Ten Things Developers Should Know About Windows 7
    • The Windows NUMA API-What It Is and Why You Care
    • Articles & Whitepapers
      • OpenCL™ Optimization Case Study: Diagonal Sparse Matrix Vector Multiplication Test
      • Barcelona’s Innovative Architecture Is Driven by a New Shared Cache
      • Bulk Encryption on GPUs
      • Develop Blazing Fast Code with Microsoft Visual Studio® 2008 and AMD Tools
      • Going to Barcelona: A Modern Architecture for Breakthrough Software Performance
      • Introduction to “Magny-Cours”
      • Java Performance when Debugging is Enabled
      • JPEG Decoding with Run-Length Encoding: A CPU and GPU Approach
      • New Round-to-Even Technique for Large-scale Data and Its Application in Integer Scaling
      • OpenCL™ and the AMD APP SDK
      • OpenCL™ and the AMD APP SDK v2.4
      • OpenCL™ Optimization Case Study Fast Fourier Transform – Part 1
      • OpenCL™ Optimization Case Study Fast Fourier Transform – Part II
      • OpenCL™ Optimization Case Study: Simple Reductions
      • OpenCL™ Optimization Case Study: Support Vector Machine Training
      • Tiled Convolution: Fast Image Filtering
    • Developer Guides & Manuals
    • Specifications & Technical Bulletins
    • Case Studies
    • Conference Presentations
      • GPU Technical Publications
      • GPU Technology Papers
    • Videos
      • AMD Developer Inside Track
      • Intro to CodeAnalyst
      • OpenCL™ Technical Overview
      • GPU Demo Videos
      • AMD & Sun Technology
      • AMD Opteron 6100 Series: A Developer’s Perspective
      • Software Optimization Video Series
      • Xen Summit North America 2010
    • Java™ Zone
    • Knowledge Base
    • OpenGL® Zone
      • OpenGL® Specifications
    • Samples & Demos
      • Processor and Core Enumeration Using CPUID
      • GPU Demos
        • Radeon™ HD 7900 Series Graphics Real-Time Demos
        • Radeon™ HD 6900 Series Graphics Real-Time Demo
        • Radeon™ HD 5000 Series Graphics Real-Time Demos
        • Radeon™ HD 4800 Series Real-Time Demos
        • FireGL™ V8600 PCI-Express Real-Time Demos
        • Radeon™ HD 3000 Series Real-Time Demo
        • Radeon™ HD 2000 Series Real-Time Demos
  • India Developer Zone
    • India University Courses
    • University Kit & Book
    • C-DAC “Think Parallel” participants visits at AMD – 20th June, 2012
    • C-DAC HeGaPa 2012 Conference
    • Heterogeneous computing Jobs in AMD India
  • Archive
    • Events
      • AMD OpenCL Coding Competition
      • Real-Time Image Processing for Autonomous Learning and Control within 3D Virtual Worlds
      • Semi-Supervised Learning-Based Method for Adaptive Shadow Detection
      • AMD OpenCL™ Coding Competition
      • Real-time Video Effects with AMD & Kinect
      • Numerical Simulation of an X-Ray Generator
    • AppShowcase Archive
    • Archived Tools
      • Video Player Test
      • CPU Tools Archive
        • 128-Bit SSE5 Instruction Set
        • AMD String Library
        • Framewave Project
        • SSEPlus Project
      • GPU Tools Archive
        • ATI Stream Software Development Kit (SDK) v2.0 Beta Program
        • AMD Tootle
        • ASHLI – Advanced Shading Language Interface
        • ATI Radeon™ SDK
        • ATI Stream Software Development Kit (SDK) v1.4-beta
          • ATI Stream SDK MD5 Checksums
        • ATI_Compress
        • CubeMapGen
        • AMD GPU MeshMapper
        • GPU PerfStudio
        • Normal Mapper
        • RenderMonkey™ Toolsuite
          • RenderMonkey Toolsuite – IDE Features
          • RenderMonkey™ Toolsuite – Testimonials
          • RenderMonkey™ Toolsuite – SDK
        • The Compressonator
        • TruForm Resources
          • TruForm™ FAQ
      • Installing GCC on Ubuntu 8.04

Home > Resources > Heterogeneous Computing > OpenCL™ Zone > Training & Events > OpenCL Programming Webinar Series

OpenCL Programming Webinar Series


Parallel computing is being taken to the next level by leveraging the GPU for general computing tasks. AMD is fully behind this heterogeneous computing revolution with our hardware, software tools & libraries, and support of open standards. Get trained by OpenCL™ and heterogeneous compute experts on the latest technology advances, standards, and best practices for heterogeneous compute.Stay tuned on upcoming webinar topics through the AMD hcNewsFlash Newsletter.

  • View previously recorded webinars

New Live Webinar Topics – Register Today!

Webinar Topics Registration Links
by
Region/Date/Time

Previously Recorded Webinars:

Webinar Topics Videos and Presentations
CodeXL Overview and Demonstration

AMD CodeXL is a comprehensive tool suite that enables developers to harness the benefits of AMD CPUs, GPUs and APUs. It includes powerful GPU debugging, comprehensive GPU and CPU profiling, and static OpenCL™ kernel analysis capabilities, enhancing accessibility for software developers to enter the era of heterogeneous computing.
Performance Evaluation of APARAPI Using Real World Applications

Java APARAPI (Java A PARallel API) allows Java developers to take advantage of the computational power of GPU and APU devices by executing java parallel code fragments on the GPU rather than being confined to the local CPU. This presentation aims at performance evaluation of APARAPI for execution of parallel Java code on GPU via OpenCL. Performance analysis is done by running real world problems programmed in Java using Aparapi. Each program is written in multi-threaded java to have proper comparison. There will around 15 real world programs which are commonly used and well known. This also have some tuning done in the APARAPI library.
Overview of HSAIL – the basis for implementing an HSA platform agnostic open-source OpenCL runtimeHSAIL is a new virtual byte code and virtual machine designed for parallel compute on heterogeneous devices. HSAIL makes it easy to compile high performance code both for current and future architectures. HSAIL programs will run unchanged on future hardware . Unlike AMDIL which is the graphics byte code, HSAIL has been architected to support modern high level programming languages such as Java and C++. This talk will introduce HSAIL at a high level, go over the virtual machine, Next we will talk about the compilation model, the reasons for a byte code rather than an exposed ISA and how HSAIL opens up HSA hardware to compiler and tool developers. We will review how HSAIL is different from PTX/LLVM and Java Byte code. Finally we will go over the one HSAIL important aspects– the memory model. Unlike previous GPU byte codes, the HSAIL memory model uses a formal design based on acquire/release semantics.
Graphics Core Next Architecture OverviewGCN is Designed to push not only the boundaries of DirectX® 11 gaming, the GCN Architecture is also AMD’s first design specifically engineered for general computing. Equipped with up to 32 compute units (2048 stream processors), each containing a scalar coprocessor, AMD’s 28nm GPUs are more than capable of handling workloads-and programming languages-traditionally exclusive to the processor. Coupled with the dramatic rise of GPU-aware programming languages like C++ AMP and OpenCL™, the GCN Architecture is truly the right architecture for the right time. Participate in this webinar to learn how you can take advantage of this new architecture in your GPGPU programs. Slide Deck
MXPA – The Multicore Cross-Platform Architecture for Performance-Portable Computing (Guest Presenter from MulticoreWare Inc.)We believe that one OpenCL source implementation should perform well on all architectures. MXPA is an OpenCL runtime and compiler enabling the same code you optimized for your GPU to get portable performance for architectures from multicore x86 to ARM to DSPs.The key features of MXPA are:

  1. An installable OpenCL platform for multicore x86 with performance comparable or superior to existing implementations, due to drastic reductions barrier synchronization costs, and effective SIMD usage of any native vector width
  2. Standalone development libraries for statically compiling and linking the MXPA runtime with applications, enabling delivery of complete OpenCL applications or libraries without dependencies on a client-installed runtime or requirements to distribute uncompiled source code
  3. Integration with other MulticoreWare OpenCL tools, such as GMAC, making it even easier to write OpenCL code for peak performance
Heterogeneous Computing Tips & Tricks with OpenCL™This webinar will focus on a comprehensive list of tips and tricks that AMD performance engineers use to help get the most out of their heterogeneous computing coding time and code performance.
Quickly Optimize OpenCL™ Applications with SlotMaximizer (Guest Presenter from MulticoreWare Inc.)SlotMaximizer is a transformation tool that automatically tunes OpenCL™ kernels, helping to increase developer productivity. It aids developers to obtain increased performance, higher throughput, and better hardware utilization from their kernels with minimal effort while maintaining a small, readable and maintainable code base.SlotMaximizer enables developers to focus on their original problems and algorithm strategies and leave the details of optimizing the code to the compiler. SlotMaximizer is already incorporated into the AMD Catalyst™ drivers as a preview and can be used by anyone developing applications using the AMD APP SDK.
Advanced OpenCL Debugging using AMD gDEBuggerThis webinar will cover profiling OpenCL with CodeAnalyst. We will start with an overview of the features, types of analysis performed and include an example. View Video
Heterogeneous Compute Features of AMD CodeAnalyst Performance AnalyzerDeveloping robust parallel computing applications is difficult. In this talk we will introduce the audience to gDEBugger, an OpenCL kernel source code debugger. We will display advanced debugging techniques that help locate hard to find OpenCL related bugs. View Video
Coordinating OpenCL Computations on one or more Heterogeneous Devices, presented by guest speaker Rob Farber (3 of 3)This webinarcontinues the discussion of the nine article OpenCL Portable Parallelism series by Rob Farber onThe Code Project. Articles4, 5, and 8 will be discussed demonstrating how to concisely utilize multiple command queues and to coordinate tasks across multiple heterogeneous devices such as the two GPU+ CPU configuration used in the articles. Complete working code samples will be discussed including a massively parallel random number test framework. In combination with a strong scaling execution model, the ability to choreograph asynchronous data movement and overlapped computations on multiple devices makes OpenCL a powerful development tool to consider to incorporate scalable portable parallelism into your applications.
Accelerate Rendering by an Order of Magnitude with OpenCL plus a View to the Multi-core and Web-enabled Future, presented by guest speaker Rob Farber. (2 of 3)This webinarconcludes the discussion of the nine article OpenCL Portable Parallelism series by Rob Farber on the Code Project. Articles 6, 7, and 9 willdemonstrate how to use OpenCL to provide high-quality, fast rendering in combination with primitive restart, a new feature added to the OpenGL 3.1 standard. As CPUs add ever more cores, device fission lets OpenCL programmerspartition the hardware capability to achieve the best resource usage. Concluding thoughts will include a discussion of webcl, which allows the use of OpenCL inside a web browser.
Introducing OpenCL Portable Parallelism presented by guest speaker, Rob Farber (1 of 3)This webinar will introduce the nine article OpenCL Portable Parallelism series by Rob Farber on The Code Project. Articles 1, 2 and 3 will be discussed including (1) C and C++ APIs for OpenCL plus building and running applications (2) OpenCL memory spaces and (3) the OpenCL execution model. Complete code examples from each article will be discussed to help get started with OpenCL. In particular, the importance of the OpenCL strong scaling execution model will be highlighted along with other reasons to consider OpenCL for your application development.
Write Once Run Anywhere This presentation shows how Aparapi, an API for expressing data parallel workloads in Java, can extend Java’s promise of ‘Write Once, Run Anywhere’ to include GPU devices. Existing Java OpenCL bindings require developers to code data parallel algorithms in OpenCL, provide explicit buffer transfers and execution requests at runtime, and if OpenCL were unavailable, code a separate Java implementation and have an appropriate fallback strategy. Aparapi allows developers to code against a simple Java data parallel API. At runtime Aparapi attempts to execute on the GPU by converting bytecode to OpenCL; if OpenCL is unavailable Aparapi will fall back to executing using a Java thread pool. We contrast Aparapi with other OpenCL Java bindings, describe how it works, and walk through some real world examples. We also discuss how to determine whether Aparapi is a viable option for your application.
Taming GPU Compute with C++ AMP Developers today inject parallelism into their compute-intensive applications in order to take advantage of multi-core CPU hardware. Beyond CPUs, however, compute accelerators such as general-purpose GPUs can provide orders of magnitude speed-ups for data parallel algorithms. How can you as a C++ developer fully utilize this heterogeneous hardware from your Visual Studio environment? How can you benefit from this tremendous performance boost in your Visual C++ solutions without sacrificing developer productivity? The answers will be presented in this session about C++ Accelerated Massive Parallelism. Slide Deck
Advanced OpenCL™ Debugging using gDEBugger by Yaki Tebeka, AMD Fellow Developing robust parallel computing applications is difficult. In this talk we will introduce the audience to gDEBugger, an OpenCL kernel source code debugger, integrated into Visual Studio™. We will display advanced debugging techniques that help locate hard-to-find OpenCL related bugs. Join Yaki Tebeka, an AMD Fellow, responsible for AMD’s developer tools, for this live webinar. Yaki brings over 13 years of experience in software, focusing on 3D graphics, heterogeneous computing and developer tools. View Video
Introduction to Parallel and Heterogeneous Computing (1 hour)
Learn how heterogeneous computing fits into the parallel computing paradigm, what problems it solves and what opportunities it presents.
Video
Slide Deck
Introduction to OpenCL (1 hour)
Learn about the benefits of OpenCL, the anatomy and architecture of OpenCL and the tools and drivers available.
Video
Slide Deck
GPU Architecture Overview (1 hour)
Learn about the modern GPU architectures and place the devices in context of the CPU technologies available today. Get specific insight into the latest AMD hardware including the 5000 and 6000 series GPUs and how this design affects software implementation.
Video
Slide Deck
OpenCL Programming in Detail (1.5 hours)
Learn about OpenCL application execution, resource setup, kernel programming and compiling, program execution, memory objects and synchronization. This webinar will also get into OpenCL C Language including restrictions, data types, type casting and conversions, qualifiers, and built-in functions in the context of an N-Body example.
Video
Slide Deck
Real World Application in OpenCL (1 hour)
Walk through the creation of a video processing application developed by one of our engineers and get a sense of what you might be able to do with OpenCL in your own applications.
Video
Slide Deck
Device Fission Extensions for OpenCL (1 hour)
Learn about the unique advantage that OpenCL has when it comes to Fission extensions.
Video
Slide Deck
Smoothed Particle Hydrodynamics (1 hour)
This webinar describes a project in computational fluid dynamics targeted for videogame applications. The Smoothed Particle Hydrodynamics (SPH) algorithm is a particle method for simulating viscous fluids like water, syrup and air. It is based on solving the incompressible Navier-Stokes equations of fluid mechanics using a particle formulation. This webinar shows you how to build an SPH simulation in OpenCL and discusses design tradeoffs. Source code for the simulation is available.
Video
Slide Deck
Download the source code (.rar)
Optimizing a Convolution Algorithm (1 hour)
Learn about Debugging OpenCL, performance measurements, general optimization tips and walk through optimizing a convolution algorithm.
Video
Slide Deck

Get the hcNewsFlash.

Your email address:

No SPAM.
Easy unsubscribe.

HSA is going to rock your world.

Learn more about Heterogeneous System Architecture.

Got Questions?

Ask the Developer Forums Community. They’ve got answers.

Resources

  • Heterogeneous Computing
    • OpenCL™ Zone
      • Getting Started with OpenCL
      • Tools and Libraries
      • Programming in OpenCL™
        • Introductory Exercises and Tutorials
        • Debugging Applications
        • Optimizing Applications
        • Benchmarking Performance
        • Porting CUDA Applications to OpenCL™
        • Image Convolution Using OpenCL™
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 2
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 3
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 4
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 5
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 6
      • Training & Events
        • OpenCL™ Course: Introduction to OpenCL™ Programming
        • OpenCL™ Course: Introductory Tutorial to OpenCL™ for HPC at SAAHPC’10
        • OpenCL Programming Webinar Series
        • OpenCL™ On-Demand Webinars
      • Articles & Papers
      • Getting Started – Software & Hardware
    • What is Heterogeneous Computing?
    • What is Heterogeneous System Architecture (HSA)?
      • A Heterogenius Architecture
  • Documentation Library
  • Hardware & Drivers
    • CCC Driver Details
    • “Magny-Cours” Zone
    • ATI Catalyst™ PC Vendor ID (1002) LI
  • AFDS Videos
  • Documentation & Articles
    • Develop Blazing Fast Code with Microsoft Visual Studio® 2008 and AMD Tools
    • Exploiting Multi-Core Processors in Windows Vista
    • Performance Optimization of Windows Applications on AMD Processors, Part I
    • Performance Optimization of Windows Applications on AMD Processors, Part II
    • Ten Things Developers Should Know About Windows 7
    • The Windows NUMA API-What It Is and Why You Care
    • Articles & Whitepapers
      • OpenCL™ Optimization Case Study: Diagonal Sparse Matrix Vector Multiplication Test
      • Barcelona’s Innovative Architecture Is Driven by a New Shared Cache
      • Bulk Encryption on GPUs
      • Develop Blazing Fast Code with Microsoft Visual Studio® 2008 and AMD Tools
      • Going to Barcelona: A Modern Architecture for Breakthrough Software Performance
      • Introduction to “Magny-Cours”
      • Java Performance when Debugging is Enabled
      • JPEG Decoding with Run-Length Encoding: A CPU and GPU Approach
      • New Round-to-Even Technique for Large-scale Data and Its Application in Integer Scaling
      • OpenCL™ and the AMD APP SDK
      • OpenCL™ and the AMD APP SDK v2.4
      • OpenCL™ Optimization Case Study Fast Fourier Transform – Part 1
      • OpenCL™ Optimization Case Study Fast Fourier Transform – Part II
      • OpenCL™ Optimization Case Study: Simple Reductions
      • OpenCL™ Optimization Case Study: Support Vector Machine Training
      • Tiled Convolution: Fast Image Filtering
    • Developer Guides & Manuals
    • Specifications & Technical Bulletins
    • Case Studies
    • Conference Presentations
      • GPU Technical Publications
      • GPU Technology Papers
    • Videos
      • AMD Developer Inside Track
      • Intro to CodeAnalyst
      • OpenCL™ Technical Overview
      • GPU Demo Videos
      • AMD & Sun Technology
      • AMD Opteron 6100 Series: A Developer’s Perspective
      • Software Optimization Video Series
      • Xen Summit North America 2010
    • Java™ Zone
    • Knowledge Base
    • OpenGL® Zone
      • OpenGL® Specifications
    • Samples & Demos
      • Processor and Core Enumeration Using CPUID
      • GPU Demos
        • Radeon™ HD 7900 Series Graphics Real-Time Demos
        • Radeon™ HD 6900 Series Graphics Real-Time Demo
        • Radeon™ HD 5000 Series Graphics Real-Time Demos
        • Radeon™ HD 4800 Series Real-Time Demos
        • FireGL™ V8600 PCI-Express Real-Time Demos
        • Radeon™ HD 3000 Series Real-Time Demo
        • Radeon™ HD 2000 Series Real-Time Demos
  • India Developer Zone
    • India University Courses
    • University Kit & Book
    • C-DAC “Think Parallel” participants visits at AMD – 20th June, 2012
    • C-DAC HeGaPa 2012 Conference
    • Heterogeneous computing Jobs in AMD India
  • Archive
    • Events
      • AMD OpenCL Coding Competition
      • Real-Time Image Processing for Autonomous Learning and Control within 3D Virtual Worlds
      • Semi-Supervised Learning-Based Method for Adaptive Shadow Detection
      • AMD OpenCL™ Coding Competition
      • Real-time Video Effects with AMD & Kinect
      • Numerical Simulation of an X-Ray Generator
    • AppShowcase Archive
    • Archived Tools
      • Video Player Test
      • CPU Tools Archive
        • 128-Bit SSE5 Instruction Set
        • AMD String Library
        • Framewave Project
        • SSEPlus Project
      • GPU Tools Archive
        • ATI Stream Software Development Kit (SDK) v2.0 Beta Program
        • AMD Tootle
        • ASHLI – Advanced Shading Language Interface
        • ATI Radeon™ SDK
        • ATI Stream Software Development Kit (SDK) v1.4-beta
          • ATI Stream SDK MD5 Checksums
        • ATI_Compress
        • CubeMapGen
        • AMD GPU MeshMapper
        • GPU PerfStudio
        • Normal Mapper
        • RenderMonkey™ Toolsuite
          • RenderMonkey Toolsuite – IDE Features
          • RenderMonkey™ Toolsuite – Testimonials
          • RenderMonkey™ Toolsuite – SDK
        • The Compressonator
        • TruForm Resources
          • TruForm™ FAQ
      • Installing GCC on Ubuntu 8.04

©2013 Advanced Micro Devices, Inc. OpenCL and the OpenCL logo are trademarks of Apple, Inc., used with permission by Khronos.

  • Contact Us
  • |
  • Careers
  • |
  • Site Map
  • |
  • Terms and Conditions
  • |
  • Privacy
  • |
  • Trademarks