Developer Central
China  |  India
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support
  • Home
  • Tools & SDKs
  • Resources
  • Community
  • Partners
  • Support

Resources

  • Heterogeneous Computing
    • OpenCL™ Zone
      • Getting Started with OpenCL
      • Tools and Libraries
      • Programming in OpenCL™
        • Introductory Exercises and Tutorials
        • Debugging Applications
        • Optimizing Applications
        • Benchmarking Performance
        • Porting CUDA Applications to OpenCL™
        • Image Convolution Using OpenCL™
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 2
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 3
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 4
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 5
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 6
      • Training & Events
        • OpenCL™ Course: Introduction to OpenCL™ Programming
        • OpenCL™ Course: Introductory Tutorial to OpenCL™ for HPC at SAAHPC’10
        • OpenCL Programming Webinar Series
        • OpenCL™ On-Demand Webinars
      • Articles & Papers
      • Getting Started – Software & Hardware
    • What is Heterogeneous Computing?
    • What is Heterogeneous System Architecture (HSA)?
      • A Heterogenius Architecture
  • Documentation Library
  • Hardware & Drivers
    • CCC Driver Details
    • “Magny-Cours” Zone
    • ATI Catalyst™ PC Vendor ID (1002) LI
  • AFDS Videos
  • Documentation & Articles
    • Develop Blazing Fast Code with Microsoft Visual Studio® 2008 and AMD Tools
    • Exploiting Multi-Core Processors in Windows Vista
    • Performance Optimization of Windows Applications on AMD Processors, Part I
    • Performance Optimization of Windows Applications on AMD Processors, Part II
    • Ten Things Developers Should Know About Windows 7
    • The Windows NUMA API-What It Is and Why You Care
    • Articles & Whitepapers
      • OpenCL™ Optimization Case Study: Diagonal Sparse Matrix Vector Multiplication Test
      • Barcelona’s Innovative Architecture Is Driven by a New Shared Cache
      • Bulk Encryption on GPUs
      • Develop Blazing Fast Code with Microsoft Visual Studio® 2008 and AMD Tools
      • Going to Barcelona: A Modern Architecture for Breakthrough Software Performance
      • Introduction to “Magny-Cours”
      • Java Performance when Debugging is Enabled
      • JPEG Decoding with Run-Length Encoding: A CPU and GPU Approach
      • New Round-to-Even Technique for Large-scale Data and Its Application in Integer Scaling
      • OpenCL™ and the AMD APP SDK
      • OpenCL™ and the AMD APP SDK v2.4
      • OpenCL™ Optimization Case Study Fast Fourier Transform – Part 1
      • OpenCL™ Optimization Case Study Fast Fourier Transform – Part II
      • OpenCL™ Optimization Case Study: Simple Reductions
      • OpenCL™ Optimization Case Study: Support Vector Machine Training
      • Tiled Convolution: Fast Image Filtering
    • Developer Guides & Manuals
    • Specifications & Technical Bulletins
    • Case Studies
    • Conference Presentations
      • GPU Technical Publications
      • GPU Technology Papers
    • Videos
      • AMD Developer Inside Track
      • Intro to CodeAnalyst
      • OpenCL™ Technical Overview
      • GPU Demo Videos
      • AMD & Sun Technology
      • AMD Opteron 6100 Series: A Developer’s Perspective
      • Software Optimization Video Series
      • Xen Summit North America 2010
    • Java™ Zone
    • Knowledge Base
    • OpenGL® Zone
      • OpenGL® Specifications
    • Samples & Demos
      • Processor and Core Enumeration Using CPUID
      • GPU Demos
        • Radeon™ HD 7900 Series Graphics Real-Time Demos
        • Radeon™ HD 6900 Series Graphics Real-Time Demo
        • Radeon™ HD 5000 Series Graphics Real-Time Demos
        • Radeon™ HD 4800 Series Real-Time Demos
        • FireGL™ V8600 PCI-Express Real-Time Demos
        • Radeon™ HD 3000 Series Real-Time Demo
        • Radeon™ HD 2000 Series Real-Time Demos
  • India Developer Zone
    • India University Courses
    • University Kit & Book
    • C-DAC “Think Parallel” participants visits at AMD – 20th June, 2012
    • C-DAC HeGaPa 2012 Conference
    • Heterogeneous computing Jobs in AMD India
  • Archive
    • Events
      • AMD OpenCL Coding Competition
      • Real-Time Image Processing for Autonomous Learning and Control within 3D Virtual Worlds
      • Semi-Supervised Learning-Based Method for Adaptive Shadow Detection
      • AMD OpenCL™ Coding Competition
      • Real-time Video Effects with AMD & Kinect
      • Numerical Simulation of an X-Ray Generator
    • AppShowcase Archive
    • Archived Tools
      • Video Player Test
      • CPU Tools Archive
        • 128-Bit SSE5 Instruction Set
        • AMD String Library
        • Framewave Project
        • SSEPlus Project
      • GPU Tools Archive
        • ATI Stream Software Development Kit (SDK) v2.0 Beta Program
        • AMD Tootle
        • ASHLI – Advanced Shading Language Interface
        • ATI Radeon™ SDK
        • ATI Stream Software Development Kit (SDK) v1.4-beta
          • ATI Stream SDK MD5 Checksums
        • ATI_Compress
        • CubeMapGen
        • AMD GPU MeshMapper
        • GPU PerfStudio
        • Normal Mapper
        • RenderMonkey™ Toolsuite
          • RenderMonkey Toolsuite – IDE Features
          • RenderMonkey™ Toolsuite – Testimonials
          • RenderMonkey™ Toolsuite – SDK
        • The Compressonator
        • TruForm Resources
          • TruForm™ FAQ
      • Installing GCC on Ubuntu 8.04

Home > Resources > Documentation & Articles > Videos > AMD Developer Inside Track

AMD Developer Inside Track

The AMD Developer Inside Track is a monthly video series with the sole purpose of giving software developers and inside, behind-the-scenes, look at new technologies coming out of AMD and AMD’s software partners.  

Episode Index

  • Episode 1: CommunityOne 2009 West Panel
  • Episode 2: Introduction to OpenCL
  • Episode 3: Vision Launch Recap: The Developer Perspective
  • Episode 4: AMD & Adobe: A Software Optimization Story
  • Episode 5: AMD x86 Open64 Compiler Suite Team Insights
  • Episode 6: Using CPUID
  • Episode 7: GDC 2010: Cloth, Destruction, Tesselation & More
  • Episode 8: How to Optimize Image Convolution
  • Episode 9: Xen Summit
  • Episode 10: CodeAnalyst Supports Visual Studio 2010
  • Episode 11: Introduction to the AMD Core Math Library (ACML)
  • Episode 12: Introduction to SimNow™
  • Episode 13: “Bulldozer” Processor Topology
  • Episode 14: “Evergreen” GPU Architecture
  • Episode 15: Introduction to Aparapi
  • Episode 16: SimNow Live Demo
  • Episode 17: Aparapi Open Sourced

Aparapi Open Sourced Gary Frost, AMD Runtimes Team Learn how Aparapi works and how you can contribute to the open source project. (10:05)

  • Download the video

SimNow Live Demo Juan Flores, AMD SimNow Team AMD SimNow team member Juan Flores gives an in-depth demonstration of SimNow, a fast and configurable x86 and x86-64 functional simulator that demonstrates next generation hardware before the silicon becomes available. Juan walks developers through the environment and demonstrates a piece of the SPEC CPU benchmark using AMD’s next generation instruction set architectures. (10:05)

  • Download the video

Introduction to Aparapi Gary Frost and Tom Deneau, AMD Runtimes Team The AMD Runtimes team discusses the recently released Java-to-OpenCL™ API, Aparapi, and how it can help Java developers parallelize code without necessarily having to learn a new programming language. Aparapi is a Java API for arranging data parallel solutions to be executed on the GPU, which can convert Java to OpenCL on the fly. This video discusses the features and applications of Aparapi, as well a preview of what’s to come in the next release. (10:25)

  • Download the video

“Evergreen” Graphics Architecture Justin Hensley, Design Engineer, Office of the CTO This video gives you an in-depth description of GPU architecture with specifics about our “Evergreen” graphics processors. A deep understanding of GPU architecture is key to fully unlocking the power of Heterogeneous computing and optimizing those graphic intensive applications to their greatest potential. (18:51)

  • Download the video
  • Download the slide deck

“Bulldozer” Processor Topology Elsie Wahlig, AMD Fellow Processor Topology has changed dramatically over the years and will continue to evolve to be more complex. Elsie Wahlig, AMD Fellow in the Software Division, steps through a processor block diagram for the “Bulldozer” Module and highlights the areas that have changed over what is currently available today with the “Magny-Cours” Processor. She also talks about what kind of developer really needs to know this level of processor detail and what tools they can use to get all the topology information they need. (9:26)

  • Download the video

Introduction to SimNow™ Juan Flores, Member of Technical Staff Did you know that AMD offers a free application called SimNow that actually simulates future AMD platforms? This video features Juan Flores, MTS talking all about what SimNow is, who would could benefit from using it, and gives examples on what type of instructions, like AVX and XOP, are available in SimNow that aren’t available in current silicon. (12:43)

  • Read Transcript
  • Download the video

Related Links: Download SimNow


Introduction to the AMD Core Math Library (ACML) Chip Freitag, Senior Member of Technical Staff, Software Development Engineer Get a first hand introduction of the AMD Core Math Library from Chip Freitag, a 17 year AMD veteran, currently working on the ACML Team. He talks about what ACML is, how it differs from ACML-GPU and who could really take advantage of it.  He also gives some insight into what the team is working on now to improve ACML and when we will have access to those new features. (7:36)

  • Download the video
  • Read Transcript

Related Links: ACML Download ACML Data Sheet AMD Performance Libraries


CodeAnalyst Supports Visual Studio 2010 Frank Swehosky CodeAnalyst integrates into Visual Studio 2010. Get a first hand introduction from Frank Swehosky, a 10 year AMD veteran on the CodeAnalyst Team. He talks through the integration process of Visual Studio 2010 and the analysis and possible optimizations that can be done on C/C++ and .NET managed code. Check CodeAnalyst often for regularly scheduled end of quarter refreshes.

  • Read Transcript
  • Download the video

Xen Summit Tom Woller, Principal MTS Engineer This episode features Tom Woller, Principal MTS Engineer speaking why AMD is involved with virtualization. Also, Ian Pratt, chairman of Xen along with Rodney Schmidt, AMD Software Eningeer, speak openly about what is coming up for Xen and AMD in the virtualization space. (09:40)

  • Download the video

How to Optimize Image Convolution Udeepta Bordoloi, MTS Software Engineer in the Stream Computing Group Udeepta Bordoloi walks though several different ways to optimize an image convolution algorithm. He explains the thinking behind tweaking the workgroup size, type of memory used, and data types that result in incredible speedups. If you use convolution you will definitely not want to miss this step-by-step optimization video. (18:15)

  • Download Video

AMD GDC Demos 2010 Saif Ali, AMD Software Engineer, Advanced Technology Group Christopher Jess, Studio|GPU™, Technical Director This episode of the AMD Inside Track Video series brings the AMD GDC demos to you. Software Engineer, Saif Ali, walks through three examples of how OpenCL can be used to create realistic cloth, destruction and fluid particle simulation using the updated Pixelux and Bullet Physics offerings. And our partners from Studio|GPU™ (Mach Studio Pro is bundled with our FirePro Graphics cards) show us how to use tessellation to change the texture of a character in real time! (10:47)

  • Read transcript
  • Download the video

Using CPUID Randy VanderHeyden, MST Software Engineer, ISV Team This video features Randy VanderHeyden, a 17 year AMD veteran, talking about the best practices on how to use, and how not to use the CPUID instruction. He demonstrates how to find the processor core count with CPUID using the code provided in the Processor and Core Enumeration Using CPUID article (Thanks for the code Tracy Carver!). Also, for those of you using Visual Studio, there is very detailed information available on MSDN about the CPUID Intrinsic. (7:22)

  • Read Transcript
  • Download the video

AMD x86 Open64 Compiler Suite Team Insights Mike Vermeulen, Manager of the x86 Open64 Team Roy Ju, Architect and Software Engineer Michael Lai, Software Engineer This video features AMD’s Open64 Compiler Team talking about Open64 origins, unique features of this compiler, such as multi-core scalability optimizations and other important optimization flags you should not miss such as loop nest optimizations (LNO), and interprocedural analysis (IPA) to name a few. Many thanks to the team for giving us this insight directly! (14:23)

  • Read Transcript
  • Download the video

AMD & Adobe: A Software Optimization Story Justin Boggs, Sofware Engineer CPU Thomas Fortier, Software Engineer Graphics Justin Boggs and Thomas Fortier are AMD software engineers that work with Adobe to make sure their software is optimized for AMD. This video footage was taken earlier this year when they were part of an Adobe Users group multithreading panel. Hear what they had to say about AMD CPU and GPU optimizations with Adobe. (9:37)

  • Download the video

Vision Launch Recap: A Developer’s Perspective Check out the AMD VISION launch event from a software developer perspective. Get the details about DirectX 11 development straight from Codemasters, Rebellion, EA and Turbine. You’ll see six monitors being driven from one graphics card, get the info on the new DX11 games coming up and more importantly, learn about the new levels of graphical detail that DX11 hardware can bring to the table. Many thanks to Gareth Thomas, Senior Programmer from Codemasters, Chris Kingsley, CTO of Rebellion, Kevin O’Leary, Product Manager of EA and Nate Jones, VP of Corporate Business Development at Turbine. (5:55)

  • Download the video

Introduction to OpenCL Michael Houston, GPG System Architect Learn about OpenCL, what the transition to OpenCL will be like, what applications are ideal for OpenCL and what impact OpenCL will have on future software. Read this blog post for a summary of this video and some example applications and demos (10:07).

  • Download the video

CommunityOne 2009 West Panel Experts from AMD, Allinea, Pervasive, and Rogue Wave give their advice and insights for what they recommend when beginning a multithreaded project, whether you are starting from scratch or working with existing single threaded code.  Parallel programming issues are not a one-size-fits-all.  Depending on the algorithm, the data dependencies, and the problem domain there are different approaches to achieving parallelism.  These expert interviews followed an AMD-sponsored multithreading discussion at CommunityOne 2009 West (12:06).

  • Read Transcript
  • Download the video

Related Links: Taking Advantage of Multi-Core, Blog by Tracy Carver


Additional Resources

  • Introducing the AMD Developer Inside Track
  • Download AMD Codeanalyst Performance Analyzer
  • Video: Introduction to CodeAnalyst
  • CodeAnalyst Forums

 

Get the hcNewsFlash.

Your email address:

No SPAM.
Easy unsubscribe.

HSA is going to rock your world.

Learn more about Heterogeneous System Architecture.

Got Questions?

Ask the Developer Forums Community. They’ve got answers.

Resources

  • Heterogeneous Computing
    • OpenCL™ Zone
      • Getting Started with OpenCL
      • Tools and Libraries
      • Programming in OpenCL™
        • Introductory Exercises and Tutorials
        • Debugging Applications
        • Optimizing Applications
        • Benchmarking Performance
        • Porting CUDA Applications to OpenCL™
        • Image Convolution Using OpenCL™
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 2
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 3
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 4
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 5
          • Image Convolution Using OpenCL™ – A Step-by-Step Tutorial Step 6
      • Training & Events
        • OpenCL™ Course: Introduction to OpenCL™ Programming
        • OpenCL™ Course: Introductory Tutorial to OpenCL™ for HPC at SAAHPC’10
        • OpenCL Programming Webinar Series
        • OpenCL™ On-Demand Webinars
      • Articles & Papers
      • Getting Started – Software & Hardware
    • What is Heterogeneous Computing?
    • What is Heterogeneous System Architecture (HSA)?
      • A Heterogenius Architecture
  • Documentation Library
  • Hardware & Drivers
    • CCC Driver Details
    • “Magny-Cours” Zone
    • ATI Catalyst™ PC Vendor ID (1002) LI
  • AFDS Videos
  • Documentation & Articles
    • Develop Blazing Fast Code with Microsoft Visual Studio® 2008 and AMD Tools
    • Exploiting Multi-Core Processors in Windows Vista
    • Performance Optimization of Windows Applications on AMD Processors, Part I
    • Performance Optimization of Windows Applications on AMD Processors, Part II
    • Ten Things Developers Should Know About Windows 7
    • The Windows NUMA API-What It Is and Why You Care
    • Articles & Whitepapers
      • OpenCL™ Optimization Case Study: Diagonal Sparse Matrix Vector Multiplication Test
      • Barcelona’s Innovative Architecture Is Driven by a New Shared Cache
      • Bulk Encryption on GPUs
      • Develop Blazing Fast Code with Microsoft Visual Studio® 2008 and AMD Tools
      • Going to Barcelona: A Modern Architecture for Breakthrough Software Performance
      • Introduction to “Magny-Cours”
      • Java Performance when Debugging is Enabled
      • JPEG Decoding with Run-Length Encoding: A CPU and GPU Approach
      • New Round-to-Even Technique for Large-scale Data and Its Application in Integer Scaling
      • OpenCL™ and the AMD APP SDK
      • OpenCL™ and the AMD APP SDK v2.4
      • OpenCL™ Optimization Case Study Fast Fourier Transform – Part 1
      • OpenCL™ Optimization Case Study Fast Fourier Transform – Part II
      • OpenCL™ Optimization Case Study: Simple Reductions
      • OpenCL™ Optimization Case Study: Support Vector Machine Training
      • Tiled Convolution: Fast Image Filtering
    • Developer Guides & Manuals
    • Specifications & Technical Bulletins
    • Case Studies
    • Conference Presentations
      • GPU Technical Publications
      • GPU Technology Papers
    • Videos
      • AMD Developer Inside Track
      • Intro to CodeAnalyst
      • OpenCL™ Technical Overview
      • GPU Demo Videos
      • AMD & Sun Technology
      • AMD Opteron 6100 Series: A Developer’s Perspective
      • Software Optimization Video Series
      • Xen Summit North America 2010
    • Java™ Zone
    • Knowledge Base
    • OpenGL® Zone
      • OpenGL® Specifications
    • Samples & Demos
      • Processor and Core Enumeration Using CPUID
      • GPU Demos
        • Radeon™ HD 7900 Series Graphics Real-Time Demos
        • Radeon™ HD 6900 Series Graphics Real-Time Demo
        • Radeon™ HD 5000 Series Graphics Real-Time Demos
        • Radeon™ HD 4800 Series Real-Time Demos
        • FireGL™ V8600 PCI-Express Real-Time Demos
        • Radeon™ HD 3000 Series Real-Time Demo
        • Radeon™ HD 2000 Series Real-Time Demos
  • India Developer Zone
    • India University Courses
    • University Kit & Book
    • C-DAC “Think Parallel” participants visits at AMD – 20th June, 2012
    • C-DAC HeGaPa 2012 Conference
    • Heterogeneous computing Jobs in AMD India
  • Archive
    • Events
      • AMD OpenCL Coding Competition
      • Real-Time Image Processing for Autonomous Learning and Control within 3D Virtual Worlds
      • Semi-Supervised Learning-Based Method for Adaptive Shadow Detection
      • AMD OpenCL™ Coding Competition
      • Real-time Video Effects with AMD & Kinect
      • Numerical Simulation of an X-Ray Generator
    • AppShowcase Archive
    • Archived Tools
      • Video Player Test
      • CPU Tools Archive
        • 128-Bit SSE5 Instruction Set
        • AMD String Library
        • Framewave Project
        • SSEPlus Project
      • GPU Tools Archive
        • ATI Stream Software Development Kit (SDK) v2.0 Beta Program
        • AMD Tootle
        • ASHLI – Advanced Shading Language Interface
        • ATI Radeon™ SDK
        • ATI Stream Software Development Kit (SDK) v1.4-beta
          • ATI Stream SDK MD5 Checksums
        • ATI_Compress
        • CubeMapGen
        • AMD GPU MeshMapper
        • GPU PerfStudio
        • Normal Mapper
        • RenderMonkey™ Toolsuite
          • RenderMonkey Toolsuite – IDE Features
          • RenderMonkey™ Toolsuite – Testimonials
          • RenderMonkey™ Toolsuite – SDK
        • The Compressonator
        • TruForm Resources
          • TruForm™ FAQ
      • Installing GCC on Ubuntu 8.04

©2013 Advanced Micro Devices, Inc. OpenCL and the OpenCL logo are trademarks of Apple, Inc., used with permission by Khronos.

  • Contact Us
  • |
  • Careers
  • |
  • Site Map
  • |
  • Terms and Conditions
  • |
  • Privacy
  • |
  • Trademarks