Our primary aim with the AMD APP SDK is to enable developers to leverage the processing power of heterogeneous compute (hc). OpenCL™ is a primary mechanism for…
Blog
Introducing AMD CodeXL: Developer Tool Suite for Heterogeneous Compute (Beta Now Available)
At the AMD Fusion Developer Summit (AFDS) this summer, we discussed new developments around heterogeneous computing and showcased a powerful and growing developer ecosystem for the same.…
Installing LAMP: Part 1
One of the most popular open source technology stacks is LAMP – Linux, Apache, MySQL, PHP. To sum it up in a very simplistic manner, this technology…
Sparse Matrix by Vector Multiplication
Multiplication of a matrix by a vector is a very common operation. If M is an m x n matrix and V is a vector with n…
Persistent Threads dot Product With Templates
A previous blog entry discussed the benefits of the persistent thread programming paradigm in OpenCL™, using the dot product operation as an example. This note modifies the…
Call for Papers (and Attendees) for the Programmability Issues for Heterogeneous Multicores Workshop
AMD is again supporting the Programmability Issues for Heterogeneous Multicores (MULTIPROG) workshop at the HiPEAC conference in Berlin, Germany on January 21 – 23, 2013. MULTIPROG brings…
Tips & Tricks: How to get near Zero cost for data inspection
Zero cost of data inspection. Very often a GPU kernel has to return a few values to the host for inspection. For example, to check an optimization criterion,…
AMD releases CodeAnalyst 3.4 for Linux!
AMD CodeAnalyst software for Linux® 3.4 has been released and can be downloaded from the AMD CodeAnalyst for Linux website: http://developer.amd.com/cpu/codeanalyst/codeanalystlinux. In this AMD CodeAnalyst 3.4 release,…
HSA – a boon for OpenCL™ and heterogeneous compute in general
Like many of you, I recently spent time at the AMD Fusion Developer Summit (AFDS) in Bellevue, WA, but for those of you who were unable to…
Efficient dot Product Implementation Using Persistent Threads
This note discusses the implementation of the dot product operator in OpenCL™. It presents an efficient implementation demonstrating the use of persistent threads. Download the code sample…