Sparse LU Factorization on GPUs and Its Application in Circuit Simulation | Heterogeneous Computing, Optimization | 2012 |

CPU & GPU for Parallel Programming | C, C++, Heterogeneous Computing, OpenCL™ Getting Started | 09/13/2013 |

GPU Devices and Parallelism of Image Processing | Heterogeneous Computing | 09/05/2013 |

Get Ready for the Next Generation APU Architecture | APU, Architecture, HSA, OpenCL™ Research | 09/05/2013 |

Deep Neural Network (DNN) Analysis, Application and Challenges | Optimization | 09/10/2013 |

An Efficient Compiler Framework for Cache Bypassing on GPUs | Optimization | 2013 |

Harnessing GPU compute with C++ Accelerated Massive Parallelism | C++, Heterogeneous Computing | 2011 |

CPU And GPU Programming: Analysis and Discussing the Pros and Cons | Heterogeneous Computing | 07/10/2013 |

Heterogeneous System Architecture: Introduction | HSA | 07/10/2013 |

AMD heterogeneous Uniform Memory Access | Heterogeneous Computing, HSA | 06/2013 |

Heterogeneous Computing in ARM Architecture | Architecture, Heterogeneous Computing, OpenCL™ Getting Started | 06/25/2013 |

Case Study: OpenCL-based SIFT Algorithm and Optimization | OpenCL™ Research, Optimization | 03/30/2013 |

OpenCL, Video Open Source Project Application | OpenCL™ Research | 04/03/2013 |

Heterogeneous System Architecture (HSA) | HSA, OpenCL™ Getting Started | 03/2013 |

Valgrind | C, C++ | - |

OpenSUSE 10.3 repository for x86_64 | Optimization | - |

Taming GPU compute with C++ AMP | C++, Heterogeneous Computing | 09/16/2011 |

C++ AMP Book | C++, Heterogeneous Computing | 10/16/2012 |

How to use C++ AMP from C# for a Windows Store app | C++, Heterogeneous Computing | 11/11/2011 |

Tutorial for building a Windows Store app in C++ | C++ | 07/22/2013 |

C++ AMP Math Library | C++, Heterogeneous Computing | 02/08/2012 |

C++ AMP Overview | C++, Heterogeneous Computing | - |

Creating a DirectX game | C++, DirectX, Graphics Development | 07/23/2013 |

How to set up your Modern UI DirectX app to display a view | C++, DirectX | 07/22/2013 |

Complete code for a DirectX Windows Store app framework | C++, DirectX, Graphics Development | 07/22/2013 |

DirectX and XAML interop | C++, DirectX, Graphics Development | 06/14/2013 |

Framewave documentation | Architecture, C, C++, Optimization | 12/2008 |

Intrinsics and Casting | C, Optimization | 08/31/2007 |

Using the Process Class to call command-line utilities | | 08/17/2007 |

Fun with Intrinsics | C++, Optimization | 06/29/2007 |

Image Processing the Easy Way (Brent Hollingsworth) | C++, Graphics Development, Heterogeneous Computing | 11/2006 |

Analyzing Java Performance Using Hardware Performance Counters | Java, Optimization | 2008 |

Virtualizing a Virtual Machine | Java, Optimization | 2008 |

Accelerating Java Workloads via GPUs | C, C++, Heterogeneous Computing, Java, OpenCL™ Research | 2010 |

Java Tutorial: Concurrency (threading) | Java | - |

J2SE 5.0 Performance on AMD Opteron | Java, Optimization | - |

How JVMs use Escape Analysis to Improve Application Performance | Java, Optimization | 01/30/2008 |

The Secret of Java Thread Pools | Heterogeneous Computing, Java, Optimization | 11/21/2006 |

Using AMD CodeAnalyst with Java | Java, Optimization | 06/30/2006 |

Supersizing Java: Large Pages on the Opteron Processor, Part 2 | Java, Optimization | 03/02/2006 |

Supersizing Java: Large Pages on the Opteron Processor, Part 1 | Java, Optimization | 02/14/2006 |

CodeSleuth: New Performance Analysis Tool for Java Applications | C, C++, Heterogeneous Computing, Java, OpenCL™ Getting Started, Optimization | - |

JVMTI Event Piggybacking For Precise Source Mapping | Java, Optimization | 02/18/2009 |

4 Easy Ways to do Java Garbage Collection Tuning | Java, Optimization | 04/01/2009 |

Optimizing Java Performance in a Virtual Machine Environment | Java, Optimization | 04/30/2009 |

Java Garbage Collection Characteristics and Tuning Guidelines for Apache Hadoop TeraSort Workload | Optimization | 10/2012 |

A Tutorial on Adding New Instructions to the Oracle® Java HotSpot™ Virtual Machine | Architecture, Java, Optimization | 10/2012 |

AMD Boosts Data Warehouse Performance with Parallel Processing Appliance | Heterogeneous Computing, Optimization | 01/2013 |

AMD I/O Virtualization Technology (IOMMU) Specification | Architecture | 02/2009 |

Open Platform Management Architecture Specification | Architecture | 01/2008 |

AMD Lightweight Profiling Specification | Architecture, Optimization | 08/2010 |

IOMMU Architectural Specification | Architecture, Optimization | 03/24/2011 |

Surviving and Thriving in a Multi-Core World (Peter Aitken, Alan Zeichick) | Architecture, Graphics Development, Heterogeneous Computing, Optimization | 11/2006 |

BIOS and Kernel Developer’s Guide for AMD Athlon™ 64 and AMD Opteron™ Processors | Architecture, Optimization | 02/2006 |

BIOS and Kernel Developer’s Guide for AMD NPT Family 0Fh Processors | Architecture, Optimization | 11/2009 |

BIOS and Kernel Developer Guide (BKDG) for AMD Family 14h Models 00h-0Fh Processors | Architecture | 02/17/2012 |

BIOS and Kernel Developer’s Guide (BKDG) For AMD Family 12h Processors | Architecture | 10/06/2011 |

BIOS and Kernel Developer Guide (BKDG) for AMD Family 15h Models 00h-0Fh Processors | Architecture | 01/23/2013 |

Hadoop Tuning Guide | Java, Optimization | 10/2012 |

Preliminary BIOS and Kernel Developer’s Guide (BKDG) for AMD Family 16h Models 00h-0Fh (Kabini) Processors | Architecture | 05/30/2013 |

BIOS and Kernel Developer’s Guide (BKDG) For AMD Family 10h Processors | Architecture | 04/22/2010 |

Heterogeneous Systems Architecture | C++, Heterogeneous Computing, HSA, OpenCL™ Getting Started | 08/30/2012 |

Accelerated Filtering using OpenCL | Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | 2009 |

An OpenCL framework for heterogeneous multicores with local memory | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 2010 |

OpenCL Evaluation for Numerical Linear Algebra Library Development | Heterogeneous Computing, OpenCL™ Research, Optimization | - |

Fast calculation of computer-generated-hologram on AMD HD5000 series GPU and OpenCL | Architecture, Graphics Development, OpenCL™ Research | 05/10/2010 |

GPU Assisted Simulation – Wireless Communications and Information Processing Research Laboratory | Heterogeneous Computing, OpenCL™ Research | - |

GPGPU Compute On AMD | Architecture, Heterogeneous Computing, OpenCL™ Getting Started | 04/06/2011 |

The Future of the APU – A4MMC2011 Keynote (Lee Howes) | APU, Architecture, C++, DirectX, Heterogeneous Computing, OpenCL™ Getting Started, Optimization | 06/04/2011 |

The Future of the APU – PAPA Workshop Presentation (Benedict Gaster) | APU, Architecture, Heterogeneous Computing, OpenCL™ Getting Started, Optimization | 06/05/2011 |

Heterogeneous System Architecture Programming: Today and Tomorrow – An exclusive interview with Leendert van Doorn of AMD | APU, C++, HSA, OpenCL™ Getting Started, Optimization | 10/2012 |

Kite: Braided Parallelism for Heterogeneous Systems | Heterogeneous Computing, OpenCL™ Research | 09/05/2012 |

Advanced OpenCL and OpenGL Debugging and Profiling | Graphics Development, Heterogeneous Computing, OpenCL™ Research, OpenGL | 2011 |

OpenCL™ Course: Introduction to OpenCL™ Programming Training Guide | Architecture, C, Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | 05/2010 |

OpenCL™ Course: Introduction to OpenCL™ Programming Presentation | Architecture, C, Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | 2010 |

OpenCL™ Course: Introductory Tutorial to OpenCL™ for HPC at SAAHPC’10 | Architecture, Heterogeneous Computing, OpenCL™ Getting Started | 2010 |

Introduction to OpenCL | C, Heterogeneous Computing, OpenCL™ Getting Started | 2011 |

OpenCL Specification | Architecture, C, Graphics Development, Heterogeneous Computing, HPC, OpenCL™ Getting Started, OpenGL, Optimization | 06/01/2011 |

OpenCL-EMU Documentation | C++, OpenCL™ Getting Started | 10/2012 |

Resolve your Resolves | C, C++, DirectX, Graphics Development, OpenCL™ Getting Started, OpenGL | 08/2008 |

Framebuffer Objects | Graphics Development, OpenGL | 10/2005 |

Harnessing the Performance of CrossFireX™ | C, C++, DirectX, Graphics Development, OpenCL™ Getting Started, OpenGL | 08/2008 |

Order Matters in Resource Creation | Graphics Development, Optimization | 10/2005 |

R2VB Programming | DirectX, Graphics Development, Optimization | 03/2006 |

Depth In-depth | Graphics Development, Optimization | 06/2007 |

Advanced DX9 Capabilities for ATI Radeon Cards – whitepaper | DirectX, Graphics Development | 07/2009 |

EQAA Modes for AMD 6900 Series Graphics Cards – whitepaper | Graphics Development, Optimization | 09/2011 |

GCN Performance Tweets | Graphics Development, Optimization | 05/2013 |

AMD – GPU Association – Targeting GPUs for Load Balancing in OpenGL | Graphics Development, OpenGL, Optimization | 2010 |

AMD – Introduction to OpenGL 3.0 Whitepaper | Graphics Development, OpenGL | 2009 |

GPUPerfAPI User Guide | DirectX, Graphics Development, OpenCL™ Getting Started, OpenGL, Optimization | 2010 |

CodeAnalyst Linux Users Manual 3.4 | Optimization | 04/2012 |

Instruction-Based Sampling: A New Performance Analysis Technique for AMD Family 10h Processors | Optimization | 11/16/2007 |

Improving program performance with AMD CodeAnalyst for Linux® | Optimization | 05/09/2007 |

An introduction to analysis and optimization with AMD CodeAnalyst Performance Analyzer | C, Optimization | 09/08/2008 |

Increased performance with AMD CodeAnalyst software and Instruction-Based Sampling (on Linux) | Optimization | 06/19/2008 |

Basic Performance Measurements for AMD Athlon™ 64, AMD Opteron™ and AMD Phenom™ Processors | Optimization | 09/25/2008 |

Heterogeneous Compute Profile Features | Heterogeneous Computing, OpenCL™ Research | 07/11/2011 |

Cache Line Utilization with AMD CodeAnalyst Software | Optimization | 12/14/2011 |

Tuning GPGPU Applications for Performance – SC2007 BOF | Heterogeneous Computing, Optimization | 11/13/2007 |

Brook+ SC07 BOF Session | C, C++ | 11/13/2007 |

AMD Brook+ Presentation – SC2007 BOF | HPC | 12/12/2007 |

AMD ACML-GPU Presentation – SC2007 | HPC | 11/13/2007 |

AES Encryption – University of Central Florida | Optimization | 02/15/2008 |

H.264 Decoding – University of Central Florida | Graphics Development, Optimization | 02/15/2008 |

Symmetric Key Cryptography on Modern Graphics Hardware – RSA Conference 2008 | C, Heterogeneous Computing, HPC | 04/23/2008 |

Building a High Level Language Compiler for GPGPU – PLDI’08 Tutorial | Architecture, C++, Heterogeneous Computing, Optimization | 06/08/2008 |

Entering the Golden Age of Heterogeneous Computing – C-DAC PEEP2008 | Architecture, DirectX, Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | 2008 |

PDC/AMD Workshop on GPU Programming | Architecture, Heterogeneous Computing, HPC, Optimization | 2008 |

Architecture-Aware Optimization Targeting Multithreaded Stream Computing – GPGPU-2 | Architecture, Heterogeneous Computing, Optimization | 03/08/2009 |

Understanding Software Approaches for GPGPU Reliability – GPGPU-2 | Architecture, Heterogeneous Computing | - |

Studies with GPGPU – NSF U.S./Egypt Meeting on Software Development for Multicore and Heterogeneous Processors | Architecture, Heterogeneous Computing, OpenCL™ Research | 06/22/2009 |

Computation Challenges in the Use of Emerging Many-Core Architectures for DoD Applications | Architecture, Heterogeneous Computing, HPC, OpenCL™ Research, Optimization | 08/17/2009 |

DirectCompute: Capturing the Teraflop | DirectX, Graphics Development | 2010 |

AMD: Unleashing the Power of Parallel Compute With Commodity ATI Radeon™ 5800 Series GPU! (SIGGRAPH Asia 2009) | Architecture, Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | 12/2009 |

SIGGRAPH Asia 2009 – OpenCL™: Parallel Programming for Computing and Graphics | Graphics Development, Heterogeneous Computing, OpenCL™ Research, OpenGL, Optimization | 12/16/2009 |

OpenCL™: Parallel Computing for CPUs and GPUs | C, C++, Heterogeneous Computing, OpenCL™ Research, OpenGL | 2010 |

GPU Computing: Past, Present and Future with ATI Stream Technology | Architecture, DirectX, Heterogeneous Computing, HPC, OpenCL™ Research, OpenCL™ SDK, Optimization | 03/09/2010 |

GPGPU Architecture and Performance Comparison of ATI and NVIDIA GPUs | Architecture, Heterogeneous Computing, Optimization | 06/10/2010 |

GPU-Accelerated Computing for Chemistry and Material Simulations Using ATI Stream Technology | Heterogeneous Computing, HPC, OpenCL™ Research | 06/30/2010 |

Efficient Histogram Generation Using Scattering on GPUs – ACM I3D 2007 | Graphics Development, Heterogeneous Computing | 2007 |

Combining Computer Vision and Physics Simulations Using GPGPU – SIGGRAPH 2007 | Graphics Development, Heterogeneous Computing | 10/2012 |

Symmetric Key Cryptography on Modern Graphics Hardware – ASIACRYPT 2007 | Architecture, Heterogeneous Computing, Optimization | 10/2012 |

A Compiler for Parallel Execution of Numerical Python Programs on Graphics Processing Units | Heterogeneous Computing, Optimization | 10/02/2009 |

Part 1: OpenCL™ – Portable Parallelism | C++, Heterogeneous Computing, OpenCL™ Research | 09/17/2010 |

OpenCL™ Optimization Case Study: GATLAS – Designing Kernels with Auto-Tuning | Architecture, C++, Heterogeneous Computing, OpenCL™ Research, Optimization | - |

OpenCL™ Tutorial: N-Body Simulation | Architecture, Heterogeneous Computing, OpenCL™ Getting Started, Optimization | 01/22/2011 |

Image Convolution using OpenCL™: A Step-by-Step Tutorial | C++, Graphics Development, Heterogeneous Computing, OpenCL™ Getting Started | 10/13/2009 |

Intro OpenCL™ Tutorial | C++, Heterogeneous Computing, OpenCL™ Getting Started, OpenCL™ SDK | - |

Evergreen Family Instruction Set Architecture | Architecture, DirectX, Graphics Development, Heterogeneous Computing, Optimization | 11/2011 |

AMD Intermediate Language (IL) Specification | DirectX, Graphics Development, Heterogeneous Computing | 10/2011 |

OpenCL™ 1.0 Specification | Architecture, C, Graphics Development, Heterogeneous Computing, OpenCL™ Getting Started, Optimization | 10/06/2009 |

OpenCL™ 1.1 Specification | Architecture, C, Graphics Development, Heterogeneous Computing, OpenCL™ Getting Started, Optimization | 06/01/2011 |

OpenCL™ 1.2 Specification | Architecture, C, Graphics Development, Heterogeneous Computing, OpenCL™ Getting Started, Optimization | 11/14/2011 |

OpenCL C++ Wrapper API | C++, Graphics Development, OpenCL™ Research | 12/2012 |

OpenCL Static C++ Kernel Language Extension | C++, Heterogeneous Computing, OpenCL™ Research | 12/15/2011 |

AMD Accelerated Parallel Processing OpenVideo Decode | Graphics Development, Heterogeneous Computing, OpenCL™ Research | 2010 |

Using the Command Line Interface | Heterogeneous Computing | - |

Leverage Aparapi to Help Improve Financial Java Application Performance | APU, Heterogeneous Computing, Java, OpenCL™ Research, Optimization | 01/18/2012 |

Whole-function vectorization [FEE REQUIRED] | Graphics Development, Heterogeneous Computing, OpenCL™ Research | 04/06/2011 |

VOCL: An Optimized Environment for Transparent Virtualization of Graphics Processing Units | OpenCL™ Research, Optimization | 2011 |

Visualization Grammar: A Phrase-Based Reservoir Information Visualization System [FEE REQUIRED] | Architecture, Heterogeneous Computing | - |

Virtual Machine and Bytecode for Optimization on Heterogeneous Systems | Heterogeneous Computing, Java, OpenCL™ Research, Optimization | 2011 |

Utilising OpenCL Framework for Ray-Tracing Acceleration | Architecture, C++, Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | - |

Using OpenCL for Implementing Simple Parallel Graph Algorithms | Architecture, Heterogeneous Computing, OpenCL™ Research | 2012 |

Using OpenCL | C++, Heterogeneous Computing, OpenCL™ Research, OpenGL | 2012 |

Using MELT to improve or explore your GCC-compiled source code | Heterogeneous Computing, HPC, OpenCL™ Research | 04/16/2012 |

Using Blue Gene/P and GPUs to Accelerate Computations in the EULAG Model [FEE REQUIRED] | Heterogeneous Computing, HPC, OpenCL™ Research | 2012 |

Universal view synthesis unit for glassless 3DTV [FEE REQUIRED] | Graphics Development, Heterogeneous Computing | 05/2012 |

Towards Breast Anatomy Simulation Using GPUs [FEE REQUIRED] | C++, Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

The Virtual OpenCL (VCL) Cluster Platform | Heterogeneous Computing, HPC, OpenCL™ Research, Optimization | 2011 |

The New Visualization Engine – The Heterogeneous Processor Unit [FEE REQUIRED] | Heterogeneous Computing | 2012 |

The MOSIX Virtual OpenCL (VCL) Cluster Platform [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research | 2012 |

The Bones Source-to-Source Compiler Manual | Heterogeneous Computing, OpenCL™ Research | 04/05/2012 |

Synthesis of Platform Architectures from OpenCL Programs [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research | 05/01/2011 |

Symbolic Testing of OpenCL Code | C, C++, OpenCL™ Research | 2012 |

SWM: Simplified Wu-Manber for GPU-based Deep Packet Inspection | Heterogeneous Computing, OpenCL™ Research | 2011 |

Swan: A tool for porting CUDA programs to OpenCL [FEE REQUIRED] | OpenCL™ Research, Optimization | 04/2011 |

SU(3) gluodynamics on Graphics Processing Units | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 05/05/2011 |

Streamed Watershed Transform on GPU for Processing of Large Volume Data | Architecture, Heterogeneous Computing, OpenCL™ Research | 2011 |

Solving Molecular Distance Geometry Problems in OpenCL [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research | 05/26/2012 |

Solution to PDEs using radial basis function finite-differences (RBF-FD) on multiple GPUs [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 08/30/2012 |

Software-based branch predication for AMD GPUs [FEE REQUIRED] | Architecture, Optimization | 09/2010 |

SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

Simulating the Spread of Epidemics in Real-world Trading Networks using OpenCL | Heterogeneous Computing, OpenCL™ Research | 12/2011 |

Shortening Design Time through Multiplatform Simulations with a Portable OpenCL Golden-model: The LDPC Decoder Case [FEE REQUIRED] | Architecture, Heterogeneous Computing, HPC, OpenCL™ Research | 05/01/2012 |

Scalable parallel implementation of independent components analysis on the graphics processing unit [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 05/11/2011 |

Scalable multi-precision simulation of spiking neural networks on GPU with OpenCL [FEE REQUIRED] | Architecture, OpenCL™ Research | 06/10/2011 |

RGEM: A Responsive GPGPU Execution Model for Runtime Engines [FEE REQUIRED] | Architecture, Heterogeneous Computing | 12/02/2011 |

Revision of Relational Joins for Multi-Core and Many-Core Architectures | Architecture, Heterogeneous Computing | 2011 |

Research and Application of Parallel Computing Technologies based on CUDA and OpenCL | Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

Relevance-Driven Acquisition and Rapid On-Site Analysis of 3D Geospatial Data | Graphics Development, Heterogeneous Computing, OpenCL™ Research | 2010 |

Realtime scheduling using GPUs - proof of feasibility | Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

Real-time video processing framework for functional testing of the DTV/STB devices based on heterogeneous multi-core platform [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research | 01/12/2011 |

Real-Time Systems with Radiation-Hardened Processors: A GPU-based Framework to Explore Tradeoffs | Heterogeneous Computing, OpenCL™ Research, Optimization | 05/09/2010 |

Real-Time Spherical Panorama Image Stitching Using OpenCL | Architecture, Graphics Development, Heterogeneous Computing, OpenCL™ Research | 2011 |

Real-time gradient vector flow on GPUs using OpenCL [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 06/2012 |

Real-time 3D flash ladar imaging through GPU data processing [FEE REQUIRED] | Graphics Development, Heterogeneous Computing, HPC, OpenCL™ Research, Optimization | 01/24/2011 |

Radar Symposium (IRS), 2011 Proceedings International [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 09/07/2011 |

Quantum.Ligand.Dock: protein–ligand docking with quantum entanglement refinement on a GPU system | Heterogeneous Computing, HPC, OpenCL™ Research | 05/09/2012 |

Provisioning OpenCL Capable Infrastructure with Infiniband Verbs [FEE REQUIRED] | Architecture, HPC, OpenCL™ Research, Optimization | 07/08/2011 |

Programmation multi-accélérateurs unifiée en OpenCL | Architecture, OpenCL™ Research | 2011 |

Proceedings of the IADIS International Conference on Applied Computing 2011 | Heterogeneous Computing, OpenCL™ Research | 2011 |

Practical Oracles for Sequential Code Parallelization [FEE REQUIRED] | Heterogeneous Computing, Optimization | 08/2012 |

Poster: GPU-accelerated rigid body fitting of atomic structures into electron density maps | Heterogeneous Computing, OpenCL™ Research | 02/05/2011 |

Poster: GPU-accelerated artificial neural network for QSAR modeling [FEE REQUIRED] | OpenCL™ Research, Optimization | 02/05/2011 |

Leveraging Binary Translation for Heterogeneous Profiling | Heterogeneous Computing, Optimization | 2011 |

LatticeQCD using OpenCL | Architecture, Heterogeneous Computing, HPC, OpenCL™ Research, OpenGL | 07/16/2011 |

Introduction to GPU Radix Sort | Heterogeneous Computing, OpenCL™ Research | 06/2011 |

Integrated Framework for Heterogeneous Embedded Platforms Using OpenCL | Architecture, Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | 03/01/2011 |

GPU-based motion correction of contrast-enhanced liver MRI scans: An OpenCL implementation [FEE REQUIRED] | OpenCL™ Research | 04/02/2011 |

GPU Parallel Collections For Scala | Architecture, Heterogeneous Computing, Java, OpenCL™ Research, Optimization | 05/2011 |

GPU Linear algebra extensions for GNU/Octave | OpenCL™ Research, Optimization | 2011 |

Accelerating Outlier Detection with Uncertain Data using Graphics Processors | Heterogeneous Computing, OpenCL™ Research, Optimization | 2010 |

Accelerating Foreign-Key Joins using Asymmetric Memory Channels | Architecture, Heterogeneous Computing, Optimization | 2011 |

Accelerating Clustering Coefficient Calculations on a GPU Using OPENCL [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 2010 |

Accelerating a climate physics model with OpenCL | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

Accelerated protein structure comparison using TM-score-GPU | Heterogeneous Computing, OpenCL™ Research, Optimization | 06/02/2012 |

Abstract: F1.00036 : Cell-based Adaptive Mesh Refinement on the GPU with Applications to Exascale Supercomputing | Architecture, Heterogeneous Computing, HPC, OpenCL™ Research | 10/21/2011 |

A Static Task Partitioning Approach for Heterogeneous Systems Using OpenCL [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

A Sequential to Parallel Intermediate Representation Extension | Heterogeneous Computing, OpenCL™ Research | 2012 |

A scalable and portable framework for massively parallel variable selection in genetic association studies | C++, Heterogeneous Computing, OpenCL™ Research | 01/11/2012 |

Portable LDPC Decoding on Multicores Using OpenCL [Applications Corner] [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research | 07/2012 |

Performance characterization of the NAS Parallel Benchmarks in OpenCL [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 11/08/2011 |

Performance Characterization and Optimization of Atomic Operations on AMD GPUs [FEE REQUIRED] | Heterogeneous Computing, Optimization | 09/30/2011 |

Performance and Power Analysis of ATI GPU: A Statistical Approach [FEE REQUIRED] | Architecture, Heterogeneous Computing, HPC, Optimization | 07/30/2011 |

Perceptually optimized real-time computer graphics | Heterogeneous Computing, OpenCL™ Research, Optimization | 05/2012 |

ParModelica: Extending the Algorithmic Subset of Modelica with Explicit Parallel Language Constructs for Multi-core Simulation | Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

Parallelized videocoding: A H.264 Decoder in OpenCL | Heterogeneous Computing, OpenCL™ Research | 03/01/2011 |

Parallelization of KMP String Matching Algorithm on Different SIMD architectures: Multi-Core and GPGPU’s | Architecture, Heterogeneous Computing, Optimization | 07/2012 |

Parallel Processing (ICPP), 2011 International Conference on [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 09/16/2011 |

Parallel paradigms in optimal structural design | Heterogeneous Computing, Optimization | 2012 |

Parallel neural network training with OpenCL [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 05/25/2012 |

Parallel implementation of MOPSO on GPU using OpenCL and CUDA [FEE REQUIRED] | Heterogeneous Computing, HPC, OpenCL™ Research, Optimization | 12/21/2011 |

Parallel computation of a SPECT projection operator for a content adaptative mesh model [FEE REQUIRED] | Graphics Development, Heterogeneous Computing | 05/05/2012 |

Parallel coding for storage systems — An OpenMP and OpenCL capable framework [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 02/29/2012 |

Parallel and Distributed Systems (ICPADS), 2011 IEEE 17th International Conference on [FEE REQUIRED] | Architecture, Heterogeneous Computing, HPC, OpenCL™ Research, Optimization | 12/09/2011 |

Parallel and Distributed Processing with Applications (ISPA), 2012 IEEE 10th International Symposium on [FEE REQUIRED] | Heterogeneous Computing | 07/13/2012 |

Parallel and Distributed Processing with Applications (ISPA), 2011 IEEE 9th International Symposium on [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 05/28/2011 |

Parallel Agent systems on a GPU for use with Simulations and Games | C++, Heterogeneous Computing, OpenCL™ Research, OpenGL, Optimization | - |

Optimizing Techniques for OpenCL Programs on Heterogeneous Platforms [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

Optimizing Option Pricing Algorithms and Profiling Power Consumption on VLIW APU Architecture [FEE REQUIRED] | APU, Architecture, Heterogeneous Computing, HPC, Optimization | 07/13/2012 |

Optimizing OpenCL Kernels for Iterative Statistical Applications on GPUs | Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

Optimizing and multithreading SNPHAP on a multi-core APU with OpenCL [FEE REQUIRED] | APU, Heterogeneous Computing, OpenCL™ Research, Optimization | 06/01/2012 |

Optimizing a Near-duplicate Document Detection System with SIMD Technologies | Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 08/2012 |

OpenCL/OpenGL approach for studying active Brownian motion | OpenCL™ Research, OpenGL, Optimization | 10/08/2012 |

OpenCL, a Viable Solution for High-performance Medical Image Reconstruction? | Architecture, Heterogeneous Computing, HPC, OpenCL™ Research, Optimization | - |

OpenCL-based implementation of an unstructured edge-based finite element convection-diffusion solver on graphics hardware [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 12/12/2011 |

OpenCL-based Algorithm for Heat Load Modelling of District Heating System | Heterogeneous Computing, OpenCL™ Research | - |

OpenCL programming guide [FEE REQUIRED] | Architecture, C++, Heterogeneous Computing, HPC, OpenCL™ Research, Optimization | 08/2011 |

OpenCL and the 13 dwarfs: a work in progress [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

On the Efficacy of a Fused CPU+GPU Processor (or APU) for Parallel Computing [FEE REQUIRED] | APU, Heterogeneous Computing, HPC, Optimization | 07/21/2011 |

OCL-BodyScan: A Case Study for Application-centric Programming of Many-Core Processors [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 09/16/2011 |

Intelligent GPGPU Classification in Volume Visualization: A framework based on Error-Correcting Output Codes [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 11/04/2011 |

Network Simulator Tools and GPU Parallel Systems | Architecture, Heterogeneous Computing, HPC, OpenCL™ Research | 02/14/2012 |

Multiplatform GPGPU implementation of the active contours without edges algorithm | Heterogeneous Computing, OpenCL™ Research, OpenGL | 05/2012 |

Multicore Processing for Classification and Clustering Algorithms | Heterogeneous Computing, OpenCL™ Research | 2012 |

Multi-Object Geodesic Active Contours (MOGAC): A Parallel Sparse-Field Algorithm for Image Segmentation | Graphics Development, Heterogeneous Computing, Optimization | - |

Multi- and Many-Core Data Mining with Adaptive Sparse Grids | Architecture, Heterogeneous Computing, Optimization | 05/04/2011 |

Comparison of OpenMP & OpenCL Parallel Processing Technologies | Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

Modular Arithmetic for Solving Linear Equations on the GPU | Heterogeneous Computing, Optimization | - |

Method and Apparatus for Compiling and Executing an Application Using Virtualization in a Heterogeneous System | Heterogeneous Computing | 03/29/2012 |

MetaCL - A Model-driven Approach to Programming Heterogeneous Architectures Using OpenCL | Architecture, C++, Graphics Development, Heterogeneous Computing, OpenCL™ Research | 05/2012 |

Mesh–particle interpolations on graphics processing units and multicore central processing units [FEE REQUIRED] | Architecture, C++, Heterogeneous Computing, Optimization | 06/2011 |

Medical Image Registration using OpenCL | Architecture, Graphics Development, HPC, OpenCL™ Research | 04/2012 |

MDR: performance model driven runtime for heterogeneous parallel platforms [FEE REQUIRED] | Architecture, Heterogeneous Computing, Optimization | 2011 |

MCMini: Monte Carlo on GPGPU | C, Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

Improving Performance of OpenCL on CPUs [FEE REQUIRED] | CPU Development, OpenCL™ Research, Optimization | 2012 |

Implementing a Code Generator for Fast Matrix Multiplication in OpenCL on the GPU | OpenCL™ Research, Optimization | 07/02/2012 |

Implementation of a Parallel Tree Method on a GPU | Heterogeneous Computing, Optimization | 12/20/2011 |

Implementation and performance analysis of the Simplex algorithm adapted to run on commodity OpenCL enabled graphics processors [FEE REQUIRED] | OpenCL™ Research, Optimization | 10/29/2011 |

Image registration on GPU | C++, Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | 03/14/2011 |

Hybrid OpenCL: Enhancing OpenCL for Distributed Processing [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 05/28/2011 |

Hybrid OpenCL-MPI parallelization of the FDTD method [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research | 09/16/2011 |

Higher Level Programming Abstractions for FPGAs using OpenCL | Heterogeneous Computing, OpenCL™ Research | 2011 |

High-Level Manipulation of OpenCL-Based Subvectors and Submatrices [FEE REQUIRED] | Architecture, C++, Heterogeneous Computing, OpenCL™ Research | 2012 |

High Speed Vector Graphics Rendering on OpenCL Hardware [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research | 2012 |

High performance parallel backprojection on multi-GPU [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 05/31/2012 |

High accuracy gravitational waveforms from black hole binary inspirals using OpenCL [FEE REQUIRED] | Architecture, Heterogeneous Computing, HPC, OpenCL™ Research, Optimization | 2012 |

Hierarchical overlapped tiling [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

Heterogeneous Highly Parallel Implementation of Matrix Exponentiation Using GPU | APU, Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 03/2012 |

Guest Editor's Introduction: Special Issue on High-Performance Computing with Accelerators | Architecture, Heterogeneous Computing, HPC, Optimization | 01/2011 |

Green Computing using Graphical Processing Units | Heterogeneous Computing | 04/2012 |

Graphics Processing Unit Audio Signals Processing in Pure Data and PdCUDA an Implementation with the CUDA Runtime API | Heterogeneous Computing | - |

GPU and APU computations of Finite Time Lyapunov Exponent fields [FEE REQUIRED] | APU, Architecture, Heterogeneous Computing, OpenCL™ Research | 2011 |

GPU Acceleration for the C++ Standard Template Library | C++, Heterogeneous Computing, Optimization | - |

GPU Accelerated Target Tracking Method [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

GPU Accelerated Parallel Cholesky Factorization [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 12/2011 |

GPU accelerated multiplatform FDTD simulator [FEE REQUIRED] | C, C++, Heterogeneous Computing, OpenCL™ Research, Optimization | 04/14/2011 |

GPU accelerated computation of fast spectral transforms | C, C++, Heterogeneous Computing, OpenCL™ Research, Optimization | 12/2011 |

GPU Accelerate Parallel Odd-Even Merge Sort: An OpenCL Method [FEE REQUIRED] | C++, Heterogeneous Computing, OpenCL™ Research | 06/10/2011 |

GPGPU Volume Classification using Simple OpenCL | Heterogeneous Computing, OpenCL™ Research | 2011 |

GLOpenCL: OpenCL support on hardware- and software-managed cache multicores [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

Generating GPU Code from a High-level Representation for Image Processing Kernels | C, C++, Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

Generating Device-specific GPU Code for Local Operators in Medical Imaging [FEE REQUIRED] | C++, Heterogeneous Computing, OpenCL™ Research | 05/25/2012 |

Generalizing the Utility of GPUs in Large-Scale Heterogeneous Computing Systems | Heterogeneous Computing, OpenCL™ Research | - |

General-purpose Graphics Processing Units Deliver New Capabilities to the Embedded Market | Architecture, Heterogeneous Computing, OpenCL™ Research | 05/2011 |

General Purpose Computing on the GPU | Heterogeneous Computing | - |

Gemma in April: A matrix-like parallel programming architecture on OpenCL [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 03/18/2011 |

Fractals Image Rendering and Compression using GPUs | Heterogeneous Computing, OpenCL™ Research | 2012 |

ForOpenCL: Transformations Exploiting Array Syntax in Fortran for Accelerator Programming | Architecture, Heterogeneous Computing, HPC, OpenCL™ Research, Optimization | 07/11/2011 |

Flexible OpenCL accelerated disparity estimation for video communication applications [FEE REQUIRED] | OpenCL™ Research, Optimization | 05/16/2011 |

Fixing Performance Bugs: An Empirical Study of Open-Source GPGPU Programs | Heterogeneous Computing, HPC | - |

Finite element assembly strategies on multi- and many-core architectures | Architecture, Heterogeneous Computing, Optimization | 2011 |

Fast Wavelet Transform Utilizing a Multicore-Aware Framework [FEE REQUIRED] | Architecture, C++, Heterogeneous Computing, HPC, Optimization | 2012 |

Fast GPU Based Adaptive Filtering of 4D Echocardiography [FEE REQUIRED] | Graphics Development, Heterogeneous Computing, OpenCL™ Research | 06/2012 |

Fast calculation of Fresnel diffraction calculation using AMD GPU and OpenCL [FEE REQUIRED] | OpenCL™ Research | 05/09/2011 |

Fast Alignment of Biological Sequences Based on General Purpose Graphic Processor Unit [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 04/2012 |

Expression Templates and OpenCL [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

Exploiting Multi- and Many-core Parallelism for Accelerating Image Compression [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

Exploiting Contextual Information in Image Retrieval Tasks | Heterogeneous Computing | - |

Explicit Cache Management for Volume Ray-Casting on Parallel Architectures | Architecture, Graphics Development, Heterogeneous Computing, Optimization | 2012 |

Evaluation of likelihood functions on CPU and GPU devices | Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

Evaluating the Performance and Portability of OpenCL | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 08/12/2011 |

Eulerian Smoke Simulation on the GPU MSc Computer Animation and Visual Effects Master Thesis | Graphics Development, Heterogeneous Computing, OpenCL™ Research | 08/19/2011 |

EpiGPU: exhaustive pairwise epistasis scans parallelized on consumer level graphics cards [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research | 10/29/2010 |

Enabling Traceability in an MDE Approach to Improve Performance of GPU Applications | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 8/30/2011 |

Enabling task-level scheduling on heterogeneous platforms [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

Enabling Efficient Online Profiling of Homogeneous and Heterogeneous Multicore Systems | Heterogeneous Computing, Optimization | 08/01/2011 |

Enable OpenCL Compiler with Open64 Infrastructures [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 9/2/2011 |

Embedding OpenCL in C++ for Expressive GPU Programming | C++, Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

Accelerating an imaging spectroscopy algorithm for submerged marine environments using heterogeneous computing | Heterogeneous Computing, HPC, Optimization | 1/1/2012 |

Efficient real-time local optical flow estimation by means of integral projections [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research | 9/11/2011 |

Dynamic Heterogeneous Scheduling Decisions Using Historical Runtime Data | Heterogeneous Computing, OpenCL™ Research, Optimization | - |

Divide-and-Conquer 3D Convex Hulls on the GPU | Heterogeneous Computing | 5/6/2012 |

Distributed OpenCL Distributing OpenCL Platform on Network Scale | Architecture, Heterogeneous Computing, HPC, OpenCL™ Research | 6/1/2012 |

Distributed OpenCL : a platform for distributed, heterogeneous computing for domain scientists | Heterogeneous Computing, OpenCL™ Research | 5/29/2012 |

Distributed computer emulation: Using OpenCL framework [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research | 1/27/2011 |

Discrete dipole approximation simulations on GPUs using OpenCL—Application on cloud ice particles [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research | 8/2011 |

Developing A High Performance GPGPU Compiler Using Cetus | Heterogeneous Computing, HPC, Optimization | - |

Designing APU Oriented Scientific Computing Applications in OpenCL [FEE REQUIRED] | APU, Architecture, HPC, OpenCL™ Research | 9/2/2011 |

Design Flows and Run Time Systems For Heterogeneous Multiprocessor Systems on Programmable Chips (MPSoPCs) | Heterogeneous Computing | 2011 |

Design and Implementation of a LLVM based OpenCL compiler [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research | 3/31/2011 |

Democratizing General Purpose GPU Programming through OpenCL | Heterogeneous Computing, OpenCL™ Research | 1/10/2011 |

Computer Simulation of Dark Matter Effects on Galaxy Rotation | C, Heterogeneous Computing, HPC, OpenCL™ Research | 4/5/2011 |

CNN based high performance computing for real time image processing on GPU [FEE REQUIRED] | Heterogeneous Computing, HPC, OpenCL™ Research, Optimization | 7/25/2011 |

Cloth Modeling with a Discrete Cosserat Surface | Heterogeneous Computing | - |

CheCL: Transparent Checkpointing and Process Migration of OpenCL Applications [FEE REQUIRED] | HPC, OpenCL™ Research | 5/16/2011 |

Markov chain Monte Carlo on the GPU | Heterogeneous Computing, OpenCL™ Research | 3/2011 |

Can GPGPU Programming be Liberated from the Data-Parallel Bottleneck? A Style of Braided Parallelism and its Programs (Invited Talk) | C++, Heterogeneous Computing, OpenCL™ Research | 3/31/2012 |

CAMPAIGN: an open-source library of GPU-accelerated data clustering algorithms [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research | 3/10/2011 |

Benchmarks Based on Anti-Parallel Patterns for the Evaluation of GPUs | Heterogeneous Computing, OpenCL™ Research | 2011 |

Benchmarking Energy Efficiency, Power Costs and Carbon Emissions on Heterogeneous Systems [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 2/5/2011 |

Bacon: A GPU Programming System With Just in Time Specialization | C, Heterogeneous Computing, OpenCL™ Research | - |

Automatic translation of CUDA to OpenCL and comparison of performance optimizations on GPUs | OpenCL™ Research, Optimization | 5/25/2011 |

Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators based on a Domain-Specific Language for Medical Imaging | Heterogeneous Computing, OpenCL™ Research, Optimization | 6/25/2012 |

Automatic OpenCL Device Characterization: Guiding Optimized Kernel Design [FEE REQUIRED] | Architecture, OpenCL™ Research, Optimization | 8/29/2011 |

Automatic Multi-GPU Code Generation Applied to Simulation of Electrical Machines [FEE REQUIRED] | Architecture, Heterogeneous Computing, OpenCL™ Research | 2/2012 |

Auto-tuning SkePU: A Multi-Backend Skeleton Programming Framework for Multi-GPU Systems | Architecture, C++, Heterogeneous Computing, OpenCL™ Research, Optimization | 7/28/2011 |

Auto-tuning interactive ray tracing using an analytical GPU architecture model [FEE REQUIRED] | Architecture, Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

Auto-tuning a High-Level Language Targeted to GPU Codes | Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | - |

Auto-tunable GPU BLAS | Heterogeneous Computing, OpenCL™ Research, Optimization | 6/2011 |

ARVO-CL: The OpenCL version of the ARVO package — An efficient tool for computing the accessible surface area and the excluded volume of proteins via analytical equations [FEE REQUIRED] | C, Heterogeneous Computing, OpenCL™ Research | 11/2012 |

Application of GPGPU for Acceleration of Short DNA Sequence Alignment in Unipro UGENE Project | Heterogeneous Computing, Optimization | 2011 |

Applications, tools and techniques on the road to exascale computing [FEE REQUIRED] | Architecture, Heterogeneous Computing, HPC, Optimization | 2012 |

Analyzing program flow within a many-kernel OpenCL application [FEE REQUIRED] | Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

Analysis of GPGPU Platforms Efficiency in General-Purpose Computations | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

An OpenCL implementation for the solution of TDSE on GPU and CPU architectures | Architecture, Heterogeneous Computing, OpenCL™ Research | 1/31/2012 |

An OpenCL Fast Fourier Transformation | Heterogeneous Computing, OpenCL™ Research, Optimization | - |

An OpenCL back-end for Accelerate | Heterogeneous Computing, OpenCL™ Research, Optimization | 5/6/2011 |

An MDE Approach for Automatic Code Generation from MARTE to OpenCL | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 2/2011 |

An Introduction to the OpenCL Programming Model | Heterogeneous Computing, OpenCL™ Research, Optimization | 2012 |

An Exploration of OpenCL on Multiple Hardware Platforms for a Numerical Relativity Application | Architecture, Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

An Examination of the Current State of the Art in SIMD Processor Extensions for CS570 Winter 2012 | Architecture, Heterogeneous Computing, Optimization | 2012 |

An Efficient Parallel GPU Evaluation of Small Angle X-ray Scattering Profiles | Heterogeneous Computing | 2012 |

An Auto-tuning Solution to Data Streams Clustering in OpenCL [FEE REQUIRED] | Architecture, OpenCL™ Research, Optimization | 8/24/2011 |

Algorithm Construction for GPGPU | Heterogeneous Computing, OpenCL™ Research, Optimization | - |

Advanced Programming Platform for efficient use of Data Parallel Hardware | Heterogeneous Computing, Optimization | 3/23/2011 |

Acceleration of Physics Simulation Engine through OpenCL | Heterogeneous Computing, OpenCL™ Research | 2011 |

Accelerating The Cloud with Heterogeneous Computing | Heterogeneous Computing, HPC, Optimization | 2011 |

A quantitative performance analysis model for GPU architectures [FEE REQUIRED] | Architecture, Heterogeneous Computing, Optimization | 2/12/2011 |

A parallel GPU implementation of an algorithm for determining directional distances [FEE REQUIRED] | Heterogeneous Computing, Optimization | 2011 |

A Parallel Architecture for Interactively Rendering Scattering and Refraction Effects [FEE REQUIRED] | Graphics Development, Heterogeneous Computing | 3/2012 |

A New Compilation Path: From Python/NumPy to OpenCL | Heterogeneous Computing, OpenCL™ Research, Optimization | 2011 |

A Modeling Approach based on UML/MARTE for GPU Architecture | Architecture, Heterogeneous Computing, HPC | 5/23/2011 |

A Hybrid Software Framework for the GPU Acceleration of Multi-Threaded Monte Carlo Applications | Architecture, Heterogeneous Computing, Optimization | 2011 |

A high performance parallel DCT with OpenCL on heterogeneous computing environment [FEE REQUIRED] | Graphics Development, Heterogeneous Computing, OpenCL™ Research | 2/2012 |

A Heterogeneous Accelerated Matrix Multiplication: OpenCL + APU + GPU+ Fast Matrix Multiply | APU, Heterogeneous Computing, OpenCL™ Research | 5/14/2012 |

A grid computing environment for undergraduate research [FEE REQUIRED] | Heterogeneous Computing | 6/22/2012 |

A framework to implement a multifrontal scheme on GPU architectures with OpenCL | C, C++, OpenCL™ Research | 3/31/2011 |

A fast GEMM implementation on the cypress GPU [FEE REQUIRED] | Architecture, Heterogeneous Computing, Optimization | 3/2011 |

A fast CAST-based clustering algorithm for very large database [FEE REQUIRED] | Heterogeneous Computing, Optimization | 6/8/2011 |

A Comprehensive Performance Comparison of CUDA and OpenCL [FEE REQUIRED] | Architecture, OpenCL™ Research, Optimization | 9/13/2011 |

A Common GPU n-Dimensional Array for Python and C | C, Heterogeneous Computing | - |

From Computational Science to Science Discovery: The Next Computing Landscape | HPC | 01/22/2010 |

HPC High Performance Linpack for AMD® Opteron™ 6200 Series processors | HPC | 04/23/2012 |

Using ACML (AMD Core Math Library) In High Performance Computing Challenge (HPCC) | HPC, Optimization | 10/03/2012 |

NUMA Aware Heap Memory Manager Article | Architecture, Optimization | 2009 |

Instruction-Based Sampling and AMD CodeAnalyst | Optimization | 03/29/2010 |

Incorporating Instruction-Based Sampling into AMD CodeAnalyst | C, Optimization | 04/08/2010 |

New Round-to-Even Technique for Large-scale Data and Its Application in Integer Scaling | Architecture, C++, Optimization | 06/11/2010 |

Java Performance when Debugging is Enabled | Architecture, Java, Optimization | 05/06/2010 |

Dynamic Whole Program Profiling | Optimization | 09/13/2010 |

OpenCL™ Optimization Case Study: Diagonal Sparse Matrix Vector Multiplication | C, Heterogeneous Computing, OpenCL™ Research, Optimization | 06/10/2010 |

OpenCL™ Optimization Case Study: Simple Reductions | Heterogeneous Computing, OpenCL™ Research, Optimization | 08/25/2010 |

Memory Spaces | C++, OpenCL™ Research, Optimization | 10/27/2010 |

Work-Groups and Synchronization | C, C++, Graphics Development, Heterogeneous Computing, OpenCL™ Research, Optimization | 01/06/2011 |

Making OpenCL™ Simple with Haskell | C, C++, OpenCL™ Research | 02/01/2011 |

OpenCL™ Optimization Case Study: Support Vector Machine Training | OpenCL™ Research, Optimization | 02/11/2011 |

Programming models for next generation of GPGPU architectures | Architecture, C, C++, Heterogeneous Computing, OpenCL™ Research | 2/27/2011 |

Coordinating Computations with OpenCL Queues | C++, Heterogeneous Computing, OpenCL™ Research | 03/11/2011 |

AMD Offers Alternative To CUDA For Parallelism | Heterogeneous Computing, OpenCL™ Getting Started | 03/23/2011 |

OpenCL™ and the AMD APP SDK v2.4 | Heterogeneous Computing, HPC, OpenCL™ Getting Started, OpenCL™ SDK | 04/06/2011 |

Primitive Restart and OpenGL Interoperability | Graphics Development, Heterogeneous Computing, OpenCL™ Research, OpenGL, Optimization | 05/24/2011 |

OpenCL Buffers and Memory Affinity | C++, Heterogeneous Computing, OpenCL™ Research | 05/24/2011 |

APU 101: All about AMD Fusion Accelerated Processing Units | APU, Architecture, DirectX, Heterogeneous Computing, OpenCL™ Getting Started, Optimization | 5/31/2011 |

Supercomputer Performance on a Chip Powers Next-Generation Embedded Image Processing | APU, Architecture, C, Graphics Development, Heterogeneous Computing, HPC, OpenCL™ Research, Optimization | 06/23/2011 |

Bulk Encryption on GPUs | Heterogeneous Computing, OpenCL™ Research, Optimization | 10/12/2011 |

OpenCL™ Optimization Case Study Fast Fourier Transform – Part I | OpenCL™ Research, Optimization | 11/1/2011 |

OpenCL™ Optimization Case Study Fast Fourier Transform – Part II | OpenCL™ Research, Optimization | 11/11/2011 |

Tiled Convolution: Fast Image Filtering | Graphics Development, OpenCL™ Research | 12/5/2011 |

JPEG Decoding with Run-Length Encoding: A CPU and GPU Approach | Heterogeneous Computing, OpenCL™ Research | 01/31/2012 |

OpenCL™ plugins | C, C++, Heterogeneous Computing, OpenCL™ Research, OpenGL | 03/30/2012 |

OpenCL™ Extensions and Device Fission | OpenCL™ Research | 03/30/2012 |

Heterogeneous workflows using OpenCL™ | C, C++, Heterogeneous Computing, Java, OpenCL™ Research | 03/30/2012 |