Design Patterns

GPU Coder™ supports some design patterns that map efficiently to GPU structures.

Stencil Processing

Stencil kernel operations compute each element of the output array as a function of a small region of the input array. You can express many filtering operations as a stencil operation. Examples include convolution, median filtering, and finite element methods.

In the GPU Coder implementation of the stencil kernel, each thread computes one element of the output array. Because a given input element is accessed repeatedly for computing multiple neighboring output elements, GPU Coder uses shared memory to improve memory bandwidth and data locality.

Use thegpucoder.stencilKernelfunction and create CUDA^®code for stencil functions. For an example that demonstrates stencil preocessing, seeStencil Processing on GPU.

For very large input sizes, thegpucoder.stencilKernel功能可能会产生CUDA的代码没有号码ically match the MATLAB^®simulation. In such cases, consider reducing the size of the input to produce accurate results.

Matrix-Matrix Processing

Many scientific applications contain matrix-matrix operations including the GEneral Matrix to Matrix Multiplication (GEMM), of the formC = ABwhere you can optionally transposeAandB. The code for such matrix-matrix operations typically takes the pattern:

forx = 1:Mfory = 1:Nforz = 1:K C(x,y) = F(A(x,z),B(z,y));endendend

whereF()is a user-defined function. In these operations, a row from one input matrix and a column from the second input matrix is used to compute the corresponding element of the output matrix. Every thread reloads the row and column. This design pattern allows optimization of this structure by reusing data and making each thread compute multiple output elements.

For example,F()can be a regular matrix multiply,F()=@mtimes. For such patterns, GPU Coder provides theMatrixMatrixkernel to create a highly efficient, fast implementation of matrix-matrix operations on the GPU.

Use thegpucoder.matrixMatrixKernelfunction and create CUDA code for performing matrix-matrix type operations.

Related Examples

Stencil Processing on GPU

Design Patterns

Stencil Processing

Matrix-Matrix Processing

See Also

Related Examples

More About