Applied Parallel Computing LLC: Coding Blog

Calling CUDA device function from OpenACC Fortran kernel

OpenACC is known to be a fast method of developing quite efficient GPU-enabled applications. It is also possible to mix CUDA kernels and libraries with OpenACC ke...

Jul 11, 2014 Software Engineering, OpenACC, CUDA Fortran

International Summer Supercomputing Academy at the Lomonosov Moscow State University

Applied Parallel Computing LLC has given a talk at the International Summer Supercomputing Academy, Lomonosov Moscow State University, Moscow.

Jun 24, 2014 Software Engineering, Trainings, CUDA, OpenACC, OpenCL, Xeon Phi

Jetson K1: bandwidthTest

Chart on the left shows the bandwidths of memory transfers on Jetson K1 (Click to enlarge). For the baseline we also added GTX680M’s host-device and device-host (...

Jun 15, 2014 Software Engineering, Jetson

Jetson K1: from unboxing straight to CUDA in 5 steps

We finally got the most wanted Jetson K1 board in the house! In this post we show how to turn a just unboxed tiny board into fully-functional CUDA development nod...

Jun 14, 2014 Software Engineering, Jetson

Training on High Performance Computing on GPUs at Perm State University

Applied Parallel Computing LLC has delivered a GPU computing training at the “High Performance Computing on GPUs” conference, Perm State University, Russia

Jun 02, 2014 Software Engineering, Trainings, CUDA

How to break Ubuntu 13.04/14.04 with vanilla CUDA driver and unbreak it back

After installing CUDA driver from NVIDIA website, Ubuntu 13.04/14.04 window manager decorations (Unity, via Compiz) may stop working properly on Optimus machines ...

Jun 01, 2014 Software Engineering, CUDA

CSCS-USI Summer School 2014

Applied Parallel Computing LLC will participate in CSCS-USI Summer School, to be held from 30 June to 10 July at Hotel Serpiano (Ticino, Switzerland). We will giv...

May 12, 2014 Software Engineering, Trainings, CUDA

Improving CUDA profiler output of the MPI-CUDA program

Consider we need to profile the following MPI-CUDA program on GPU cluster. The most obvious way to profile this code on console-only cluster would be to invoke th...

Apr 24, 2014 Software Engineering, CUDA, MPI

One non-obvious reason of 'Illegal instruction' in GPU code

If cuda-gdb throws Program received signal CUDA_EXCEPTION_4, Warp Illegal Instruction. for the following code line:

Apr 12, 2014 Software Engineering, CUDA

HPC Advisory Council Switzerland

Together with out colleagues from CSCS we presented introductory talks on GPU and CUDA on the first day of HPC Advisory Council Switzerland.

Apr 12, 2014 Software Engineering, Trainings, CUDA