OpenACC workshop at TU Dortmund
Applied Parallel Computing LLC has delivered the OpenACC Workshop at the Technical University of Dortmund, Germany. The workshop has been kindly supported by NVID...
Applied Parallel Computing LLC has delivered the 4-day CUDA Course at Airbus Defence and Space, Ulm, Germany.
Runtime Compilation (NVRTC), introduced in CUDA 7.0, allows CUDA kernels to be compiled dynamically during program execution (see example). This functionality allows to...
Starting from GK110 (Tesla Kepler), the “const restrict” annotation on a kernel argument has an extra GPU-specific meaning: accesses to that argument should go through ...
Thrust is a very handy STL-like template library for rapid data processing on GPUs.
Largely thanks to LLVM, recent years have seen a significant increase of interest in the research & development of domain-specific compilation tools. With the re...
Applied Parallel Computing LLC has delivered the GPU Computing Workshop at Space Technologies Research Institute (TÜBİTAK UZAY), Ankara, Turkey. We would like to ...
It is well known that the CUDA toolkit uses an LLVM backend, but the version number it uses is not shown. We can use gdb and an LLVM API function to print the version string:
In the CUDA 6.0 release, an extremely handy feature was added to Visual Profiler: support for remote profiling. This means that you can run the profiler GUI from ...
Juraj Kardoš – University of Lugano summer intern and our collaborator – presents a talk on efficient CPU-GPU data transfers and CUDA 6.0 Unified Virtual Memory o...