Parallel Programming
  • Introduction
  • Parallel Computing
  • Matrix Multiply Problem
  • Bigger Matrix Multiply problem
  • Introduction to OpenCL
  • Profiling on OpenCl
  • GPU Considerations
  • OpenCl on Nvidia
  • OpenCl with Matlab
  • Juno Plarform
  • Deep Learning SpeedUp
  • Matlab and Mex
  • Bibliography
Powered by GitBook
On this page

Was this helpful?

OpenCl on Nvidia

Queryhost results on TitanX

laraujo@lindev:~/work/learningOpenCl/chap1$ ./queryHost 
Number of OpenCL platforms found: 1
CL_PLATFORM_PROFILE: FULL_PROFILE
CL_PLATFORM_VERSION: OpenCL 1.2 CUDA 8.0.20
CL_PLATFORM_NAME: NVIDIA CUDA
CL_PLATFORM_VENDOR: NVIDIA Corporation
CL_PLATFORM_EXTENSIONS: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_khr_gl_event
Number of detected OpenCL devices: 1
GPU detected
    Device name is GeForce GTX TITAN X
    Device vendor is NVIDIA Corporation
    VENDOR ID: 0x10de
    Device max memory allocation: 3071 mega-bytes
    Device global cacheline size: 128 bytes
    Device global mem: 12284 mega-bytes
    Maximum number of parallel compute units: 24
    Maximum dimensions for global/local work-item IDs: 3
    Maximum number of work-items in each dimension: ( 1024 1024 64  )
    Maximum number of work-items in a work-group: 1024
PreviousGPU ConsiderationsNextOpenCl with Matlab

Last updated 5 years ago

Was this helpful?