https://en.wikipedia.org/wiki/OpenCLarrow-up-right
https://en.wikipedia.org/wiki/Task_parallelismarrow-up-right
https://en.wikipedia.org/wiki/Data_parallelismarrow-up-right
https://computing.llnl.gov/tutorials/parallel_comp/arrow-up-right
http://cs.nyu.edu/courses/spring12/CSCI-GA.3033-012/lecture1.pdfarrow-up-right
https://www.pgroup.com/lit/articles/insider/v2n1a5.htmarrow-up-right
https://cinwell.wordpress.com/2013/09/06/overview-of-gpu-architecture-fermi-based/arrow-up-right
https://blogs.msdn.microsoft.com/nativeconcurrency/2012/03/26/warp-or-wavefront-of-gpu-threads/arrow-up-right
http://blog.accelereyes.com/blog/2012/02/17/opencl_vs_cuda_webinar_recap/arrow-up-right
http://www.nvidia.com/content/GTC-2010/pdfs/2008_GTC2010.pdfarrow-up-right
http://www.cs.bris.ac.uk/home/simonm/workshops/OpenCL_lecture4.pdfarrow-up-right
http://stackoverflow.com/questions/10460742/how-do-cuda-blocks-warps-threads-map-onto-cuda-coresarrow-up-right
http://slideplayer.com/slide/6003993/arrow-up-right
https://developer.nvidia.com/openclarrow-up-right
http://www.cc.gatech.edu/~vetter/keeneland/tutorial-2011-04-14/06-intro_to_opencl.pdfarrow-up-right
http://www.nvidia.com/content/gtc/documents/1409_gtc09.pdfarrow-up-right
https://www.youtube.com/watch?v=hUiX8rBcNzwarrow-up-right
https://www.youtube.com/watch?v=M6vpq6s1h_Aarrow-up-right
https://anteru.net/blog/2012/11/03/2009/arrow-up-right
https://www.youtube.com/watch?v=oc1-y1V1TPQ&list=PLWVFhSrgolFRatvUa1viEJJy0SV2sfqFrarrow-up-right
https://streamcomputing.eu/blog/2015-03-16/how-to-install-opencl-on-windows/arrow-up-right
https://www.youtube.com/watch?v=8D6yhpiQVVIarrow-up-right
https://www.youtube.com/playlist?list=PLTfYiv7-a3l7mYEdjk35wfY-KQj5yVXO2arrow-up-right
http://cims.nyu.edu/~schlacht/OpenCLModel.pdfarrow-up-right
http://www.slideshare.net/vladimirstarostenkov/hands-on-openclarrow-up-right
http://embedded-computing.com/articles/understand-the-mobile-graphics-processing-unit/#arrow-up-right
http://www.slideshare.net/TomaszBednarz1/introduction-to-opencl-2010arrow-up-right
https://www.fixstars.com/en/opencl/book/OpenCLProgrammingBook/first-opencl-program/arrow-up-right
https://streamcomputing.eu/blog/2013-06-03/the-application-areas-opencl-can-be-used/arrow-up-right
https://www.khronos.org/files/opencl-1-1-quick-reference-card.pdfarrow-up-right
https://www.khronos.org/files/opencl20-quick-reference-card.pdfarrow-up-right
http://www.iwocl.org/wp-content/uploads/iwocl-2016-opencl-caffe.pdfarrow-up-right
https://github.com/BVLC/caffe/tree/opencl/src/caffe/greenteaarrow-up-right
https://github.com/amd/OpenCL-caffe/blob/stable/src/caffe/ocl/arrow-up-right
http://developer.amd.com/resources/articles-whitepapers/opencl-optimization-case-study-support-vector-machine-training/arrow-up-right
http://www.haifux.org/lectures/267/OpenCL_Dos_and_Donts.pdfarrow-up-right
http://www.nvidia.com/content/GTC/documents/1068_GTC09.pdfarrow-up-right
https://github.com/clMathLibraries/clBLASarrow-up-right
https://www.youtube.com/playlist?list=PLzy5q1NUJKCJocUKsRxZ0IPz29p38xeM-arrow-up-right
http://www.seehuhn.de/pages/lineararrow-up-right
http://stackoverflow.com/questions/6890302/barriers-in-openclarrow-up-right
https://www.gnu.org/software/gsl/manual/html_node/GSL-CBLAS-Examples.htmlarrow-up-right
http://stackoverflow.com/questions/3606636/cuda-model-what-is-warp-sizearrow-up-right
https://www.amd.com/Documents/GCN_Architecture_whitepaper.pdfarrow-up-right
https://www.cs.utexas.edu/~pingali/CS378/2015sp/lectures/BasicGPUPerformance.pdfarrow-up-right
https://www.cvg.ethz.ch/teaching/2011spring/gpgpu/GPU-Optimization.pdfarrow-up-right
https://nvlabs.github.io/moderngpu/performance.htmlarrow-up-right
http://courses.cms.caltech.edu/cs179/2015_lectures/cs179_2015_lec05.pdfarrow-up-right
http://www.ertl.jp/~shinpei/papers/icpads13.pdfarrow-up-right
http://blogs.mathworks.com/loren/2012/12/14/measuring-gpu-performance/arrow-up-right
http://www.stuffedcow.net/research/cudabmkarrow-up-right
http://stackoverflow.com/questions/4097635/how-many-memory-latency-cycles-per-memory-access-type-in-opencl-cudaarrow-up-right
https://en.wikipedia.org/wiki/Amdahl%27s_lawarrow-up-right
https://www.javacodegeeks.com/2013/02/amdahls-law-illustrated.htmlarrow-up-right
https://streamcomputing.eu/blog/2015-08-14/opencl-basics-multiple-opencl-devices-with-the-icd/arrow-up-right
http://opencl.codeplex.com/wikipage?title=OpenCL Tutorials - 1arrow-up-right
http://dhruba.name/2012/08/16/opencl-cookbook-building-a-program-and-debugging-failures/arrow-up-right
https://www.olcf.ornl.gov/tutorials/opencl-vector-addition/arrow-up-right
http://will-landau.com/gpu/lectures/cudac-atomics/cudac-atomics.pdfarrow-up-right
http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0538f/BABHCEGA.htmlarrow-up-right
http://sett.com/gpgpuarrow-up-right
http://stackoverflow.com/questions/17704216/opencl-image2d-and-image3d-memory-layoutarrow-up-right
https://www.microway.com/hpc-tech-tips/cuda-host-to-device-transfers-and-data-movement/arrow-up-right
https://en.wikipedia.org/wiki/CUDA_Pinned_memoryarrow-up-right
http://courses.cms.caltech.edu/cs179/2015_lectures/cs179_2015_lec13.pdfarrow-up-right
http://baptiste-wicht.com/posts/2011/09/profile-c-application-with-callgrind-kcachegrind.htmlarrow-up-right
https://web.stanford.edu/class/cs107/guide_callgrind.htmlarrow-up-right
https://www.youtube.com/watch?v=fvTsFjDuag8&list=PLGvfHSgImk4ZZq5KWX0mGT0kgwy9-I-Qearrow-up-right
https://devblogs.nvidia.com/parallelforall/how-implement-performance-metrics-cuda-cc/arrow-up-right
https://web.stanford.edu/class/cs107/guide_gdb.htmlarrow-up-right
http://web.stanford.edu/~adyuen/107arrow-up-right
https://www.tutorialspoint.com/cprogramming/c_multi_dimensional_arrays.htmarrow-up-right
http://gernotklingler.com/blog/gprof-valgrind-gperftools-evaluation-tools-application-level-cpu-profiling-linux/arrow-up-right
http://eli.thegreenplace.net/2015/memory-layout-of-multi-dimensional-arrays/arrow-up-right
https://kusemanohar.wordpress.com/2012/08/13/c-performance-analysis-profiling-tools/arrow-up-right
http://www.brendangregg.com/perf.htmlarrow-up-right
https://github.com/CppCon/CppCon2014arrow-up-right
https://www.youtube.com/watch?v=W0gtG67GUIwarrow-up-right
https://www.youtube.com/watch?v=FJW8nGV4jxYarrow-up-right
http://sandsoftwaresound.net/perf/perf-tutorial-hot-spots/arrow-up-right
https://devblogs.nvidia.com/parallelforall/cudacasts-episode-19-cuda-6-guided-performance-analysis-visual-profiler/arrow-up-right
http://on-demand.gputechconf.com/gtc/2013/webinar/gtc-express-guided-analysis-nvidia-visual-profiler.pdfarrow-up-right
http://gpgpu-computing4.blogspot.co.uk/2009/10/matrix-multiplication-3-opencl.htmlarrow-up-right
http://www.cs.bris.ac.uk/home/simonm/workshops/OpenCL_lecture3.pdfarrow-up-right
http://parallelis.com/how-to-measure-opencl-kernel-execution-time/arrow-up-right
http://www.cmsoft.com.br/opencl-tutorial/case-study-matrix-multiplication/arrow-up-right
http://www.cmsoft.com.br/opencl-tutorial/case-study-high-performance-convolution-using-opencl-__local-memory/arrow-up-right __
http://developer.amd.com/tools-and-sdks/opencl-zone/opencl-resources/programming-in-opencl/image-convolution-using-opencl/image-convolution-using-opencl-a-step-by-step-tutorial/arrow-up-right
https://devblogs.nvidia.com/parallelforall/cuda-7-5-pinpoint-performance-problems-instruction-level-profiling/arrow-up-right
http://www.cedricnugteren.nl/tutorial.phparrow-up-right
https://www.sharcnet.ca/help/index.php/Porting_CUDA_to_OpenCLarrow-up-right
http://developer.amd.com/tools-and-sdks/opencl-zone/opencl-resources/programming-in-opencl/porting-cuda-applications-to-opencl/arrow-up-right
http://btorpey.github.io/blog/2014/02/18/clock-sources-in-linux/arrow-up-right
https://github.com/opencv/opencv_contrib/blob/master/modules/dnn/src/opencl/im2col.clarrow-up-right
https://github.com/hughperkins/DeepCLarrow-up-right
https://classes.soe.ucsc.edu/ee264/Fall11/cmex.pdfarrow-up-right
http://www.shawnlankton.com/2008/03/getting-started-with-mex-a-short-tutorial/arrow-up-right
http://uk.mathworks.com/matlabcentral/newsreader/view_thread/241754arrow-up-right
http://stackoverflow.com/questions/11220250/how-do-i-profile-a-mex-function-in-matlabarrow-up-right
http://uk.mathworks.com/help/matlab/matlab_external/debugging-on-linux-platforms.htmlarrow-up-right
https://www.youtube.com/watch?v=4vAuwk3bj4sarrow-up-right
https://www.youtube.com/watch?v=pvuCg2yT5wYarrow-up-right
http://mdkey.org/?p=174arrow-up-right
Last updated 6 years ago
Was this helpful?