Last Updated: February 25, 2016

Intel OpenCL Implicit Vectorizer

OpenCL compilers differ from vendor to vendor. Intel's includes an implicit vectorizer that automatically vectorizes some kernels and loops so they can take advantage of SSE and AVX instructions.
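To make the idea concrete, here is a minimal sketch in plain C of what implicit vectorization amounts to conceptually. It is not Intel's actual compiler output. The function names `scale_add_scalar` and `scale_add_packed4` are made up for illustration, and the width of four assumes 32-bit floats in a 128-bit SSE register. The first function is the scalar code you write (iteration `i` plays the role of work-item `i`); the second shows the same work grouped four elements at a time, where the inner `lane` loop stands in for a single SIMD instruction.

```c
#include <assert.h>
#include <stddef.h>

/* Scalar form: one element (one "work-item") per iteration. */
void scale_add_scalar(const float *x, const float *y, float a,
                      float *out, size_t n)
{
    assert(n == 0 || (x && y && out));
    for (size_t i = 0; i < n; ++i)
        out[i] = a * x[i] + y[i];
}

/* Conceptual packed form: four neighbouring elements per step.
   A real vectorizer would replace the inner lane loop with one
   SIMD multiply and one SIMD add (assumption: 4 floats per SSE register). */
void scale_add_packed4(const float *x, const float *y, float a,
                       float *out, size_t n)
{
    assert(n == 0 || (x && y && out));
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        for (size_t lane = 0; lane < 4; ++lane)
            out[i + lane] = a * x[i + lane] + y[i + lane];
    }
    /* Remainder elements that do not fill a full group of four. */
    for (; i < n; ++i)
        out[i] = a * x[i] + y[i];
}
```

Both functions compute the same values. The point of the implicit vectorizer is that you only write the first form and the compiler derives something like the second.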

For example, running the Black-Scholes equation as single-threaded C99 and as single-threaded OpenCL gives the execution times below:

Input: 10 MB of data
Both versions calculate the call and the put option
Both use the -O3 compiler option of gcc-4.4

C99: 1612.203 ms
OpenCL: 673.248 ms

That is roughly a 2.4x speedup on the same single thread. This 'hidden' optimization is kinda cool, isn't it?

#opencl

#implicit vectorizer

Written by yuriardila
