Skip to content
This repository has been archived by the owner on Apr 2, 2021. It is now read-only.

In depth SIMD level

Jordan Peck edited this page Mar 15, 2017 · 11 revisions

CPU instruction support

SSE2

  • Intel Pentium 4 - 2001
  • AMD Opteron/Athlon - 2003

SEE4.1

  • Intel Penryn - 2007
  • AMD Bulldozer - Q4 2011

AVX

  • Intel Sandy Bridge - Q1 2011
  • AMD Bulldozer - Q4 2011

AVX2

  • Intel Haswell - Q2 2013
  • AMD Carrizo - Q2 2015

FMA3

  • Intel Haswell - Q2 2013
  • AMD Piledriver - 2012

AVX512

  • Not available to consumers yet

Performance Comparisons

Timings below are x1000 ns to generate 32x32x32 points of noise.

  • CPU: Intel Xeon Skylake @ 2.0Ghz
  • Compiler: Intel 17.0 x64
Noise Type AVX512 AVX2 SSE4.1 SSE2 FastNoise FastNoise 2D
White Noise 12 17 32 95 194 179
Value 127 197 428 1060 935 519
Value Fractal 406 652 1333 3034 3075 1604
Perlin 199 444 818 1334 1315 645
Perlin Fractal 632 1367 2210 3958 4264 1864
Simplex 195 417 724 790 1612 983
Simplex Fractal 678 1367 2338 2801 5766 3174
Cellular 1300 1870 4121 5246 4469 1496
Cubic 1297 1880 3531 7517 3648 1258
Cubic Fractal 3798 5729 11719 22813 13770 4160
CF Sampled 1 684 996 1875 3438
CF Sampled 2 135 216 418 677
CF Sampled 3 54 143 211 267

CF (Cubic Fractal) Sampled {sampling level}: Comparison of performance increases with different levels of sampling

Clone this wiki locally