Native _mm_load_ps: 2.5459ns -> 0.5112ns. Native SSE1 is 4.98x FASTER than native scalar. | JS _mm_load_ps: 4.1821ns -> 0.9198ns. JS SSE1 is 4.55x FASTER than JS scalar. |
JS _mm_load_ps: JS scalar is 1.64x SLOWER than native scalar. | JS _mm_load_ps: JS SSE1 is 1.80x SLOWER than native SSE1. |
Native _mm_set_ps: 2.5459ns -> 0.5385ns. Native SSE1 is 4.73x FASTER than native scalar. | JS _mm_set_ps: 4.1821ns -> 0.6120ns. JS SSE1 is 6.83x FASTER than JS scalar. |
JS _mm_set_ps: JS scalar is 1.64x SLOWER than native scalar. | JS _mm_set_ps: JS SSE1 is 1.14x SLOWER than native SSE1. |
Native _mm_shuffle_ps: 0.0000ns -> 0.0087ns. Native SSE1 is 8705.00x SLOWER than native scalar. | JS _mm_shuffle_ps: 0.0001ns -> 0.2164ns. JS SSE1 is 3607.07x SLOWER than JS scalar. |
JS _mm_shuffle_ps: JS scalar is 60.00x SLOWER than native scalar. | JS _mm_shuffle_ps: JS SSE1 is 24.86x SLOWER than native SSE1. |
Native _mm_max_ps: 2.0442ns -> 0.2855ns. Native SSE1 is 7.16x FASTER than native scalar. | JS _mm_max_ps: 2.0055ns -> 0.4991ns. JS SSE1 is 4.02x FASTER than JS scalar. |
JS _mm_max_ps: JS scalar is 1.02x FASTER than native scalar. | JS _mm_max_ps: JS SSE1 is 1.75x SLOWER than native SSE1. |
Native _mm_move_ss: 2.5459ns -> 0.7399ns. Native SSE1 is 3.44x FASTER than native scalar. | JS _mm_move_ss: 4.1821ns -> 1.0933ns. JS SSE1 is 3.83x FASTER than JS scalar. |
JS _mm_move_ss: JS scalar is 1.64x SLOWER than native scalar. | JS _mm_move_ss: JS SSE1 is 1.48x SLOWER than native SSE1. |
Native _mm_store_ps: 2.5459ns -> 0.4673ns. Native SSE1 is 5.45x FASTER than native scalar. | JS _mm_store_ps: 4.1821ns -> 0.6469ns. JS SSE1 is 6.46x FASTER than JS scalar. |
JS _mm_store_ps: JS scalar is 1.64x SLOWER than native scalar. | JS _mm_store_ps: JS SSE1 is 1.38x SLOWER than native SSE1. |
Native _mm_and_ps: 0.0985ns -> 0.0087ns. Native SSE1 is 11.31x FASTER than native scalar. | JS _mm_and_ps: 0.2970ns -> 0.1559ns. JS SSE1 is 1.90x FASTER than JS scalar. |
JS _mm_and_ps: JS scalar is 3.02x SLOWER than native scalar. | JS _mm_and_ps: JS SSE1 is 17.91x SLOWER than native SSE1. |
Native _mm_add_ps: 1.6951ns -> 0.3804ns. Native SSE1 is 4.46x FASTER than native scalar. | JS _mm_add_ps: 1.3235ns -> 0.3267ns. JS SSE1 is 4.05x FASTER than JS scalar. |
JS _mm_add_ps: JS scalar is 1.28x FASTER than native scalar. | JS _mm_add_ps: JS SSE1 is 1.16x FASTER than native SSE1. |
Native _mm_rcp_ps: 3.3810ns -> 0.2796ns. Native SSE1 is 12.09x FASTER than native scalar. | JS _mm_rcp_ps: 7.8851ns -> 0.8824ns. JS SSE1 is 8.94x FASTER than JS scalar. |
JS _mm_rcp_ps: JS scalar is 2.33x SLOWER than native scalar. | JS _mm_rcp_ps: JS SSE1 is 3.16x SLOWER than native SSE1. |
Native _mm_cmpeq_ps: 1.4005ns -> 0.2784ns. Native SSE1 is 5.03x FASTER than native scalar. | JS _mm_cmpeq_ps: 1.2527ns -> 0.4402ns. JS SSE1 is 2.85x FASTER than JS scalar. |
JS _mm_cmpeq_ps: JS scalar is 1.12x FASTER than native scalar. | JS _mm_cmpeq_ps: JS SSE1 is 1.58x SLOWER than native SSE1. |
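
The "Nx FASTER/SLOWER" figures above appear to be plain ratios of the two per-operation times, with the larger time divided by the smaller. The following is a minimal sketch of that arithmetic, not the benchmark's actual reporting code: the helper name `compare`, its parameters, and the tie handling (equal times reported as FASTER) are assumptions made for illustration. Because the log prints times rounded to four decimals, ratios recomputed from the printed values can differ in the last digit, and rows whose baseline prints as 0.0000ns (such as _mm_shuffle_ps) cannot be reproduced from the log at all.

```python
def compare(subject, subject_ns, baseline, baseline_ns):
    """Phrase a timing comparison the way the log lines above do.

    Hypothetical helper: the name, signature, and tie handling are
    assumptions for illustration, not taken from the benchmark source.
    """
    if subject_ns <= 0 or baseline_ns <= 0:
        # A zero or negative time gives a meaningless (or undefined) ratio.
        raise ValueError("times must be positive to form a ratio")
    if subject_ns <= baseline_ns:
        # Subject takes less time per operation, so it is faster.
        ratio, word = baseline_ns / subject_ns, "FASTER"
    else:
        # Subject takes more time per operation, so it is slower.
        ratio, word = subject_ns / baseline_ns, "SLOWER"
    return f"{subject} is {ratio:.2f}x {word} than {baseline}."


if __name__ == "__main__":
    # Times copied from the _mm_load_ps rows of the log.
    print(compare("Native SSE1", 0.5112, "native scalar", 2.5459))
    # Native SSE1 is 4.98x FASTER than native scalar.
    print(compare("JS SSE1", 0.9198, "native SSE1", 0.5112))
    # JS SSE1 is 1.80x SLOWER than native SSE1.
```

Dividing the larger time by the smaller keeps every reported factor at or above 1.00x, which matches how the log words each comparison.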