Skip to content

Latest commit

 

History

History
42 lines (27 loc) · 1.09 KB

README.md

File metadata and controls

42 lines (27 loc) · 1.09 KB

sse2neon

IMPORTANT NOTE

This project is now deprecated!

Other software engineers have been making continued improvements on this project in another GitHub repot.

This is NO LONGER considered the official depot for SSE2NEON!!

You should please be redirected to:

https://github.com/DLTcollab/sse2neon

A C/C++ header file that converts Intel SSE intrinsics to ARN NEON intrinsics.

Info

The SIMD instruction set of Intel, which is known as SSE is used in many applications for improved performance. ARM also have introduced an SIMD instruction set called Neon to their processors. Rewriting code written for SSE to work on Neon is very time consuming. This is a header file that can automatically convert some of the SSE instricts into NEON instricts.

Usage

  • Put the SSE2NEON.h file in to your source code directory.

  • Locate the following SSE header files included in the code:

    #include <xmmintrin.h>
    #include <emmintrin.h>
  • Replace them with :
#include "SSE2NEON.h"
  • On Linux compile your code with the following gcc/g++ flag:
-mfpu=neon