Add a new architecture mode: 'avx512-sr'. #4025

mulugetam · 2024-11-12T17:18:16Z

This PR adds a new architecture mode to support the new extensions to AVX512, namely AVX512-FP16, which have been available since Intel® Sapphire Rapids.

This PR is a prerequisite for PR#4020 that speeds up hamming distance evaluations.

… to AVX512. Signed-off-by: Mulugeta Mammo <[email protected]>

mengdilin · 2024-11-12T17:46:40Z

Hmm weird the CIs are not running on the PR. Do you mind pushing a new commit and see if the CIs start?

Signed-off-by: Mulugeta Mammo <[email protected]>

mulugetam · 2024-11-12T18:26:15Z

@mengdilin Pushed a new commit but CI not starting. Is this possibly because I updated .github/workflows/build.yml?

mengdilin · 2024-11-12T19:33:46Z

Yea there is a syntax error in the build file: see https://github.com/facebookresearch/faiss/actions/runs/11804384709

mengdilin · 2024-11-12T19:34:25Z

.github/workflows/build.yml

@@ -67,6 +67,17 @@ jobs:
        uses: ./.github/actions/build_cmake
        with:
          opt_level: avx512
+  linux-x86_64-AVX512-cmake:


linux-x86_64-AVX512-advanced-cmake

Signed-off-by: Mulugeta Mammo <[email protected]>

…vx512-sr

mulugetam · 2024-11-12T22:29:04Z

@mengdilin CI is picking g++ version 11. But AVX512-FP16 (-mavx512fp16) requires version 12+.

mengdilin · 2024-11-12T22:49:07Z

Ah yea the conda publication CI is using the older compiler version. We should investigate on our side; however, I don't think we want to publish this architecture mode to conda right now, can you omit it?

Looks like CI failure is coming from a unit test failure from

faiss/tests/test_contrib.py

Line 552 in 0fb56d9

def test_ivf_train_2level(self):

Try commenting out that test and see if anything else fails?
I'm going on PTO, handing this over to the next performance oncall @kuarora

mengdilin · 2024-11-12T22:48:02Z

conda/faiss/build-lib.sh

      -DFAISS_ENABLE_GPU=OFF \
      -DFAISS_ENABLE_PYTHON=OFF \
      -DBLA_VENDOR=Intel10_64lp \
      -DCMAKE_INSTALL_LIBDIR=lib \
      -DCMAKE_BUILD_TYPE=Release .

-make -C _build -j$(nproc) faiss faiss_avx2 faiss_avx512
+make -C _build -j$(nproc) faiss faiss_avx2 faiss_avx512 faiss_avx512_sr


This is used for faiss's conda packaging upload. I don't think we want to expose this build mode yet in conda officially. Can you omit this for now?

Signed-off-by: Mulugeta Mammo <[email protected]>

mulugetam · 2024-11-13T16:51:37Z

Thanks @mengdilin. @kuarora Could you please review?

alexanderguzhva · 2024-11-13T17:12:34Z

@mulugetam I would use -march=sapphirerapids -mtune=sapphirerapids for the compiler flags, because SR supports many other AVX512 instruction extensions that are not currently listed among compiler flags

Signed-off-by: Mulugeta Mammo <[email protected]>

mengdilin

Back from PTO. LGTM, can you resolve the conflict so I can import the PR. For the unit test, can you relax the threshold instead of commenting it out? If somehow you cannot relax this threshold, you can skip this unittest when in SR mode similar to

faiss/faiss/gpu/test/test_cagra.py

Line 13 in 697b6dd

@unittest.skipIf(

(ideally we don't have to do this)

mengdilin · 2024-12-03T18:43:36Z

faiss/CMakeLists.txt

+endif()
+if(NOT WIN32)
+  # Architecture mode to support AVX512 extensions available since Intel(R) Sapphire Rapids.
+  # Ref: https://networkbuilders.intel.com/solutionslibrary/intel-avx-512-fp16-instruction-set-for-intel-xeon-processor-based-products-technology-guide


thanks for the ref!

mengdilin · 2024-12-03T18:45:16Z

tests/test_contrib.py

@@ -568,7 +569,7 @@ def test_ivf_train_2level(self):
        # normally 47 / 200 differences
        ndiff = (Iref != Inew).sum()
        self.assertLess(ndiff, 51)


is there away to relax the threshold such that this test passes in SR mode as well?

Add a new architecture mode, 'avx512-sr', to support latest additions…

87002d4

… to AVX512. Signed-off-by: Mulugeta Mammo <[email protected]>

facebook-github-bot added the CLA Signed label Nov 12, 2024

mulugetam mentioned this pull request Nov 12, 2024

Use _mm512_popcnt_epi64 to speedup hamming distance evaluation. #4020

Open

Remove unnecessary space.

4899800

Signed-off-by: Mulugeta Mammo <[email protected]>

Merge branch 'main' into avx512-sr

67e6291

mengdilin reviewed Nov 12, 2024

View reviewed changes

mulugetam added 2 commits November 12, 2024 20:09

Fix a typo in workflows/build.yml.

7a46de7

Signed-off-by: Mulugeta Mammo <[email protected]>

Merge branch 'avx512-sr' of https://github.com/mulugetam/faiss into a…

7c48043

…vx512-sr

gtwang01 added the install label Nov 12, 2024

mengdilin reviewed Nov 12, 2024

View reviewed changes

mulugetam added 2 commits November 12, 2024 23:05

Remove avx512-sr mode from conda.

1770b8c

Signed-off-by: Mulugeta Mammo <[email protected]>

Comment out test_ivf_train_2level in faiss/tests/test_contrib.py.

64daa63

Signed-off-by: Mulugeta Mammo <[email protected]>

mulugetam added 2 commits November 13, 2024 21:30

Use sapphirerapids for -march and -mtune.

5043d08

Signed-off-by: Mulugeta Mammo <[email protected]>

Remove unnecessary spaces.

0bcbbd9

Signed-off-by: Mulugeta Mammo <[email protected]>

mengdilin reviewed Dec 3, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a new architecture mode: 'avx512-sr'. #4025

Add a new architecture mode: 'avx512-sr'. #4025

mulugetam commented Nov 12, 2024

mengdilin commented Nov 12, 2024

mulugetam commented Nov 12, 2024

mengdilin commented Nov 12, 2024

mengdilin Nov 12, 2024

mulugetam commented Nov 12, 2024

mengdilin commented Nov 12, 2024

mengdilin Nov 12, 2024

mulugetam Nov 13, 2024

mulugetam commented Nov 13, 2024

alexanderguzhva commented Nov 13, 2024

mengdilin left a comment

mengdilin Dec 3, 2024

mengdilin Dec 3, 2024

Add a new architecture mode: 'avx512-sr'. #4025

Are you sure you want to change the base?

Add a new architecture mode: 'avx512-sr'. #4025

Conversation

mulugetam commented Nov 12, 2024

mengdilin commented Nov 12, 2024

mulugetam commented Nov 12, 2024

mengdilin commented Nov 12, 2024

mengdilin Nov 12, 2024

Choose a reason for hiding this comment

mulugetam commented Nov 12, 2024

mengdilin commented Nov 12, 2024

mengdilin Nov 12, 2024

Choose a reason for hiding this comment

mulugetam Nov 13, 2024

Choose a reason for hiding this comment

mulugetam commented Nov 13, 2024

alexanderguzhva commented Nov 13, 2024

mengdilin left a comment

Choose a reason for hiding this comment

mengdilin Dec 3, 2024

Choose a reason for hiding this comment

mengdilin Dec 3, 2024

Choose a reason for hiding this comment