Draft of ORT GPU build #5622
base: master
Conversation
Ping @davidrohr
You should also pick up a few environment variables that we use in o2.sh and handle them analogously: https://github.com/alisw/alidist/blob/1916f6d88d42959097998d9481b517dc1c1ea84d/o2.sh#L191C9-L191C30
- ALIBUILD_O2_FORCE_GPU
- DISABLE_GPU
- ALIBUILD_ENABLE_CUDA
- ALIBUILD_ENABLE_HIP
- ALIBUILD_O2_OVERRIDE_HIP_ARCHS
- ALIBUILD_O2_OVERRIDE_CUDA_ARCHS
If ENABLE_CUDA or ENABLE_HIP is set, the build should fail when it cannot build CUDA/HIP.
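The requested fail-fast behavior could be sketched like this; the helper function and its wiring are hypothetical, not from the PR, and only the variable names come from the comment above:

```shell
# Hypothetical helper: fail the build when a GPU backend was explicitly
# requested via an ALIBUILD_ENABLE_* variable but its toolchain is missing.
require_tool_if_enabled() {
  # $1: value of the ALIBUILD_ENABLE_* variable, $2: compiler to probe for
  if [ -n "$1" ] && ! command -v "$2" >/dev/null 2>&1; then
    echo "GPU backend requested but $2 not found" >&2
    return 1
  fi
  return 0
}

require_tool_if_enabled "${ALIBUILD_ENABLE_CUDA:-}" nvcc || exit 1
require_tool_if_enabled "${ALIBUILD_ENABLE_HIP:-}" hipcc || exit 1
```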
onnxruntime.sh
Outdated
" | ||
elif command -v nvcc >/dev/null 2>&1; then | ||
CUDA_VERSION=$(nvcc --version | grep "release" | awk '{print $NF}' | cut -d. -f1) | ||
if [[ "$CUDA_VERSION" == "V11" ]]; then |
I think you can drop CUDA 11 and only assume >= 12.
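A minimal sketch of that suggestion, reusing the version-string format produced by the `nvcc --version` parsing in the quoted snippet: accept any major release >= 12 instead of matching "V11"/"V12" literally.

```shell
# Sketch (assumption, not the PR's code): check the CUDA major release
# numerically instead of comparing against fixed "V11"/"V12" strings.
cuda_major_ok() {
  # $1: release string such as "V12.4.131"
  major=${1#V}       # drop the leading "V"
  major=${major%%.*} # keep only the major component
  [ "$major" -ge 12 ]
}
```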
ORT_BUILD_FLAGS="" | ||
case $ARCHITECTURE in | ||
osx_*) | ||
if [[ $ARCHITECTURE == *_x86-64 ]]; then |
I would leave out printouts like these; they are mainly for debugging.
Yes, but I assume there is also a macOS build that targets the Mac GPU. I need to look around a bit more; the if block could then be used to put the build flags in there. But yes, of course I will remove the printouts in the end.
fi
;;
*)
if command -v rocminfo >/dev/null 2>&1; then
- The ROCm version check is missing.
- It is not clear whether rocminfo is on the PATH. You should at least also test
/opt/rocm/bin/rocminfo
Furthermore, migraphx is a separate ROCm package, so the presence of rocminfo does not mean that migraphx is available. You should test for migraphx explicitly.
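The suggested detection could look roughly like this; the overridable prefix argument (defaulting to the usual /opt/rocm) and the migraphx header probe are assumptions about the package layout, not code from the PR:

```shell
# Sketch: find rocminfo either on the PATH or under the ROCm prefix.
find_rocminfo() {
  prefix=${1:-/opt/rocm}
  if command -v rocminfo >/dev/null 2>&1; then
    command -v rocminfo
  elif [ -x "$prefix/bin/rocminfo" ]; then
    echo "$prefix/bin/rocminfo"
  else
    return 1
  fi
}

# Sketch: migraphx is packaged separately from the base ROCm runtime, so
# probe for it explicitly; checking its include directory is an assumption.
have_migraphx() {
  prefix=${1:-/opt/rocm}
  [ -e "$prefix/include/migraphx" ]
}
```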
Good point, I will check that again.
onnxruntime.sh
Outdated
ORT_BUILD_FLAGS=" -Donnxruntime_USE_CUDA=ON \ | ||
-DCUDA_TOOLKIT_ROOT_DIR=$CUDA_ROOT \ | ||
-Donnxruntime_USE_CUDA_NHWC_OPS=ON \ | ||
-Donnxruntime_CUDA_USE_TENSORRT=ON \ |
If you use TensorRT, do you then have to check that it is explicitly installed? Or does it always come with the CUDA SDK?
It does not seem to come along automatically (https://docs.nvidia.com/deeplearning/tensorrt/install-guide/index.html)... OK, I will add a check for that as well.
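Such a check could be sketched as below; the candidate install paths and the TENSORRT_ROOT variable are assumptions (only the NvInfer.h header name is standard TensorRT), so this is a guess at what the eventual check might probe:

```shell
# Sketch: TensorRT is installed separately from the CUDA SDK, so look for
# its main header before turning on -Donnxruntime_CUDA_USE_TENSORRT.
have_tensorrt() {
  for dir in "${TENSORRT_ROOT:-}" /usr /usr/local/tensorrt; do
    if [ -n "$dir" ] && [ -e "$dir/include/NvInfer.h" ]; then
      return 0
    fi
  done
  return 1
}
```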
onnxruntime.sh
Outdated
-Donnxruntime_USE_CUDA_NHWC_OPS=ON \
-Donnxruntime_CUDA_USE_TENSORRT=ON \
"
elif [[ "$CUDA_VERSION" == "V12" ]]; then
What if both ROCm and CUDA are available? Can we not build both then?
No, they cannot be built in parallel; only one of the two is possible at a time: https://github.com/microsoft/onnxruntime/blob/afd642a194b39138ad891e7bb2c8bca26d37b785/cmake/CMakeLists.txt#L288-L290
Exactly...
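Given that mutual exclusion, the recipe has to settle on exactly one backend when both toolchains are detected. A sketch of that selection; preferring CUDA here is an arbitrary choice for illustration, not something decided in this thread:

```shell
# Sketch: pick at most one GPU backend, since onnxruntime's CMake refuses
# to enable CUDA and HIP together.
pick_gpu_backend() {
  # $1: "yes" if CUDA was detected, $2: "yes" if ROCm was detected
  if [ "$1" = yes ]; then
    echo cuda   # arbitrary preference when both backends are present
  elif [ "$2" = yes ]; then
    echo rocm
  else
    echo none
  fi
}
```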
…adding env variables to enable the GPU during code execution. For the al9_gpu container and a simultaneous CUDA & ROCm build, this requires ChSonnabend/onnxruntime@6ffc40c
This is a draft PR to discuss possible changes to onnxruntime.sh for GPU builds on the EPNs and potentially CUDA (to be tested).