SYCL: Avoid using SYCL-Graph for unsupported nodes #13587

EwanC · 2025-05-16T11:08:02Z

Currently on a CUDA backend to SYCL when running
GGML_SYCL_DISABLE_GRAPH=0 ./bin/test-backend-ops -b SYCL0 there are two operations that throw an exception from the blocking waits during queue recording.

-o CONCAT : Use of blocking waits on a queue that's being recorded in ggml_sycl_op_concat.
-o MUL_MAT_ID: Blocking wait on a recording queue for a copy to host memory in ggml_sycl_mul_mat_id.

We've noticed that ggml-cuda.cu has the
check_node_graph_compatibility_and_refresh_copy_ops method for checking if a graph can be used, even if enabled. I've taken a similar approach in this PR by adding a method to ggml-sycl.cpp for checking if a graph can be used for the operations even if a user has asked for it to be enabled.

Currently on a CUDA backend to SYCL when running `GGML_SYCL_DISABLE_GRAPH=0 ./bin/test-backend-ops -b SYCL0` there are two operations that throw an exception from the blocking waits during queue recording. * `-o CONCAT` : Use of blocking waits on a queue that's being recorded https://github.com/ggml-org/llama.cpp/blob/master/ggml/src/ggml-sycl/concat.cpp#L185-L187 * `-o MUL_MAT_ID`: Blocking wait on a recording queue for a copy to host memory https://github.com/ggml-org/llama.cpp/blob/master/ggml/src/ggml-sycl/ggml-sycl.cpp#L3072-L3074 We've noticed that `ggml-cuda.cu` has the [check_node_graph_compatibility_and_refresh_copy_ops](https://github.com/ggml-org/llama.cpp/blob/39e73ae0d69f882d7e29cecc6dd8f5052fca6731/ggml/src/ggml-cuda/ggml-cuda.cu#L2458-L2458) method for checking if a graph can be used, even if enabled. I've taken a similar approach in this PR by adding a method to `ggml-sycl.cpp` for checking if a graph can be used for the operations even if a user has asked for it to be enabled.

Rbiessy · 2025-05-19T09:21:01Z

ggml/src/ggml-sycl/ggml-sycl.cpp

+#    ifndef NDEBUG
+                GGML_LOG_DEBUG("%s: disabling SYCL graphs due to unsupported node type\n", __func__);
+#    endif


Nit but maybe worth having this as GGML_LOG_INFO and printing which node type is unsupported?

Rbiessy · 2025-05-19T09:30:28Z

ggml/src/ggml-sycl/ggml-sycl.cpp

@@ -3810,11 +3810,38 @@ static void ggml_backend_sycl_graph_compute_impl(ggml_backend_sycl_context * syc
    }
 }

+#ifdef GGML_SYCL_GRAPH
+static bool check_node_graph_compatibility(ggml_cgraph * cgraph) {


CUDA checks if one of the buffer is split, meaning multiple devices are used, to disable the graph. I assume you don't test that, may be safe to disable it as well? Or do you expect SYCL-Graph would work well in such a case?
I'd suggest just checking if ggml_sycl_info().device_count > 1 to disable SYCL-Graph.

github-actions bot added ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels May 16, 2025

EwanC changed the title ~~SYCL: Avoid using with SYCL-Graph for unsupported nodes~~ SYCL: Avoid using SYCL-Graph for unsupported nodes May 16, 2025

EwanC mentioned this pull request May 16, 2025

SYCL: Fix test-backend-ops crashes with SYCL-Graph #13357

Closed

EwanC marked this pull request as ready for review May 16, 2025 12:06

Rbiessy reviewed May 19, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SYCL: Avoid using SYCL-Graph for unsupported nodes #13587

SYCL: Avoid using SYCL-Graph for unsupported nodes #13587

EwanC commented May 16, 2025

Rbiessy May 19, 2025

Rbiessy May 19, 2025

SYCL: Avoid using SYCL-Graph for unsupported nodes #13587

Are you sure you want to change the base?

SYCL: Avoid using SYCL-Graph for unsupported nodes #13587

Conversation

EwanC commented May 16, 2025

Rbiessy May 19, 2025

Choose a reason for hiding this comment

Rbiessy May 19, 2025

Choose a reason for hiding this comment