
[OUTDATED!, Autograd] Cond Higher-Order Operation #126007

Draft · wants to merge 1 commit into base: main

Conversation

@bohnstingl (Contributor) commented May 11, 2024

This PR adds autograd support for the cond higher-order operation.
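For reference, a minimal sketch of the intended behavior, mirroring the `cond(pred, true_fn, false_fn, [x])` call shape used in the tests below (the import path and the eager backward behavior are assumptions about this PR, not established API):

```python
import torch
from torch._higher_order_ops.cond import cond


def true_fn(x):
    return x.sin()


def false_fn(x):
    return x.cos()


x = torch.randn(4, requires_grad=True)
pred = torch.tensor(True)

# Forward runs the branch selected by pred.
result = cond(pred, true_fn, false_fn, [x])

# With autograd support, backward differentiates through the taken branch:
# here d/dx sin(x) = cos(x).
result.sum().backward()
assert torch.allclose(x.grad, torch.cos(x))
```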

@ydwu4

pytorch-bot bot commented May 11, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/126007

Note: Links to docs will display an error until the docs builds have been completed.

❌ 40 New Failures, 5 Unrelated Failures

As of commit b5a2b34 with merge base ee804d2:

NEW FAILURES - The following jobs have failed:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@ydwu4 (Contributor) left a comment:

Overall looks good. You should probably fix the existing test failures first and add more tests for the following cases:

  • More scenarios for autograd beyond a simple function, e.g. (1) map + cond, (2) cond + nn modules (with parameters and buffers) and closures, (3) nested cond autograd, and others (use your imagination to come up with more); see the sketch after this list for one nested example.

  • Tracing the forward and backward graphs with make_fx (with different tracing modes) for the cases listed in the first bullet; also use assertExpectedInline tests to make sure the produced forward and backward graphs are correct (we have examples for map).
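As a concrete illustration of the nested case, a rough sketch of such a test (the method name, branch functions, and assertions are placeholders, assuming the usual cond TestCase context, not code from this PR):

```python
def test_cond_autograd_nested(self):
    def inner_true(x):
        return x.sin()

    def inner_false(x):
        return x * x

    def true_fn(x):
        # Nested cond with a data-dependent predicate.
        return cond(x.sum() > 0, inner_true, inner_false, [x])

    def false_fn(x):
        return x.cos()

    x = torch.randn(4, requires_grad=True)
    pred = torch.tensor(True)

    result = cond(pred, true_fn, false_fn, [x])
    (grad,) = torch.autograd.grad(result.sum(), (x,))

    # Reference: run the selected branch eagerly and compare gradients.
    (expected,) = torch.autograd.grad(true_fn(x).sum(), (x,))
    self.assertEqual(grad, expected)
```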

@@ -57,6 +57,8 @@ if(CMAKE_VERSION VERSION_GREATER_EQUAL 3.12.0)
endif()

find_package(CUDAToolkit REQUIRED)
add_library(CUDA::nvToolsExt INTERFACE IMPORTED)

Why do we want to change this?

from torch._higher_order_ops.map import ( # noqa: F401
_stack_pytree,
_unstack_pytree,
map,
)
from torch._higher_order_ops.cond import ( # noqa: F401

Is this change necessary?

result = cond(pred, true_fn, false_fn, [x])
self.assertEqual(result, torch.cos(x))

grad_out = torch.ones_like(result)

Can you also use assertExpectedInline to show the forward and backward graphs, similar to what we did for map?
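For example, a sketch of what such checks could look like inside a test method (the traced functions are placeholders, and the expected graph strings below are stubs to be regenerated, e.g. with EXPECTTEST_ACCEPT=1):

```python
# Placeholder forward and forward+backward functions built around cond.
def fwd(pred, x):
    return cond(pred, torch.sin, torch.cos, [x])

def fwd_bwd(pred, x):
    out = cond(pred, torch.sin, torch.cos, [x])
    (grad_x,) = torch.autograd.grad(out.sum(), (x,))
    return grad_x

pred = torch.tensor(True)
x = torch.randn(4, requires_grad=True)

fw_graph = make_fx(fwd)(pred, x)
self.assertExpectedInline(
    fw_graph.code.strip(),
    """\
def forward(self, pred_1, x_1):
    ...""",  # stub: replace with the real expected forward graph
)

fw_bw_graph = make_fx(fwd_bwd)(pred, x)
self.assertExpectedInline(
    fw_bw_graph.code.strip(),
    """\
def forward(self, pred_1, x_1):
    ...""",  # stub: replace with the real expected joint/backward graph
)
```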

    def test_cond_make_fx_preserve_stack_trace_for_nodes_in_subgraph(self):
        def true_fn(x):
            return x + x.cos()
    # def test_cond_make_fx_preserve_stack_trace_for_nodes_in_subgraph(self):

Does this test start to fail because of the change? Why is that?

@@ -1912,7 +1912,8 @@ def fn(model: Callable):

from torch import export as export

from torch._higher_order_ops import cond
# from torch._higher_order_ops import cond

What happens here?

num_mapped_args = len(operands)

unwrapped_mapped_operands = pytree.tree_map(_from_fun, operands)
# example_operands = _unstack_pytree(unwrapped_mapped_operands)[0]

Yeah, I don't think we need _unstack_pytree, because for cond the operands are already flattened by dynamo (i.e., they become a sequence of tensors) by the time we enter the autograd key of cond.
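A self-contained sketch of the point (branch functions and operand shapes are made up for illustration): because cond's operands are already a flat sequence of tensors with no stacked/mapped dimension, each branch can be traced on them directly, with no _unstack_pytree step.

```python
import torch
from torch.fx.experimental.proxy_tensor import make_fx


def true_fn(x):
    return x.sin()


def false_fn(x):
    return x.cos()


# Flat tuple of tensors, as dynamo provides the operands to cond.
operands = (torch.randn(3, 4),)

fw_true_graph = make_fx(true_fn)(*operands)
fw_false_graph = make_fx(false_fn)(*operands)
```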

fw_true_graph = make_fx(true_fn)(*example_operands)
fw_false_graph = make_fx(false_fn)(*example_operands)

def joint_f(fn, *joint_mapped_args):

I didn't look too carefully into the implementation of these. Can you add more tests to check whether the true graph and false graph create the correct backward graphs?
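For example, a sketch of a test that exercises the backward of both branches (hypothetical name, assuming the cond TestCase context; not code from this PR):

```python
def test_cond_autograd_true_and_false_branch(self):
    def true_fn(x):
        return x.sin()

    def false_fn(x):
        return x.cos()

    x = torch.randn(4, requires_grad=True)
    for pred_val, ref_fn in [(True, true_fn), (False, false_fn)]:
        pred = torch.tensor(pred_val)
        result = cond(pred, true_fn, false_fn, [x])
        (grad,) = torch.autograd.grad(result.sum(), (x,))

        # Reference gradient from running the selected branch directly.
        (expected,) = torch.autograd.grad(ref_fn(x).sum(), (x,))
        self.assertEqual(grad, expected)
```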

@@ -256,6 +256,23 @@ def false_fn(x):
pred = torch.tensor(False, device="cuda")
result = cond(pred, true_fn, false_fn, [x])
self.assertEqual(result, torch.cos(x))

def test_cond_autograd_simple(self):

Can we also add opinfo tests for cond here: https://github.com/pytorch/pytorch/blob/main/torch/testing/_internal/hop_db.py, following the map autograd tests?
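A rough sketch of what a cond entry for hop_db.py could look like, modeled on the existing map entries; the kwargs, sample shapes, and the simple_cond wrapper are assumptions and may need adjustment:

```python
import functools

import torch
from torch._higher_order_ops.cond import cond
from torch.testing import make_tensor
from torch.testing._internal.common_dtype import all_types_and
from torch.testing._internal.opinfo.core import OpInfo, SampleInput


def sample_inputs_cond(opinfo, device, dtype, requires_grad, **kwargs):
    make_arg = functools.partial(
        make_tensor, device=device, dtype=dtype, requires_grad=requires_grad
    )
    yield SampleInput(make_arg(2, 2, low=0.1, high=2))


def simple_cond(x):
    return cond(x.sum() > 2.0, lambda x: x.cos(), lambda x: x.sin(), [x])


# Entry to be appended to the hop_db list; flags mirror the map entries.
cond_opinfo = OpInfo(
    "cond",
    op=simple_cond,
    sample_inputs_func=sample_inputs_cond,
    dtypes=all_types_and(torch.bool, torch.half),
    supports_out=False,
    check_batched_grad=False,
    check_batched_gradgrad=False,
    check_batched_forward_grad=False,
    check_inplace_batched_forward_grad=False,
)
```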

@bohnstingl changed the title from "[Autograd] Cond Higher-Order Operation" to "[OUTDATED!, Autograd] Cond Higher-Order Operation" on May 22, 2024.