Indices Operator #1735
base: main
Conversation
Codecov Report

Attention: Patch coverage is

```
@@           Coverage Diff           @@
##             main    #1735   +/-  ##
=======================================
  Coverage   86.54%   86.54%
=======================================
  Files         699      700      +1
  Lines       83223    83255     +32
=======================================
+ Hits        72025    72056     +31
- Misses      11198    11199      +1
```
Thanks for working on this 🙂
Implementation looks good, some minor comments on form.
@@ -285,6 +285,7 @@ Those operations are only available for `Int` tensors.

| ------------------------------------------------ | ------------------------------------------------------- |
| `tensor.arange(5..10, device)`                   | `tensor.arange(start=5, end=10, device=device)`          |
| `tensor.arange_step(5..10, 2, device)`           | `tensor.arange(start=5, end=10, step=2, device=device)`  |
| `tensor.indices(shape, device)`                  | `torch.meshgrid(tensors)`                                |
Two notes:

- Usage should be `Tensor::indices(shape, device)`, like `Tensor::cat` or `Tensor::empty`.
- Given the implementation here, there is no 1-to-1 equivalent in torch, since `meshgrid` takes tensors as input rather than the shape of the desired grid. In this case it is much closer to `numpy.indices`, but with the indexing in cartesian space. The actual torch equivalent for the 2D example in your unit tests would be:

```python
yv, xv = torch.meshgrid([torch.arange(2), torch.arange(2)], indexing='xy')
grid = torch.stack((xv, yv), 2)
```

So in this case, I'm not sure we need to provide the comparison in the table.
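To make the ordering difference concrete, here is a dependency-free Rust sketch of the semantics the reviewer describes (the function name and the assumption that the 2D unit test stores `(x, y) = (col, row)` per cell are illustrative, matching the torch `indexing='xy'` snippet above, not burn's actual API):

```rust
// Sketch of a cartesian-space index grid: position (row, col) stores its
// coordinates in (x, y) = (col, row) order, unlike numpy.indices-style
// (i, j) = (row, col) ordering.
fn cartesian_grid_2d(rows: usize, cols: usize) -> Vec<Vec<[usize; 2]>> {
    (0..rows)
        .map(|r| (0..cols).map(|c| [c, r]).collect())
        .collect()
}

fn main() {
    let grid = cartesian_grid_2d(2, 2);
    // Same values as stacking torch.meshgrid(..., indexing='xy') on dim 2.
    assert_eq!(grid, vec![vec![[0, 0], [1, 0]], vec![[0, 1], [1, 1]]]);
    println!("{:?}", grid);
}
```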
```rust
/// Produces an indices tensor for the given shape and device.
/// The resulting tensor contains coordinates corresponding to each element in the shape at dimension D.
///
/// # Arguments
///
/// * `shape` - The shape specifying the dimensions of the tensor.
/// * `device` - The device to create the tensor on.
///
/// # Panics
///
/// Panics if `D2` is not equal to `D+1`.
///
/// # Examples
///
/// ```rust
/// use burn_tensor::Int;
/// use burn_tensor::{backend::Backend, Shape, Tensor};
/// fn example<B: Backend>() {
///     let device = Default::default();
///     let result = Tensor::<B, 2, Int>::indices::<3>(Shape { dims: [2, 3] }, &device);
///     println!("{}", result);
/// }
/// ```
pub fn indices<const D2: usize>(shape: Shape<D>, device: &B::Device) -> Tensor<B, D2, Int> {
```
Usually indices are produced to be used with a matrix/tensor (i.e., indexing in `i, j` like `np.indices`), but I think the intention with this method is to produce a grid in cartesian space. With that in mind, I would make this a bit more explicit in the method's doc and rename the method to something like `nd_grid`, `meshgrid`, or the even more explicit `cartesian_grid` (also open to suggestions).
`cartesian_grid` seems to be a good name!
Instead of forcing users to pass a `Shape`, we could have the API like this:

```rust
pub fn indices<S: Into<Shape<D>>, const D2: usize>(shape: S, device: &B::Device) -> Tensor<B, D2, Int>
```

That way an array could be provided by the user and it would just work.
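A minimal self-contained sketch of the `Into<Shape>` pattern being suggested (toy `Shape` and `indices` stand-ins, not burn's real types):

```rust
// Toy Shape type standing in for burn's Shape<D>.
#[derive(Debug, PartialEq)]
struct Shape<const D: usize> {
    dims: [usize; D],
}

// Letting arrays convert into Shape is what makes the ergonomic API work.
impl<const D: usize> From<[usize; D]> for Shape<D> {
    fn from(dims: [usize; D]) -> Self {
        Shape { dims }
    }
}

// Hypothetical op signature: accepts anything convertible into a Shape.
fn indices<S: Into<Shape<2>>>(shape: S) -> Shape<2> {
    shape.into() // a real op would build the index tensor from this shape
}

fn main() {
    // Callers can now pass a plain array instead of constructing a Shape.
    assert_eq!(indices([2, 3]), Shape { dims: [2, 3] });
    // An explicit Shape still works via the reflexive Into impl.
    assert_eq!(indices(Shape { dims: [4, 5] }), Shape { dims: [4, 5] });
}
```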
@nathanielsimard what do you think about the naming for this method? (see my comments for possible suggestions)
```rust
pub fn indices<const D2: usize>(shape: Shape<D>, device: &B::Device) -> Tensor<B, D2, Int> {
```
I think we should move this method to the backend API so that backends can optimize it, since calling `arange` and `repeat` many times can be very expensive for big matrices. We should keep a default implementation in the backend definition.
@McArthur-Alford if you're not sure what that means, see for example the `narrow` op: it is defined with a default implementation but also overridden by some backends.
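The default-implementation-plus-override pattern referred to here can be sketched in a few lines (trait and function names are illustrative, not burn's real backend trait):

```rust
// A backend trait ships a default implementation of the op...
trait Backend {
    fn grid_elem_count(rows: usize, cols: usize) -> usize {
        // Default: naive per-cell counting, analogous to composing
        // many small ops like arange + repeat.
        let mut n = 0;
        for _ in 0..rows {
            for _ in 0..cols {
                n += 2; // two coordinates (x, y) per cell
            }
        }
        n
    }
}

// ...a generic backend can simply inherit it...
struct Generic;
impl Backend for Generic {}

// ...while a specific backend overrides it, e.g. with a closed form
// standing in for a fused kernel.
struct Optimized;
impl Backend for Optimized {
    fn grid_elem_count(rows: usize, cols: usize) -> usize {
        rows * cols * 2
    }
}

fn main() {
    // Both implementations must agree on the result.
    assert_eq!(Generic::grid_elem_count(2, 3), Optimized::grid_elem_count(2, 3));
    println!("both return {}", Generic::grid_elem_count(2, 3));
}
```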
`cartesian_grid` sounds good to me. I'll get started on moving it to a backend op.
Indices Operator for Int Tensors
Checklist
The `run-checks all` script has been executed.

Related Issues/PRs
None
Changes
Added an `indices` function for `Int` tensors. This is similar to PyTorch's `meshgrid` or NumPy's `indices` functions, though with a slightly different arrangement: for example, `Tensor::<B, 2, Int>::indices::<3>(Shape { dims: [2, 3] }, &device)` returns a rank-3 tensor of coordinates.

Testing
Added a basic but functional test to make sure `indices` produces the expected outputs for some typical inputs.