
Allow linalg.lstsq to use SVD to compute the result for rank-deficient matrices #126652

Open

ZelboK wants to merge 3 commits into main
Conversation

@ZelboK (Contributor) commented May 19, 2024

Fixes #117122

This PR adds the logic so that, in the case of rank-deficient matrices, linalg.lstsq can fall back to an SVD backend for batched mode.
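For context, here is a minimal sketch of the case this targets (the values are illustrative; per the docs, the only CUDA driver today is gels, which assumes the input is full rank):

import torch

# A is rank deficient: the second column is twice the first.
A = torch.tensor([[1.0, 2.0],
                  [2.0, 4.0],
                  [3.0, 6.0]], device='cuda')
B = torch.tensor([[1.0], [2.0], [3.0]], device='cuda')

# With the SVD fallback, this should return the minimum-norm
# least-squares solution rather than erroring or returning an
# incorrect result.
X = torch.linalg.lstsq(A, B).solution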

I apologize for the previous PR... I messed up a rebase and it ended up showing a million changes.

cc @lezcano

pytorch-bot bot commented May 19, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/126652

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit fd07d5e with merge base 853081a:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the release notes: linalg_frontend label May 19, 2024
solution.set_(solution.storage(), solution_view.storage_offset(),
solution_view.sizes(), solution_view.strides());
} else {
solution = at::zeros({solution.size(-1), n}, solution.options());
Collaborator commented:

what is going on here??

@ZelboK (Contributor, Author) commented May 20, 2024

You're referring to everything inside the else clause, correct?

I found that with a tensor A that has more rows than columns:

import torch

# Tensor A with shape (4, 2)
A = torch.tensor([[1.0, 2.0],
                  [3.0, 4.0],
                  [5.0, 6.0],
                  [7.0, 8.0]], device='cuda')

# Create tensor B with shape (4, 1)
B = torch.tensor([[1.0],
                  [2.0],
                  [3.0],
                  [4.0]], device='cuda')

X_lstsq = torch.linalg.lstsq(A, B, driver='gelss').solution

would lead to
RuntimeError: start (2) + length (2) exceeds dimension size (2).

Is this incorrect? I'm refreshing my linear algebra here and I might not have the correct understanding.

def svd_lstsq(AA, BB, tol=1e-5):
    # Minimum-norm least-squares solution via the pseudoinverse:
    # X = V @ S^+ @ U^H @ B, zeroing out singular values below tol.
    U, S, Vh = torch.linalg.svd(AA, full_matrices=False)
    Spinv = torch.zeros_like(S)
    Spinv[S > tol] = 1 / S[S > tol]
    UhBB = U.adjoint() @ BB
    if Spinv.ndim != UhBB.ndim:
        Spinv = Spinv.unsqueeze(-1)
    SpinvUhBB = Spinv * UhBB
    return Vh.adjoint() @ SpinvUhBB

X_svd = svd_lstsq(A, B)

This, for example, will not throw an error with the same tensors.
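As a sanity check (just a sketch, using the CPU gelsd driver as the reference), the SVD route matches the rank-revealing CPU result:

X_cpu = torch.linalg.lstsq(A.cpu(), B.cpu(), driver='gelsd').solution
print(torch.allclose(X_svd.cpu(), X_cpu, atol=1e-5))  # expected: True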

Also, I should have clarified this earlier. Sorry.

Collaborator commented:

I don't see why you should allocate a new tensor when you already have a solution allocated in the else path. And why a tensor of zeros?

@ZelboK (Contributor, Author) commented May 21, 2024

Ah, you're right, I shouldn't allocate a new tensor.
As for the zeros: in hindsight, it makes no sense to zero the solution out, as that's not the correct behavior (this system should have a solution, right?). An exception at least tells the user what happened, whereas this is just silent UB. How do we handle this case, though? Does solution need to be reshaped before using set_, or something similar, in the else path?

Since I'm still new, I'm curious whether this is out of scope for this PR. The exception occurs in general whenever solution.size(-2) < n. I don't mind doing it in this PR since it is small (and it's a better use of GitHub runners than two split PRs).

@drisspg drisspg added the triaged label May 20, 2024
if (input.numel() == 0) {
auto output_shape = input.sizes().vec();
output_shape.back() = other.size(-1);
rank.zero_();
@ZelboK (Contributor, Author) commented:

rank is required later on; zeroing it here fixes the integer overflow that occurred when toInt() was called, because rank was never set to anything in the empty-input path.
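A sketch of the empty-input case this hunk guards (shapes are illustrative; rank is only populated by rank-revealing drivers such as the CPU default, gelsy):

import torch

# Zero-row system: the backend never writes to rank, so it must be
# zeroed explicitly before .item()/toInt() reads it.
A = torch.empty(0, 3)
B = torch.empty(0, 2)
out = torch.linalg.lstsq(A, B)
print(out.rank)  # expected: tensor(0) with this fix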

Labels
open source · release notes: linalg_frontend · triaged
Development

Successfully merging this pull request may close these issues.

Improve behaviour of torch.linalg.lstsq on CUDA GPU for rank deficient matrices