[WIP] cholesky #44

willtebbutt · 2019-09-06T14:08:36Z

A first attempt at implementing the cholesky factorisation for Kronecker matrices, will close #10 if completed. Attempting to implement this has highlighted a few aspects of the codebase that we really should sort out:

GeneralizedKroneckerProduct <: AbstractMatrix{Number} - this was bad causes type stability issues. If you don't parametrise GeneralisedKroneckerProduct{T}, you wind up constructing things like Cholesky{Number, ...}. This required quite a few changes throughout the codebase. Seems to work fine (tests pass), but of course I might have missed something. This also seems to resolve an issue whereby attempting to display UpperTriangular(d) for some KroneckerProduct d threw an error.
We're not doing a good job of handling UpperTriangular and LowerTriangular. For Cholesky stuff to really work, we need this to work. The approach I've taken here is to cheat and make UpperTriangular of an AbstractKroneckerProduct return an AbstractKroneckerProduct of UpperTriangular matrices. While this makes sense mathematically I'm not sure that it's a great move semantically. It's also been causing some stack overflow errors that I've not managed to track down.

I also took the (highly opinionated) liberty of renaming the files in test so that they don't begin with the word test - this word being redundant as they're in the test folder. Please feel free to suggest that I revert this :)

Let me know what you think :)

MichielStock · 2019-09-10T12:56:40Z

Thanks for pointing this out. @Beramos and I tried to improve it at the hackathon at Juliacon, but I think we did not 100% got the hang of it. General parametric type is a good idea, because ideally Kronecker.jl should work with other stuff than numbers, such as strings.
This should work I think and is the approach I would have taken. I do worry that the LowerTriangular of a Kronecker product is not the Kronecker product of the LT of the individual matrices. The latter itself is lower triangular though (even sparser).

I will try out your branch and see I can resolve the errors.

src/cholesky.jl

MichielStock · 2019-09-10T14:43:17Z

src/cholesky.jl

+    return istriu(A) && istriu(B)
+end
+
+function getproperty(C::KroneckerCholesky, d::Symbol)


I fixed an error here by changing it into

function getproperty(C::KroneckerCholesky, d::Symbol) Cfactors = getfield(C, :factors) Cuplo = getfield(C, :uplo) info = getfield(C, :info) Cuplo != 'U' && throw(NotImplementedError("")) if d == :U return UpperTriangular(Cfactors) elseif d == :L return LowerTriangular(copy(Cfactors')) #return LowerTriangular(Cuplo === char_uplo(d) ? Cfactors : copy(Cfactors')) elseif d == :UL return (Cuplo === 'U' ? UpperTriangular(Cfactors) : LowerTriangular(Cfactors)) else return getfield(C, d) end end

I also needed my own version of copy which provides shallow copies of Kronecker products. This gives the right answer.

MichielStock · 2019-09-10T14:46:57Z

src/cholesky.jl

+    return size(B, 1) * logdet_A + size(A, 1) * logdet_B
+end
+
+function \(C::KroneckerCholesky, x::AbstractVecOrMat)


I am not sure this is correct. If I understand it correctly, the main performance of Cholesky decomposition originates from the fact that it is easy to solve a triangular system. The vec trick kind of destroys this structure...

Hmm. I admit that I hadn't really thought very hard about this when writing it, but I'm pretty sure that you can use the vec trick to compute backsolves. Note that

vec(inv(L2) X inv(L1)) = (inv(L1) ⊗ inv(L2)) vec(X) = inv(L1 ⊗ L2) vec(X)

so we should be able to implement backsolving with kronecker matrices efficiently provided that the individual backsolves are efficient.

So you would leave it as is for the moment?

willtebbutt · 2019-09-11T10:45:47Z

This should work I think and is the approach I would have taken. I do worry that the LowerTriangular of a Kronecker product is not the Kronecker product of the LT of the individual matrices. The latter itself is lower triangular though (even sparser).

You're right. I shouldn't have been doing the above. The cholesky factorisation of a kronecker product is the kronecker product of the cholesky factors though, which is what I would ultimately like to exploit. Probably requires some care to make this work correctly.

dkarrasch · 2019-09-17T06:46:07Z

src/kroneckerpowers.jl

-                        K2::KroneckerPower{T2, N}) where {T1, T2, N}
+function Base.:*(
+    K1::KroneckerPower{T1, TA1, N},
+    K2::KroneckerPower{T1, TA2, N},


Why do you require that both Kronecker factors have the same eltype T1? I don't think that's necessary.

dkarrasch · 2019-09-17T06:53:07Z

src/cholesky.jl

+
+function cholesky(A::AbstractKroneckerProduct; check=true)
+    P, Q = getmatrices(A)
+    chol_P, chol_Q = cholesky(P; check=true), cholesky(Q; check=true)


Should this be

chol_P, chol_Q = cholesky(P; check=check), cholesky(Q; check=check)

? Otherwise, the kwarg doesn't enter anywhere.

dkarrasch · 2019-09-17T06:53:24Z

src/cholesky.jl

+
+function UpperTriangular(C::AbstractKroneckerProduct)
+    A, B = getmatrices(C)
+    return UpperTriangular(A) ⊗ UpperTriangular(B)


UpperTriangular is the constructor of the UpperTriangular type. So I guess this should return an object of that type, and you may wish to consider returning

UpperTriangular(UpperTriangular(A) ⊗ UpperTriangular(B))

? Otherwise, subsequent code will not benefit from upper-triangularizing an AbstractKroneckerProduct, it will be "invisible" from the type point of view.

willtebbutt · 2019-11-04T16:24:43Z

Closing in favour of #50

cholesky

7f6d6f3

willtebbutt mentioned this pull request Sep 6, 2019

Factorisations #10

Open

MichielStock reviewed Sep 10, 2019

View reviewed changes

src/cholesky.jl Show resolved Hide resolved

MichielStock reviewed Sep 10, 2019

View reviewed changes

dkarrasch reviewed Sep 17, 2019

View reviewed changes

willtebbutt closed this Nov 4, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] cholesky #44

[WIP] cholesky #44

willtebbutt commented Sep 6, 2019

MichielStock commented Sep 10, 2019

MichielStock Sep 10, 2019

MichielStock Sep 10, 2019

willtebbutt Sep 11, 2019

MichielStock Sep 12, 2019

willtebbutt commented Sep 11, 2019

dkarrasch Sep 17, 2019

dkarrasch Sep 17, 2019

dkarrasch Sep 17, 2019

willtebbutt commented Nov 4, 2019

[WIP] cholesky #44

[WIP] cholesky #44

Conversation

willtebbutt commented Sep 6, 2019

MichielStock commented Sep 10, 2019

MichielStock Sep 10, 2019

Choose a reason for hiding this comment

MichielStock Sep 10, 2019

Choose a reason for hiding this comment

willtebbutt Sep 11, 2019

Choose a reason for hiding this comment

MichielStock Sep 12, 2019

Choose a reason for hiding this comment

willtebbutt commented Sep 11, 2019

dkarrasch Sep 17, 2019

Choose a reason for hiding this comment

dkarrasch Sep 17, 2019

Choose a reason for hiding this comment

dkarrasch Sep 17, 2019

Choose a reason for hiding this comment

willtebbutt commented Nov 4, 2019