implement precompute = POLYNOMIAL for blocking = false #94

Closed
JakobAsslaender opened this issue Mar 23, 2022 · 5 comments

Comments

@JakobAsslaender
Contributor

Thanks, @tknopp, for the comments in #93! Indeed, for calling calculateToeplitzKernel! repeatedly, the settings precompute = LINEAR, blocking = false are the fastest. Hence, it would be awesome to have an implementation for precompute = POLYNOMIAL, blocking = false, if you say that is faster!
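For reference, a minimal sketch of the two configurations being compared, assuming the NFFT.jl keyword names m, σ, precompute, and blocking as used in this thread; the problem size and nodes are placeholders:

```julia
using NFFT

# Placeholder problem size and nodes (2D, random nodes in [-0.5, 0.5)).
N = (256, 256)
M = 20_000
k = rand(2, M) .- 0.5

# The two configurations discussed here; m and σ are the default values.
p_linear = plan_nfft(k, N; m = 4, σ = 2.0,
                     precompute = NFFT.LINEAR, blocking = false)
p_poly   = plan_nfft(k, N; m = 4, σ = 2.0,
                     precompute = NFFT.POLYNOMIAL, blocking = false)

# Repeated use, e.g. inside calculateToeplitzKernel!, only updates the nodes:
nodes!(p_linear, rand(2, M) .- 0.5)
```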

Off topic: Many thanks for promptly merging and fixing the other issues! Would you mind releasing a patch version to make it easier to incorporate the bugfixes?

@tknopp
Member

tknopp commented Mar 24, 2022

The release is triggered.

Regarding the other implementation: I will give this a go once I find some time. I want to have it for completeness anyway. Whether it will help in your case is not 100% clear; it depends on the fraction of time the precomputation currently takes.

tknopp added a commit that referenced this issue Mar 26, 2022
@tknopp
Member

tknopp commented Mar 26, 2022

I gave this a go. If you plan to benchmark, it would be great to do so in a systematic way. I would be particularly interested in the impact on precomputation time and runtime. I expect almost zero precomputation cost. The runtime should not change for small m and should be better for large m. In your application, however, I doubt that you need a large m.
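One possible way to probe this m dependence systematically (a sketch only, not from the thread; sizes are placeholders and the measurements use BenchmarkTools):

```julia
using NFFT, LinearAlgebra, BenchmarkTools

N = (256, 256)                 # placeholder problem size
k = rand(2, 20_000) .- 0.5     # placeholder nodes
d = randn(ComplexF64, size(k, 2))
x = zeros(ComplexF64, N)

for pre in (NFFT.LINEAR, NFFT.POLYNOMIAL), m in (2, 4, 6, 8)
    # precomputation cost = time to build the plan
    t_pre = @belapsed plan_nfft($k, $N; m = $m, σ = 2.0,
                                precompute = $pre, blocking = false)
    p = plan_nfft(k, N; m = m, σ = 2.0, precompute = pre, blocking = false)
    # runtime = one adjoint NFFT
    t_run = @belapsed mul!($x, adjoint($p), $d)
    println("precompute = $pre, m = $m: plan $t_pre s, adjoint mul! $t_run s")
end
```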

@JakobAsslaender
Contributor Author

Many thanks! I am not sure along which axes of the large parameter space you want me to run systematic benchmarks. In my real-world example, with the default settings m=4 and sigma=2, LINEAR and POLYNOMIAL seem to be roughly the same speed. In this case, I plan the NFFT once, copy it 40x (one for each thread), and then have a parallel for loop in which I call nodes! followed by one mul!(x, adjoint(p), d).
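A minimal sketch of that workflow, with placeholder sizes; it assumes that copy works on the plan (as the per-thread copies above imply) and uses the :static scheduler so that indexing by threadid() stays valid:

```julia
using NFFT, LinearAlgebra

N     = (256, 256)                     # placeholder image size
M     = 20_000                         # placeholder number of nodes per set
nsets = 40                             # 40 node sets, as described above

p0    = plan_nfft(rand(2, M) .- 0.5, N; m = 4, σ = 2.0,
                  precompute = NFFT.POLYNOMIAL, blocking = false)
plans = [copy(p0) for _ in 1:Threads.nthreads()]   # one plan per thread

ks = [rand(2, M) .- 0.5 for _ in 1:nsets]          # placeholder node sets
ds = [randn(ComplexF64, M) for _ in 1:nsets]       # placeholder k-space data
xs = [zeros(ComplexF64, N) for _ in 1:nsets]       # reconstructed images

# :static pins each iteration to one thread, so each thread reuses its own plan.
Threads.@threads :static for i in 1:nsets
    p = plans[Threads.threadid()]
    nodes!(p, ks[i])                   # update this thread's plan to node set i
    mul!(xs[i], adjoint(p), ds[i])     # adjoint NFFT: data -> image
end
```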

@tknopp
Member

tknopp commented Mar 27, 2022

What would be interesting is:

  • planning time
  • copying time
  • runtime (multi- and single-threaded)

My feeling is that in your case the last part takes more than 90%.
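A sketch of how these three timings could be measured for one configuration (not from the thread; sizes are placeholders, copy on the plan is assumed to be available as in the workflow above, and the script would be run with different `julia --threads` settings to compare single- and multi-threaded runtime):

```julia
using NFFT, LinearAlgebra, BenchmarkTools

N = (256, 256)                 # placeholder problem size
k = rand(2, 20_000) .- 0.5
d = randn(ComplexF64, size(k, 2))
x = zeros(ComplexF64, N)

# planning time
t_plan = @belapsed plan_nfft($k, $N; m = 4, σ = 2.0,
                             precompute = NFFT.POLYNOMIAL, blocking = false)
p = plan_nfft(k, N; m = 4, σ = 2.0,
              precompute = NFFT.POLYNOMIAL, blocking = false)

# copying time (assumes the plan supports copy, as used per thread above)
t_copy = @belapsed copy($p)

# runtime of one adjoint NFFT; rerun with different thread counts to compare
t_run = @belapsed mul!($x, adjoint($p), $d)

println("plan: $t_plan s, copy: $t_copy s, adjoint mul!: $t_run s")
```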

@tknopp
Member

tknopp commented Jun 19, 2022

This feature is implemented.

@tknopp tknopp closed this as completed Jun 19, 2022