
Decouple interpolate functions from trajectory #645

Merged (13 commits) on Aug 25, 2022

Conversation

@isVoid (Contributor) commented Aug 15, 2022

Description

closes #644

This PR renames the `t` argument of `CubicSpline` to indicate that it can be used as a general cubic spline interpolation function.
Other minor documentation refreshes are also included.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@isVoid isVoid requested a review from a team as a code owner August 15, 2022 20:52
@isVoid isVoid requested a review from thomcom August 15, 2022 20:52
@github-actions github-actions bot added the Python Related to Python code label Aug 15, 2022
@isVoid isVoid added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Aug 15, 2022
@harrism harrism changed the title Refactors Trajectory Functions to Trajectory Module Refactor Trajectory Functions to Trajectory Module Aug 15, 2022
@isVoid (Contributor, Author) commented Aug 16, 2022

As discussed in the weekly meeting, we maintain separate trajectory and interpolate modules. The interpolate module is refactored to provide generic interpolation functions. Specifically, the argument `t` is renamed to `x`, the general independent variable.

@isVoid isVoid changed the title Refactor Trajectory Functions to Trajectory Module Decouple interpolate function and trajectory Aug 16, 2022
@isVoid isVoid changed the title Decouple interpolate function and trajectory Decouple interpolate functions from trajectory Aug 16, 2022
@isVoid (Contributor, Author) commented Aug 16, 2022

While rewriting the docs, I tried running micro-benchmarks to find the performance cut-off point, so that we can raise a warning for users about "too-small" inputs. But the results are a bit confusing: the cut-off point seems to shift with the ratio between the number of curves and the complexity of each curve. See benchmark: https://gist.github.com/isVoid/108d388173e7199758e31c1ce7f6842a

Perhaps @thomcom can help explain?
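The micro-benchmark in the gist boils down to timing the same interpolation workload under both backends and looking for where the timing curves cross. A minimal harness sketch, with toy workloads standing in for scipy and cuspatial (neither library is assumed here; `bench` is a hypothetical helper):

```python
import time

def bench(fn, *args, repeats=5):
    """Return the best wall-clock time of fn(*args) over several runs."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn(*args)
        best = min(best, time.perf_counter() - start)
    return best

# Toy workloads standing in for the two backends; in the real benchmark
# these would be scipy's CubicSpline fit and cuspatial's spline fit.
host_time = bench(sum, range(1_000))
device_time = bench(sum, range(100_000))

# The "cut-off point" is the input size at which the two timings cross.
```

Sweeping the input size (number of curves, and vertices per curve) and recording where `host_time` overtakes `device_time` is what shifts with the curve-count/complexity ratio described above.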

@thomcom (Contributor) commented Aug 16, 2022

Hey @isVoid, I'm not sure what you mean by the cut-off point?

@isVoid (Contributor, Author) commented Aug 16, 2022

@thomcom currently, in the `CubicSpline` code we intentionally avoid API parity with scipy, because we want to discourage users from calling the API when the input data size is small:

This allows API parity with scipy. This isn't recommended, as scipy
host based interpolation performance is likely to exceed GPU performance
for a single curve.

This doesn't help when a user still calls the API with a small data size without being aware of the performance implication. In this refactor I argue that the proper way to design the API is to use the same argument names as scipy and raise a `UserWarning` when the input size is small. This is similar to how cudf `groupby.apply` handles the many-groups case. The key question here is: how do we determine when the input size is too small? There must exist some data size below which scipy is more performant than cuspatial, and that data size is the "cut-off" point I mentioned above.
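The warning-based design could look like the following sketch. The threshold value, constant name, and helper function are all hypothetical, not part of the cuspatial API:

```python
import warnings

# Hypothetical names: CURVE_COUNT_CUTOFF and check_input_size are
# illustrative only, not part of the cuspatial API. The threshold would
# come from the benchmark discussed above.
CURVE_COUNT_CUTOFF = 15

def check_input_size(num_curves):
    """Warn when the input is likely too small to benefit from the GPU."""
    if num_curves < CURVE_COUNT_CUTOFF:
        warnings.warn(
            f"Interpolating {num_curves} curves on the GPU; host-based "
            f"scipy is likely faster below {CURVE_COUNT_CUTOFF} curves.",
            UserWarning,
        )

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    check_input_size(3)   # below the cut-off: warns
    check_input_size(50)  # at or above the cut-off: silent
```

This keeps the scipy-compatible signature while still steering users away from GPU calls that a host library would serve faster.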

@isVoid isVoid added breaking Breaking change and removed non-breaking Non-breaking change labels Aug 16, 2022
@thomcom (Contributor) commented Aug 16, 2022

Ah, I see. I just ran a few quick benchmarks, and scipy is 15x faster than cuspatial at interpolation on my HP Z8 workstation. cuspatial only parallelizes each individual spline fit, so I estimate that once you have 15 trajectories, cuspatial will be faster at interpolation. I was able to verify this with a set of 15 trajectories of length 1m. Creating the curve object still takes 15x as long (which is basically just creating 15 interpolations in parallel, each taking 15x as long as the host curve), but interpolating 1m samples across 15 separate trajectories takes 8ms, while interpolating the same 15m samples in scipy takes 230ms.

@isVoid (Contributor, Author) commented Aug 17, 2022

a set of 15 trajectories of length 1m

Do you mean 15 curves, each with 1,000,000 vertices?

Creating the curve object still takes 15x as long (which is basically just creating 15 interpolations in parallel, each taking 15x as long as the hcurve)

Do you mean that when cuspatial interpolates 15 curves in parallel, the total kernel time is 15x as long as scipy's? What's the order of magnitude of the time cost here, seconds? milliseconds?

interpolating 1m samples across 15 separate trajectories takes 8ms, interpolating 15m samples in scikit learn takes 230ms

Sounds like cuspatial has an advantage when sampling a complex (many-vertex) curve, but not when computing the curve parameters for a small total number of curves? I think we can raise a performance warning when a small number of curves (<100) is constructed, do you agree?

Out of scope, but could you fit the curve with scipy, copy the parameters to the device, and sample on the device? If you only have 15 curves, you only have 15x4 parameters.
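For illustration, once the spline is fitted on the host, sampling only needs the per-segment coefficients. A minimal host-side sketch of evaluating a piecewise cubic from its (a, b, c, d) parameters, in pure Python (the function name is hypothetical; a device version would do the same arithmetic per sample point):

```python
def eval_piecewise_cubic(breaks, coeffs, x):
    """Evaluate a fitted piecewise cubic at x.

    breaks: sorted knot positions [x0, x1, ..., xn]
    coeffs: per-segment (a, b, c, d), so that on [x_i, x_{i+1}]
            y = a + b*dx + c*dx**2 + d*dx**3, with dx = x - x_i
    """
    i = 0  # linear scan; bisect would do for many segments
    while i < len(coeffs) - 1 and x >= breaks[i + 1]:
        i += 1
    a, b, c, d = coeffs[i]
    dx = x - breaks[i]
    # Horner evaluation of the segment polynomial
    return a + dx * (b + dx * (c + dx * d))

# One segment representing y = x**3 on [0, 2]
assert eval_piecewise_cubic([0.0, 2.0], [(0.0, 0.0, 0.0, 1.0)], 0.5) == 0.125
```

Note the parameter count is 4 per *segment* rather than 4 per curve, so the transfer is still small as long as each curve has few knots.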

@thomcom (Contributor) commented Aug 17, 2022

a set of 15 trajectories of length 1m

Do you mean 15 curves, each with 1,000,000 vertices?

Yes

Do you mean that when cuspatial interpolates 15 curves in parallel, the total kernel time is 15x as long as scipy's? What's the order of magnitude of the time cost here, seconds? milliseconds?

I think that when cuspatial interpolates 15 curves in parallel the cost is similar to using scipy to compute them in serial.

Sounds like cuspatial has an advantage when sampling a complex (many-vertex) curve, but not when computing the curve parameters for a small total number of curves? I think we can raise a performance warning when a small number of curves (<100) is constructed, do you agree?

I was thinking that <15 curves was the appropriate threshold for the warning. I'll write a notebook to make the benchmark concrete.

Out of scope, but could you fit the curve with scipy, copy the parameters to the device, and sample on the device? If you only have 15 curves, you only have 15x4 parameters.

This would be theoretically possible, but our curve-fitting equations are not identical to scipy's, iirc.

@isVoid (Contributor, Author) commented Aug 19, 2022

rerun tests

@thomcom (Contributor) left a comment


lgtm

@isVoid isVoid requested a review from harrism August 25, 2022 00:30
@thomcom (Contributor) commented Aug 25, 2022

@gpucibot merge

@rapids-bot rapids-bot bot merged commit db26c1a into rapidsai:branch-22.10 Aug 25, 2022
Labels: breaking (Breaking change), improvement (Improvement / enhancement to an existing function), Python (Related to Python code)
Status: Done

Successfully merging this pull request may close these issues:

Proposed New Layout of the Python Library