Beta geo expected probability no purchase #1094

DylanZammit · 2024-10-19T09:13:27Z

Description

The linked issue below describes the motivation and derivation of the newly introduced method. Only 2 files have been updated:

pymc_marketing/clv/models/beta_geo.py: Introduction of BetaGeoModel.expected_probability_no_purchase, which implements the derivation in aforementioned document.
docs/source/notebooks/bg_nbd.ipynb: Includes an example of how to use it with some example customers.

I might require guidance on how to implement unit tests for this new method. Other unit tests for the BetaGeo model seem to compare the output results of similar methods with the ones given by the lifetimes library. However, there is no corresponding functionality for the new proposed method.

Related Issue

Closes #
Related to Include probability of 0 purchases in time range (T, T+t] #1093

Checklist

Checked that the pre-commit linting/style checks pass
Included tests that prove the fix is effective or that the new feature works
Added necessary documentation (docstrings and/or example notebooks)
If you are a pro: each commit corresponds to a relevant logical change

Modules affected

MMM
CLV

Type of change

📚 Documentation preview 📚: https://pymc-marketing--1094.org.readthedocs.build/en/1094/

review-notebook-app · 2024-10-19T09:13:33Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

pymc_marketing/clv/models/beta_geo.py

review-notebook-app · 2024-10-19T09:20:25Z

View / edit / reply to this conversation on ReviewNB

juanitorduz commented on 2024-10-19T09:20:24Z
----------------------------------------------------------------

Can you please remove this from the notebook?

DylanZammit commented on 2024-10-19T09:25:43Z
----------------------------------------------------------------

Removed

juanitorduz · 2024-10-19T09:25:34Z

I might require guidance on how to implement unit tests for this new method.

Absolutely! Some ideas for this new method are:

Test that the method does not fail by simply calling it on a fitted model as in

pymc-marketing/tests/clv/models/test_beta_geo.py

Line 260 in e46f690

def test_expected_num_purchases(self):
Predict this on a user (or set of users) where this probability should be very high (close to one) and very low (close to zero)

DylanZammit · 2024-10-19T09:25:44Z

Removed

View entire conversation on ReviewNB

DylanZammit · 2024-10-19T09:28:27Z

I might require guidance on how to implement unit tests for this new method.

Absolutely! Some ideas for this new method are:

Test that the method does not fail by simply calling it on a fitted model as in

pymc-marketing/tests/clv/models/test_beta_geo.py

Line 260 in e46f690

def test_expected_num_purchases(self):

Predict this on a user (or set of users) where this probability should be very high (close to one) and very low (close to zero)

Thank you! I will try to work on this between today and tomorrow.

I have removed the redundant cell-block in the notebook and made reference to Equation 34 which I based the method off in the docstring of the new method. Appreciate the help and feedback, please let me know if there are any other suggested changes and I will review accordingly.

DylanZammit · 2024-10-19T11:48:37Z

@juanitorduz @ColtAllen

Following unit tests added:

assertion on expected output layout
frequent customers with large t should have probability of no purchases close to 0
infrequent customers with small t should have probability of no purchases close to 1
If t = 0, then the probability of no purchases should be 1

All tests passed.

ColtAllen

Thanks for creating this PR @DylanZammit. I noticed the expression in the research note includes the model likelihood function, but it seems here you've simplified things and removed terms that would otherwise cancel out. If results are valid this will be a more performant solution, well done.

Aside from renaming the function and a few minor details in the notebook and docstrings, I don't have any major remarks.

pymc_marketing/clv/models/beta_geo.py

review-notebook-app · 2024-10-21T12:55:05Z

View / edit / reply to this conversation on ReviewNB

ColtAllen commented on 2024-10-21T12:55:05Z
----------------------------------------------------------------

Suppress this warning so that it does not appear in the documentation

ColtAllen commented on 2024-10-24T10:45:40Z
----------------------------------------------------------------

Re-running the cell should clear it.

review-notebook-app · 2024-10-21T12:55:06Z

View / edit / reply to this conversation on ReviewNB

ColtAllen commented on 2024-10-21T12:55:05Z
----------------------------------------------------------------

See PR review notes about renaming this method. If you're curious how this method is demonstrated with ParetoNBDModel:

https://www.pymc-marketing.io/en/stable/notebooks/clv/pareto_nbd.html#probability-of-n-purchases-over-time-t

juanitorduz · 2024-10-23T17:30:36Z

@DylanZammit I think this one is very close to the finish line. I think there are a couple of open comments from Colt, and then we are good to go. Please let us know if you need support :)

pymc_marketing/clv/models/beta_geo.py

ColtAllen · 2024-10-24T10:45:41Z

Re-running the cell should clear it.

View entire conversation on ReviewNB

ColtAllen · 2024-10-24T10:58:52Z

@DylanZammit I think this one is very close to the finish line. I think there are a couple of open comments from Colt, and then we are good to go. Please let us know if you need support :)

I saw two typos in the notebook ("ot keep" and "deposits" instead of "spending"), but ReviewNB isn't allowing me to post any more comments for some reason. I'm also curious about that DIV/0 warning with the MAP fit, and noticed an older version of pytensor in the watermark.

DylanZammit · 2024-10-24T20:53:53Z

Made all suggested changes:

typos
pytensor version (reinstalled the conda environment using the same command conda env create -f environment.yml)
remove docstring reference of special case when y=0
re-ran the cell that gave a warning (which removes the warning).

I have still not investigated the DIV/0 error, however, I suspect it is due to numerical issues. Please let me know if you have any further suggestions.

ColtAllen · 2024-10-25T20:59:30Z

I have still not investigated the DIV/0 error, however, I suspect it is due to numerical issues. Please let me know if you have any further suggestions.

I'm not able to re-create the error on my 2023 M2 Macbook with identical pymc and pytensor versions; maybe it's a C++ compiler difference between your OS and mine. Anywho, if you add this cell to the top of the notebook and run it, then delete it before pushing, it'll hide all the warnings:

import warnings 
  
# Set warnings to be ignored 
warnings.filterwarnings('ignore')

…_purchase' into beta_geo_expected_probability_no_purchase

codecov · 2024-10-27T21:10:49Z

Codecov Report

Attention: Patch coverage is 96.00000% with 1 line in your changes missing coverage. Please review.

Project coverage is 95.61%. Comparing base (e8a4e3b) to head (e3751a7).
Report is 121 commits behind head on main.

Files with missing lines	Patch %	Lines
pymc_marketing/clv/models/beta_geo.py	96.00%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #1094   +/-   ##
=======================================
  Coverage   95.61%   95.61%           
=======================================
  Files          39       39           
  Lines        4039     4063   +24     
=======================================
+ Hits         3862     3885   +23     
- Misses        177      178    +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

ColtAllen

Good work on the notebook and augmenting existing research with a more performant function derivation! If you ever want to link your PDF to the docstring references feel free to open another PR.

pymc_marketing/clv/models/beta_geo.py

DylanZammit added 3 commits October 18, 2024 15:51

beta_geo expected probability of no deposits in range

00d0a1d

max -> np.maximum, remove 1 minus

7e29dd9

add documentation example

a8acf5d

juanitorduz requested a review from ColtAllen October 19, 2024 09:17

juanitorduz added the CLV label Oct 19, 2024

juanitorduz reviewed Oct 19, 2024

View reviewed changes

pymc_marketing/clv/models/beta_geo.py Outdated Show resolved Hide resolved

DylanZammit added 2 commits October 19, 2024 11:21

make reference to Equation 34 of notes

dd915f8

removed redundant "cd" from notebook

a12fb2c

DylanZammit added 6 commits October 19, 2024 13:40

test: probability no purchase layout

66dfcde

test: probability no purchase now should be 1

f0c2b53

test: probability no purchase frequent customers should be 0

39317d7

test: probability no purchase infrequent customers should be 1

e6d0479

ylabel was mistakenly set to "probability alive"

25d184d

no need for test_t parameterization

21d7ac4

ColtAllen requested changes Oct 21, 2024

View reviewed changes

pymc_marketing/clv/models/beta_geo.py Show resolved Hide resolved

wd60622 added the enhancement New feature or request label Oct 21, 2024

add hardie notes reference in class docstring

e0ea322

juanitorduz requested a review from ColtAllen October 22, 2024 07:55

ColtAllen linked an issue Oct 22, 2024 that may be closed by this pull request

Include probability of 0 purchases in time range (T, T+t] #1093

Closed

juanitorduz added this to the 0.11.0 milestone Oct 22, 2024

ColtAllen reviewed Oct 24, 2024

View reviewed changes

pymc_marketing/clv/models/beta_geo.py Show resolved Hide resolved

DylanZammit added 2 commits October 24, 2024 22:03

remove cell warning

f44fe24

typos

fd67bf3

github-actions bot added docs Improvements or additions to documentation tests duplicate This issue or pull request already exists labels Oct 24, 2024

remove mention of special case when y=0 in docstring

a806e2e

juanitorduz requested a review from ColtAllen October 24, 2024 20:20

later pytensor version

550330a

Merge branch 'main' into beta_geo_expected_probability_no_purchase

e68ddcc

wd60622 removed the duplicate This issue or pull request already exists label Oct 26, 2024

DylanZammit added 2 commits October 27, 2024 21:02

remove div/0 warning

06427f8

Merge remote-tracking branch 'origin/beta_geo_expected_probability_no…

e3751a7

…_purchase' into beta_geo_expected_probability_no_purchase

github-actions bot added the duplicate This issue or pull request already exists label Oct 27, 2024

ColtAllen approved these changes Oct 27, 2024

View reviewed changes

pymc_marketing/clv/models/beta_geo.py Show resolved Hide resolved

pymc_marketing/clv/models/beta_geo.py Show resolved Hide resolved

pymc_marketing/clv/models/beta_geo.py Show resolved Hide resolved

ColtAllen merged commit 31def91 into pymc-labs:main Oct 27, 2024
14 checks passed

wd60622 removed the duplicate This issue or pull request already exists label Oct 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Beta geo expected probability no purchase #1094

Beta geo expected probability no purchase #1094

DylanZammit commented Oct 19, 2024 •

edited by wd60622

Loading

review-notebook-app bot commented Oct 19, 2024

review-notebook-app bot commented Oct 19, 2024 •

edited

Loading

juanitorduz commented Oct 19, 2024

DylanZammit commented Oct 19, 2024

DylanZammit commented Oct 19, 2024

DylanZammit commented Oct 19, 2024

ColtAllen left a comment

review-notebook-app bot commented Oct 21, 2024 •

edited

Loading

review-notebook-app bot commented Oct 21, 2024 •

edited

Loading

juanitorduz commented Oct 23, 2024

ColtAllen commented Oct 24, 2024

ColtAllen commented Oct 24, 2024

DylanZammit commented Oct 24, 2024

ColtAllen commented Oct 25, 2024

codecov bot commented Oct 27, 2024 •

edited

Loading

ColtAllen left a comment

Beta geo expected probability no purchase #1094

Beta geo expected probability no purchase #1094

Conversation

DylanZammit commented Oct 19, 2024 • edited by wd60622 Loading

Description

Related Issue

Checklist

Modules affected

Type of change

review-notebook-app bot commented Oct 19, 2024

review-notebook-app bot commented Oct 19, 2024 • edited Loading

juanitorduz commented Oct 19, 2024

DylanZammit commented Oct 19, 2024

DylanZammit commented Oct 19, 2024

DylanZammit commented Oct 19, 2024

ColtAllen left a comment

Choose a reason for hiding this comment

review-notebook-app bot commented Oct 21, 2024 • edited Loading

review-notebook-app bot commented Oct 21, 2024 • edited Loading

juanitorduz commented Oct 23, 2024

ColtAllen commented Oct 24, 2024

ColtAllen commented Oct 24, 2024

DylanZammit commented Oct 24, 2024

ColtAllen commented Oct 25, 2024

codecov bot commented Oct 27, 2024 • edited Loading

Codecov Report

ColtAllen left a comment

Choose a reason for hiding this comment

DylanZammit commented Oct 19, 2024 •

edited by wd60622

Loading

review-notebook-app bot commented Oct 19, 2024 •

edited

Loading

review-notebook-app bot commented Oct 21, 2024 •

edited

Loading

review-notebook-app bot commented Oct 21, 2024 •

edited

Loading

codecov bot commented Oct 27, 2024 •

edited

Loading