feat(experiments): Init new experiment query runner #28347

andehen · 2025-02-05T17:59:08Z

Problem

We are building a new query runner for experiments, see https://github.com/PostHog/product-internal/pull/712

Changes

Initial setup of a new query runner. Currently behind a feature flag. Will only be tested internally for now.

Contains:

new metric types to represent experiment metrics
new metric form components
logic to handle both metric types safely

How did you test this code?

lots of tests added + query snapshots
lots of manual testing to verify it does not break anything for existing users
storybook story added

posthog/hogql_queries/experiments/experiment_query_runner.py

posthog/hogql_queries/experiments/test/test_experiment_query_runner.py

jurajmajerik

Looking great. Especially love that we're now unifying the data warehouse integration with the standard use case. This makes so much sense in hindsight.

I know this is very much a WIP, so feel free to disregard some of my comments :)

jurajmajerik · 2025-02-06T11:05:13Z

posthog/hogql_queries/experiments/experiment_query_runner.py

+            case _:
+                # Else, we default to count
+                # We then just emit 1 so we can easily sum it up
+                metric_value = "1"


Curious how the funnel/binomial comes into play here. Would we also simply count "1" for every recorded result returned from the funnel's last step?

jurajmajerik · 2025-02-06T11:06:32Z

posthog/hogql_queries/experiments/experiment_query_runner.py

+                    *test_accounts_filter,
+                ]
+            ),
+            group_by=[ast.Field(chain=["variant"]), ast.Field(chain=["distinct_id"])],


This isn't a new problem, but I’m just noting that a single user can have two different variant values. This exposure query groups by [variant, distinct id], so if a user receives two different flag values, they'll appear in both groups. This is mainly an SDK/feature flag implementation issue, but the question is if we want to handle it on our end (e.g. counting each user only once).

Yeah, I think what we want to do here is to exclude users with multiple variant exposure and include the count in the "health checks" we display to the user.

Exclude them entirely? Not simply use the most recent exposure?

Yes, that's my position, as:

Users with multiple variant exposure screws up results

it's an error we should warn the user about

it complicates our queries if we have to handle this

I think we should start with that at least, as that's the most sane default behavior. More advanced handling could be implemented in the future if requested.

jurajmajerik · 2025-02-06T11:09:31Z

posthog/hogql_queries/experiments/experiment_query_runner.py

+                        expr=ast.Field(chain=[series_node.table_name, series_node.distinct_id_field]),
+                    ),
+                    ast.Field(chain=["exposure", "variant"]),
+                    parse_expr(f"{metric_value} as value"),


This is a minor nit/more of a curiosity thing (I haven't written any AST myself), but is there a convention for when to use parse_expr versus assembling the query programmatically? i.e:

This:
parse_expr(f"{metric_value} as value"),

vs this:

ast.Alias( alias="value", expr=parse_expr(metric_value) )

The latter seems more consistent and less error-prone.

jurajmajerik · 2025-02-06T11:10:33Z

posthog/hogql_queries/experiments/experiment_query_runner.py

+                    ast.Field(chain=["events", "timestamp"]),
+                    ast.Field(chain=["events", "distinct_id"]),
+                    ast.Field(chain=["exposure", "variant"]),
+                    ast.Field(chain=["events", "event"]),
+                    parse_expr(f"{metric_value} as value"),


There are a lot of manually written strings here and we'll be writing many more similar queries. It would be good to abstract the common ones into string constants to reduce the risk of typos.

jurajmajerik · 2025-02-06T11:10:57Z

posthog/hogql_queries/experiments/experiment_query_runner.py

+                next_join=ast.JoinExpr(
+                    table=events_after_exposure_query,
+                    join_type="LEFT JOIN",
+                    alias="eae",


I'd prefer a bit more descriptive alias here :)

jurajmajerik · 2025-02-06T11:11:39Z

posthog/schema.py

@@ -7196,6 +7196,21 @@ class ExperimentTrendsQuery(BaseModel):
    response: Optional[ExperimentTrendsQueryResponse] = None


+class ExperimentQuery(BaseModel):


Is this generated from the frontend's schema-general.ts (as it should be)? Let's commit that file too?

Is this generated from the frontend's schema-general.ts (as it should be)? Let's commit that file too?

Nope, added it solely as a talking point for now.

andehen · 2025-02-06T11:17:07Z

Looking great. Especially love that we're now unifying the data warehouse integration with the standard use case. This makes so much sense in hindsight.

Good! 👍

I know this is very much a WIP, so feel free to disregard some of my comments :)

Yeah, I have consciously taken many shortcuts to identify and resolve the complex issues as quickly as possible. So I'll wait with any polishing until the core stuff feels right. It's beneficial to keep the code changes minimal until interfaces stabilizes, so iterating and refactoring is easier.

Co-authored-by: github-actions <41898282+github-actions[bot]@users.noreply.github.com>

posthog-bot · 2025-02-12T11:42:32Z

📸 UI snapshots have been updated

2 snapshot changes in total. 0 added, 2 modified, 0 deleted:

chromium: 0 added, 2 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

frontend/src/scenes/experiments/ExperimentView/ExperimentView.tsx

posthog-bot · 2025-02-14T00:07:55Z

📸 UI snapshots have been updated

2 snapshot changes in total. 0 added, 2 modified, 0 deleted:

chromium: 0 added, 2 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

greptile-apps

PR Summary

Based on the provided files and context, I'll provide a concise summary of the key changes in this PR:

Initial setup of a new experiment query runner with unified data warehouse integration. Key changes include:

Added new ExperimentMetric type to replace ExperimentQuery, simplifying the metrics structure
Introduced ExperimentActionMetricConfig for handling action-based metrics alongside event metrics
Created new ExperimentQueryRunner class in Python for handling experiment analysis through HogQL queries
Added feature flag EXPERIMENTS_NEW_QUERY_RUNNER to control rollout
Updated frontend components to support both legacy and new metric types with proper type safety

The changes appear well-structured and maintain backward compatibility while introducing the new query runner infrastructure. The implementation is currently WIP and not yet being called from anywhere.

Potential concerns:

Missing error handling for database queries in ExperimentQueryRunner
Type safety improvements needed in filterToMetricConfig utility
Validation needed for statistical assumptions in query runner

_{34 file(s) reviewed, 33 comment(s)}
_{Edit PR Review Bot Settings | Greptile}

ee/clickhouse/views/experiment_saved_metrics.py

frontend/src/queries/schema/schema-general.ts

frontend/src/scenes/experiments/ExperimentView/ExperimentView.tsx

greptile-apps · 2025-02-14T09:00:23Z

frontend/src/scenes/experiments/ExperimentView/components.tsx

+                    {result &&
+                        (result.kind === 'ExperimentTrendsQuery' || result.kind === 'ExperimentFunnelsQuery') && (
+                            <ExploreButton result={result} />


logic: string literals used for type comparison instead of NodeKind enum that's already imported and used elsewhere in the file

Suggested change

{result &&

(result.kind === 'ExperimentTrendsQuery' || result.kind === 'ExperimentFunnelsQuery') && (

<ExploreButton result={result} />

{result &&

(result.kind === NodeKind.ExperimentTrendsQuery || result.kind === NodeKind.ExperimentFunnelsQuery) && (

<ExploreButton result={result} />

greptile-apps · 2025-02-14T09:01:16Z

frontend/src/scenes/experiments/Metrics/ExperimentMetricForm.tsx

+    if (!metricIdx && metricIdx !== 0) {
+        return <></>
+    }


style: returning an empty fragment when metricIdx is undefined could lead to confusing UI state - consider showing a message or error state instead

greptile-apps · 2025-02-14T09:06:44Z

posthog/hogql_queries/experiments/experiment_query_runner.py

+                # If the metric type is continuous, we need to extract the value from the event property
+                metric_property = self.metric.metric_config.math_property
+                if is_data_warehouse_query:
+                    metric_value = f"toFloat('{metric_property}')"


logic: potential SQL injection vulnerability - metric_property value should be escaped/sanitized before interpolation

greptile-apps · 2025-02-14T09:06:44Z

posthog/hogql_queries/experiments/experiment_query_runner.py

+        self.experiment = Experiment.objects.get(id=self.query.experiment_id)
+        self.feature_flag = self.experiment.feature_flag


logic: missing error handling for Experiment.objects.get() which could raise DoesNotExist

Suggested change

self.experiment = Experiment.objects.get(id=self.query.experiment_id)

self.feature_flag = self.experiment.feature_flag

try:

self.experiment = Experiment.objects.get(id=self.query.experiment_id)

self.feature_flag = self.experiment.feature_flag

except Experiment.DoesNotExist:

raise ValidationError(f"Experiment with id {self.query.experiment_id} does not exist")

greptile-apps · 2025-02-14T09:06:45Z

posthog/hogql_queries/experiments/experiment_query_runner.py

+            date_range=self.date_range,
+            team=self.team,
+            interval=IntervalType.DAY,
+            now=datetime.now(),


style: using datetime.now() without timezone info could cause inconsistencies - should use datetime.now(UTC)

Suggested change

now=datetime.now(),

now=datetime.now(UTC),

greptile-apps · 2025-02-14T09:07:32Z

posthog/hogql_queries/experiments/test/test_experiment_query_runner.py

+        elif end_date is not None:
+            end_date = timezone.make_aware(end_date)  # Make naive datetime timezone-aware


logic: Redundant condition - elif end_date is not None will always be true since it's the else branch of if end_date is None

Suggested change

elif end_date is not None:

end_date = timezone.make_aware(end_date) # Make naive datetime timezone-aware

else:

end_date = timezone.make_aware(end_date) # Make naive datetime timezone-aware

greptile-apps · 2025-02-14T09:07:42Z

posthog/hogql_queries/query_runner.py

+    if kind == "ExperimentQuery":
+        from .experiments.experiment_query_runner import ExperimentQueryRunner
+
+        return ExperimentQueryRunner(
+            query=query,
+            team=team,
+            timings=timings,
+            modifiers=modifiers,
+            limit_context=limit_context,
+        )


style: The ExperimentQuery case follows the same pattern as other runners but doesn't use cast() like most other cases do. Consider adding cast for consistency.

posthog-bot · 2025-02-14T10:37:19Z

📸 UI snapshots have been updated

4 snapshot changes in total. 0 added, 4 modified, 0 deleted:

chromium: 0 added, 4 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

This reverts commit 38c3bc4.

posthog-bot · 2025-02-14T12:19:59Z

📸 UI snapshots have been updated

4 snapshot changes in total. 0 added, 4 modified, 0 deleted:

chromium: 0 added, 4 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

danielbachhuber

Looks good 👍 I did a final round of testing...

With the feature flag disabled:

An existing experiments displays results as expected.
I could add and remove metrics from the experiment as expected.
I could create and delete shared metrics as expected.

With the feature flag enabled:

I could edit an existing shared metric with a query.
Creating a new shared metric used the experiment metric form. The event and property both saved as expected.
Loading an existing experiment displayed results as expected.
I could add new query metrics and remove existing metrics from an existing experiment.
Creating a new experiment used the new experiment metric UX as expected.

danielbachhuber · 2025-02-14T13:09:08Z

frontend/src/scenes/experiments/experimentLogic.tsx

@@ -1331,13 +1363,22 @@ export const experimentLogic = kea<experimentLogicType>([
        getMetricType: [


Might want to rename this to getExperimentQueryType in a follow-up.

yep, noted it down

danielbachhuber · 2025-02-14T13:15:59Z

posthog/hogql_queries/experiments/experiment_query_runner.py

+            raise ValueError("Control variant not found in experiment results")
+
+        # Statistical analysis
+        if self.stats_version == 2:


Can't we only support stats_version == 2 here?

Yes, shouldn't be possible with stats_version == 1 here. Noted it down to clean this up.

jurajmajerik

Let's go!

posthog-bot · 2025-02-14T15:18:22Z

📸 UI snapshots have been updated

2 snapshot changes in total. 0 added, 2 modified, 0 deleted:

chromium: 0 added, 2 modified, 0 deleted (diff for shard 1)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

danielbachhuber mentioned this pull request Feb 5, 2025

feat(experiments): ExperimentQueryRunner iteration 1 #28353

Merged

danielbachhuber reviewed Feb 5, 2025

View reviewed changes

posthog/hogql_queries/experiments/experiment_query_runner.py Show resolved Hide resolved

posthog/hogql_queries/experiments/test/test_experiment_query_runner.py Outdated Show resolved Hide resolved

andehen mentioned this pull request Feb 6, 2025

POC new experiment query #28016

Closed

jurajmajerik reviewed Feb 6, 2025

View reviewed changes

andehen force-pushed the init-new-experiment-query-runner branch from 2587bf4 to f89e988 Compare February 6, 2025 13:07

danielbachhuber mentioned this pull request Feb 11, 2025

feat(experiments): First pass at ExperimentQuery in frontend #28545

Merged

andehen and others added 7 commits February 11, 2025 12:14

Init ExperimentQueryRunner

e76abf5

Init test of new experiment query runner

2c053a0

remove duplicate attributes

7edfc5e

Fix test

515023f

feat(experiments): ExperimentQueryRunner iteration 1 (#28353)

9aa0445

Co-authored-by: github-actions <41898282+github-actions[bot]@users.noreply.github.com>

feat(experiments): new experiment query and metric interfaces (#28367)

0d7f1cc

Fix merge conflict errors

a2e2e27

danielbachhuber force-pushed the init-new-experiment-query-runner branch from f89e988 to a2e2e27 Compare February 11, 2025 20:20

github-actions bot and others added 2 commits February 11, 2025 20:33

Update query snapshots

3744b59

Add a basic funnel test

1cadac1

danielbachhuber mentioned this pull request Feb 11, 2025

feat(experiments): Add a basic funnels test #28520

Closed

andehen and others added 3 commits February 12, 2025 12:20

feat(experiments): WIP new experiment metric form (#28550)

0ddd2cb

Merge branch 'master' into init-new-experiment-query-runner

f93f6d5

Update UI snapshots for chromium (2)

88b23a6

danielbachhuber reviewed Feb 12, 2025

View reviewed changes

frontend/src/scenes/experiments/ExperimentView/ExperimentView.tsx Outdated Show resolved Hide resolved

danielbachhuber mentioned this pull request Feb 12, 2025

feat(experiments): Experiment date and event property filters #28600

Merged

danielbachhuber and others added 3 commits February 13, 2025 15:42

fix: Fix self-capture initialization when multiple users (#28700)

605efcb

Merge branch 'master' into init-new-experiment-query-runner

1e0ef1b

Update UI snapshots for chromium (2)

cf31c7e

andehen marked this pull request as ready for review February 14, 2025 08:56

greptile-apps bot reviewed Feb 14, 2025

View reviewed changes

Extend correct query response model

365a01b

PostHog deleted a comment from greptile-apps bot Feb 14, 2025

github-actions bot and others added 5 commits February 14, 2025 09:45

Update query snapshots

08146a2

Compare with type instead of string

39f3e0d

update metric form name and data-attr

2ba2f9b

compare with actual type instead of string

38c3bc4

Update UI snapshots for chromium (2)

9da5b58

andehen and others added 3 commits February 14, 2025 12:40

use types instead of strings for node kinds

8846fb5

Revert "compare with actual type instead of string"

82d91e9

This reverts commit 38c3bc4.

Update UI snapshots for chromium (2)

a480c56

andehen added 4 commits February 14, 2025 13:30

use hogql placeholders

6cea91f

use hogql compare operation instead of parse_expr

0ca8a81

use hogql compare operation for feature flag key check

8518db6

improve query comments and aliases

5d40435

danielbachhuber approved these changes Feb 14, 2025

View reviewed changes

jurajmajerik approved these changes Feb 14, 2025

View reviewed changes

andehen and others added 3 commits February 14, 2025 15:59

Merge branch 'master' into init-new-experiment-query-runner

e8def4b

Update query snapshots

82ad651

Update UI snapshots for chromium (1)

38cfae5

andehen and others added 2 commits February 14, 2025 16:58

Merge branch 'master' into init-new-experiment-query-runner

65fe467

Update query snapshots

2b01c43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(experiments): Init new experiment query runner #28347

feat(experiments): Init new experiment query runner #28347

andehen commented Feb 5, 2025 •

edited

Loading

jurajmajerik left a comment

jurajmajerik Feb 6, 2025

jurajmajerik Feb 6, 2025

andehen Feb 6, 2025

danielbachhuber Feb 6, 2025

andehen Feb 7, 2025

jurajmajerik Feb 6, 2025

jurajmajerik Feb 6, 2025

jurajmajerik Feb 6, 2025

jurajmajerik Feb 6, 2025

danielbachhuber Feb 6, 2025

andehen commented Feb 6, 2025

posthog-bot commented Feb 12, 2025

posthog-bot commented Feb 14, 2025

greptile-apps bot left a comment

greptile-apps bot Feb 14, 2025

greptile-apps bot Feb 14, 2025

greptile-apps bot Feb 14, 2025

greptile-apps bot Feb 14, 2025

greptile-apps bot Feb 14, 2025

greptile-apps bot Feb 14, 2025

greptile-apps bot Feb 14, 2025

posthog-bot commented Feb 14, 2025

posthog-bot commented Feb 14, 2025

danielbachhuber left a comment

danielbachhuber Feb 14, 2025

andehen Feb 14, 2025

danielbachhuber Feb 14, 2025

andehen Feb 14, 2025

jurajmajerik left a comment

posthog-bot commented Feb 14, 2025

		@@ -7196,6 +7196,21 @@ class ExperimentTrendsQuery(BaseModel):
		response: Optional[ExperimentTrendsQueryResponse] = None


		class ExperimentQuery(BaseModel):

		self.experiment = Experiment.objects.get(id=self.query.experiment_id)
		self.feature_flag = self.experiment.feature_flag

		elif end_date is not None:
		end_date = timezone.make_aware(end_date) # Make naive datetime timezone-aware

		@@ -1331,13 +1363,22 @@ export const experimentLogic = kea<experimentLogicType>([
		getMetricType: [

feat(experiments): Init new experiment query runner #28347

Are you sure you want to change the base?

feat(experiments): Init new experiment query runner #28347

Conversation

andehen commented Feb 5, 2025 • edited Loading

Problem

Changes

How did you test this code?

jurajmajerik left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

andehen commented Feb 6, 2025

posthog-bot commented Feb 12, 2025

📸 UI snapshots have been updated

posthog-bot commented Feb 14, 2025

📸 UI snapshots have been updated

greptile-apps bot left a comment

Choose a reason for hiding this comment

PR Summary

greptile-apps bot Feb 14, 2025

Choose a reason for hiding this comment

greptile-apps bot Feb 14, 2025

Choose a reason for hiding this comment

greptile-apps bot Feb 14, 2025

Choose a reason for hiding this comment

greptile-apps bot Feb 14, 2025

Choose a reason for hiding this comment

greptile-apps bot Feb 14, 2025

Choose a reason for hiding this comment

greptile-apps bot Feb 14, 2025

Choose a reason for hiding this comment

greptile-apps bot Feb 14, 2025

Choose a reason for hiding this comment

posthog-bot commented Feb 14, 2025

📸 UI snapshots have been updated

posthog-bot commented Feb 14, 2025

📸 UI snapshots have been updated

danielbachhuber left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jurajmajerik left a comment

Choose a reason for hiding this comment

posthog-bot commented Feb 14, 2025

📸 UI snapshots have been updated

andehen commented Feb 5, 2025 •

edited

Loading