[Concurrent Segment Search] Add support for query profiler with concurrent aggregation #8330

Closed
ticheng-aws opened this issue Jun 29, 2023 · 3 comments · Fixed by #9248
Labels: distributed framework, enhancement, v2.10.0

Comments

@ticheng-aws
Contributor

Placeholder issue to support the query profiler with the concurrent aggregation flow (for both global and non-global aggregations). This will be a subtask of #7354

@ticheng-aws ticheng-aws added enhancement Enhancement or improvement to existing feature or request untriaged labels Jun 29, 2023
@ticheng-aws ticheng-aws changed the title from "[Concurrent Segment Search] Add support for profile option with concurrent aggregation" to "[Concurrent Segment Search] Add support for query profiler with concurrent aggregation" Jun 29, 2023
@ticheng-aws
Contributor Author

ticheng-aws commented Jul 12, 2023

Background

The search profile query section includes detailed timing information about the execution of the query tree by Lucene on a particular shard. The breakdown component further provides detailed statistics on the low-level execution performed by Lucene. Existing types of breakdown for query profile timing/count include:

  1. create_weight: This property measures the elapsed time, in nanoseconds, for generating a Weight object, which serves as a temporary context object to store state associated with the (IndexSearcher, Query) tuple.
  2. build_scorer: This property measures the elapsed time, in nanoseconds, for constructing a Scorer object for the query. It records the time taken to generate the Scorer, rather than the actual scoring of documents.
  3. next_doc: This property measures the elapsed time, in nanoseconds, for determining the next matching document, which can vary significantly depending on the complexity of the query.
  4. advance: This property measures the elapsed time, in nanoseconds, for finding the next matching document; it is a lower-level operation underlying the next_doc property.
  5. match: This property measures the elapsed time, in nanoseconds, for matching documents in a two-phase process.
  6. score: This property measures the elapsed time, in nanoseconds, for scoring a specific document using its Scorer.
  7. shallow_advance: This property measures the elapsed time, in nanoseconds, for executing the advanceShallow Lucene method.
  8. compute_max_score: This property measures the elapsed time, in nanoseconds, for executing the getMaxScore Lucene method.
  9. set_min_competitive_score: This property measures the elapsed time, in nanoseconds, for executing the setMinCompetitiveScore Lucene method.

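These breakdown sections appear in the profile portion of a search response when the request sets the profile flag. A minimal request sketch (the index name and query are illustrative):

GET /my-index/_search
{
    "profile": true,
    "query": {
        "match_all": {}
    }
}
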
Currently, in the concurrent search model, the query portion of the profile response, produced by the ConcurrentQueryProfileBreakdown class introduced in #1673, displays the cumulative summary timing of the query tree.

@Override
public Map<String, Long> toBreakdownMap() {
    // Start with the breakdown of this query-level context.
    final Map<String, Long> map = new HashMap<>(buildBreakdownMap(this));

    // contexts is keyed by LeafReaderContext (one entry per segment), so this
    // merge sums each timing/count across all segments of the shard.
    for (final AbstractProfileBreakdown<QueryTimingType> context : contexts.values()) {
        for (final Map.Entry<String, Long> entry : buildBreakdownMap(context).entrySet()) {
            map.merge(entry.getKey(), entry.getValue(), Long::sum);
        }
    }

    return map;
}

The contexts map holds the timing information for all segments. Its key is the LeafReaderContext, which is created per segment; for example, with 2 slices and 4 segments, we still get 4 context key-value pairs rather than 2. As a result, the total timing obtained by summing over all segments does not represent the slice-level timing of the query during concurrent execution.

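To illustrate with hypothetical numbers: if two slices execute concurrently and each of the 4 segments contributes roughly 100µs of build_scorer time, the merged map reports about 400µs for build_scorer, even though each slice only spent about 200µs and the two slices overlapped in wall-clock time.
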
Proposed Solution

In the concurrent search case:

  1. Introduce profiled time stats (i.e. min/max/avg) for all existing slice-level breakdown properties and time_in_nanos (see the sketch after this list).
  2. The create_weight field does not have profiled time stats because it is executed at the query level rather than the slice level.
  3. The existing slice-level breakdown time properties will reflect the total elapsed time across all segment slices (i.e. max(slice_end_time) - min(slice_start_time)).
  4. Introduce profiled count stats (i.e. min/max/avg) for all existing slice-level breakdown properties.
  5. Maintain the same profile output as today in the non-concurrent search case. For example, since there is no specific timer for the time_in_nanos field, its timing continues to be reflected as the cumulative summary of all timing types.

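A minimal sketch of how the min/max/avg slice stats might be derived, assuming a hypothetical list of per-slice breakdown maps (sliceBreakdowns) is available; this is illustrative and not the actual implementation:

// Derive min/max/avg stats for one timing type across segment slices.
// sliceBreakdowns is a hypothetical List<Map<String, Long>> with one breakdown
// map per slice; timingType is e.g. "build_scorer". Uses java.util classes only.
Map<String, Long> buildSliceStats(List<Map<String, Long>> sliceBreakdowns, String timingType) {
    LongSummaryStatistics stats = sliceBreakdowns.stream()
        .mapToLong(b -> b.getOrDefault(timingType, 0L))
        .summaryStatistics();
    Map<String, Long> result = new HashMap<>();
    result.put("max_" + timingType, stats.getMax());
    result.put("min_" + timingType, stats.getMin());
    result.put("avg_" + timingType, (long) stats.getAverage());
    return result;
}
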
Sample query response

Non-concurrent search
{
    ...
    "profile":
    {
        "shards":
        [
            {
                "id": "[aggqMTQ4QbOShmjsbcXtrQ][index][0]",
                "inbound_network_time_in_millis": 0,
                "outbound_network_time_in_millis": 0,
                "searches":
                [
                    {
                        "query":
                        [
                            {
                                "type": "MatchAllDocsQuery",
                                "description": "*:*",
                                "time_in_nanos": 382251,
                                "breakdown":
                                {
                                    "set_min_competitive_score_count": 0,
                                    "match_count": 0,
                                    "shallow_advance_count": 0,
                                    "set_min_competitive_score": 0,
                                    "next_doc": 6708,
                                    "match": 0,
                                    "next_doc_count": 11,
                                    "score_count": 7,
                                    "compute_max_score_count": 0,
                                    "compute_max_score": 0,
                                    "advance": 1875,
                                    "advance_count": 3,
                                    "score": 2959,
                                    "build_scorer_count": 6,
                                    "create_weight": 102875,
                                    "shallow_advance": 0,
                                    "create_weight_count": 1,
                                    "build_scorer": 267834
                                }
                            }
                        ],
                        "rewrite_time": 8501,
                        "collector": [...]
                    }
                ],
                "aggregations": [...]
            }
        ]
    }
}
Concurrent search with 3 segment slices
{
    ...
    "profile":
    {
        "shards":
        [
            {
                "id": "[09K0cflDSwO6kVx95Gj-eA][index][0]",
                "inbound_network_time_in_millis": 0,
                "outbound_network_time_in_millis": 0,
                "searches":
                [
                    {
                        "query":
                        [
                            {
                                "type": "MatchAllDocsQuery",
                                "description": "*:*",
                                "time_in_nanos": 271500,
                                "max_slice_time_in_nanos": 271500,
                                "min_slice_time_in_nanos": 232340,
                                "avg_slice_time_in_nanos": 258877,
                                "breakdown":
                                {
                                    "create_weight": 171556,
                                    "create_weight_count": 1,
                                    "build_scorer": 1691375,
                                    "max_build_scorer": 576136,
                                    "min_build_scorer": 551446,
                                    "avg_build_scorer": 563791,
                                    "build_scorer_count": 6,
                                    "max_build_scorer_count": 2,
                                    "min_build_scorer_count": 2,
                                    "avg_build_scorer_count": 2,
                                    "next_doc": 3834,
                                    "max_next_doc": 1334,
                                    "min_next_doc": 1222,
                                    "avg_next_doc": 1278,
                                    "next_doc_count": 8,
                                    "max_next_doc_count": 3,
                                    "min_next_doc_count": 2,
                                    "avg_next_doc_count": 3,
                                    ...
                                }
                            }
                        ],
                        "rewrite_time": 10083,
                        "collector": [...]
                    }
                ],
                "aggregations": [...]
            }
        ]
    }
}

Alternatives

1. Expand query profile stats across all slices with concurrent execution

Instead of exposing min/max/avg stats across the segment slice executions of a collector manager, we list out all the collector responses from the collector manager.

Sample query response for a concurrent search with 3 segment slices
{
    ...
    "profile":
    {
        "shards":
        [
            {
                "id": "[09K0cflDSwO6kVx95Gj-eA][index][0]",
                "inbound_network_time_in_millis": 0,
                "outbound_network_time_in_millis": 0,
                "searches":
                [
                    {
                        "query":
                        [
                            {
                                "type": "MatchAllDocsQuery",
                                "description": "*:*",
                                "time_in_nanos": 90500,
                                "breakdown":
                                {
                                    "set_min_competitive_score_count": 0,
                                    "match_count": 0,
                                    "shallow_advance_count": 0,
                                    "set_min_competitive_score": 0,
                                    "next_doc": 3834,
                                    "match": 0,
                                    "next_doc_count": 8,
                                    "score_count": 8,
                                    "compute_max_score_count": 0,
                                    "compute_max_score": 0,
                                    "advance": 2958,
                                    "advance_count": 3,
                                    "score": 5459,
                                    "build_scorer_count": 6,
                                    "create_weight": 271500,
                                    "shallow_advance": 0,
                                    "create_weight_count": 1,
                                    "build_scorer": 1691375
                                }
                            },
                            {
                                "type": "MatchAllDocsQuery",
                                "description": "*:*",
                                "time_in_nanos": 90500,
                                "breakdown":
                                {
                                    "set_min_competitive_score_count": 0,
                                    "match_count": 0,
                                    "shallow_advance_count": 0,
                                    "set_min_competitive_score": 0,
                                    "next_doc": 2334,
                                    "match": 0,
                                    "next_doc_count": 8,
                                    "score_count": 5,
                                    "compute_max_score_count": 0,
                                    "compute_max_score": 0,
                                    "advance": 1587,
                                    "advance_count": 3,
                                    "score": 5125,
                                    "build_scorer_count": 2,
                                    "create_weight": 271500,
                                    "shallow_advance": 0,
                                    "create_weight_count": 1,
                                    "build_scorer": 1691234
                                }
                            },
                            {...}
                        ],
                        "rewrite_time": 10083,
                        "collector": [...]
                    }
                ],
                "aggregations": [...]
            }
        ]
    }
}

2. Keep the current cumulative summary version of the timing breakdown

No code changes are required for this approach. However, it is important to note that we would lose the slice-level profile details.

Sample query response for a concurrent search with 3 segment slices
{
    ...
    "profile":
    {
        "shards":
        [
            {
                "id": "[OHv1ByGtTHqVf55eR4Bvsg][index][0]",
                "inbound_network_time_in_millis": 0,
                "outbound_network_time_in_millis": 0,
                "searches":
                [
                    {
                        "query":
                        [
                            {
                                "type": "MatchAllDocsQuery",
                                "description": "*:*",
                                "time_in_nanos": 129584,
                                "breakdown":
                                {
                                    "set_min_competitive_score_count": 0,
                                    "match_count": 0,
                                    "shallow_advance_count": 0,
                                    "set_min_competitive_score": 0,
                                    "next_doc": 15418,
                                    "match": 0,
                                    "next_doc_count": 26,
                                    "score_count": 16,
                                    "compute_max_score_count": 0,
                                    "compute_max_score": 0,
                                    "advance": 3042,
                                    "advance_count": 3,
                                    "score": 7876,
                                    "build_scorer_count": 6,
                                    "create_weight": 129584,
                                    "shallow_advance": 0,
                                    "create_weight_count": 1,
                                    "build_scorer": 765164
                                }
                            }
                        ],
                        "rewrite_time": 11333,
                        "collector": [...]
                    }
                ],
                "aggregations": [...]
            }
        ]
    }
}

@ticheng-aws
Contributor Author

Tagging @sohami @yigithub @jed326 @andrross @reta for feedback. Thank you.

@yigithub yigithub moved this from Todo to In Progress in Concurrent Search Jul 13, 2023
@reta
Collaborator

reta commented Jul 14, 2023

@ticheng-aws thanks for the proposal

As a result, the total timing obtained by summing all segments does not represent the slice-level detailed timing of the query during concurrent execution.

This is correct, the timing is cumulative. When I originally worked on this feature, the ideal picture I had in mind for profiling was "perceived time", not avg/min/max. What I mean by that: we know that some slices will be processed concurrently, and some will be queued (and hopefully processed concurrently later on). By "perceived time" I envisioned the time between the first chunk of slices being picked up and the last chunk of slices being completed. That is the time that consumers will see as the search execution time. The proposal by itself adds useful information; I think what is missing is the fact that there is concurrency involved (so the numbers don't add up, they rather interleave, in the general case).

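A rough sketch of this "perceived time" idea, assuming hypothetical per-slice start/end timestamps (in nanoseconds) are collected during execution; it corresponds to the max(slice_end_time) - min(slice_start_time) formulation in the proposal above:

// Wall-clock span between the first slice starting and the last slice finishing.
// sliceStartTimes and sliceEndTimes are hypothetical per-slice timestamp arrays.
long perceivedTimeNanos(long[] sliceStartTimes, long[] sliceEndTimes) {
    long earliestStart = Arrays.stream(sliceStartTimes).min().orElse(0L);
    long latestEnd = Arrays.stream(sliceEndTimes).max().orElse(0L);
    return latestEnd - earliestStart;
}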