-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Setting the IndexingPolicy #19540
Setting the IndexingPolicy #19540
Conversation
Thank you for your contribution amarathavale! We will review the pull request and get back to you soon. |
Note: This PR can/should be merged before #19515 for a couple of reasons: this change is more urgent, and I can handle the merge conflict in the other one after this is committed. |
.../src/main/java/com/azure/cosmos/benchmark/linkedin/data/InvitationsCollectionAttributes.java
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. please wait for @simplynaveen20 review and sign off
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Question - "Refactored the existing implementation to support additional 'entities'. Like Invitations, entities such as UserGeneratedContent (where we use composite indexes) have a different document structure." -> Are we creating another set up documents for this entities along with invitation. So if we load 100 document , in BE it will be 100(invitation)+100(entities)
...smos-benchmark/src/main/java/com/azure/cosmos/benchmark/linkedin/DataGenerationIterator.java
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for memory issue fix . LGTM
As part of this change, the indexingPolicy is set to use the same strategy that we use for our production workloads: Index only the id and partitioningKey field for this collection (there is another use-case where we will validate the composite indexes; but in another change)
Validated this change by running the java cli command, and you can see the indexingPolicy:

Additional changes in here:
Validated these 2 items by running it locally, and verifying the NotFound count is 0.
-- Meters ----------------------------------------------------------------------
GET ctlWorkloadInvitations Document NotFound Operations
count = 0
...
GET ctlWorkloadInvitations Successful Operations
count = 132975
mean rate = 237.10 events/second
GET ctlWorkloadInvitations Unsuccessful Operations
count = 0
...
-- Timers ----------------------------------------------------------------------
GET ctlWorkloadInvitations Latency
count = 132975
...