[App Performance] Profile the time it takes for the app to load the configs and patient data from the local DB. #2066

dubdabasoduba · 2023-02-20T21:53:33Z

pld · 2023-02-21T23:51:13Z

How do we close this? How about through a new test that fails if this request takes longer than some very lenient amount of time

dubdabasoduba · 2023-02-22T02:01:04Z

How do we close this? How about through a new test that fails if this request takes longer than some very lenient amount of time

Makes sense. I think we add that. We also want to record the times and device resources and then use that as a baseline in order to validate improvements made to the code.

pld · 2023-02-22T02:02:54Z

Cool nb if we need to add example FHIR resources into this repo to support content for testing, I think that is appropriate

ndegwamartin · 2023-03-14T09:16:51Z

@pld @dubdabasoduba is this ticket about profiling to get the current metrics or about writing the integration test such that it passes or fails if it exceeds some period x?

pld · 2023-03-14T13:29:40Z

I think it should be about the latter, which would also mean we have to do the former, at least for a specific code path, does that make sense?

ekigamba · 2023-03-17T08:28:56Z

Loading times as tested on the Blu G60 with 3 GB RAM, 64 GB ROM, Octa-core 1.6 GHz and running on Android 9

The family register page currently takes at least 60 seconds and a maximum of 3 minutes to show 25 family items.
Each register page loads 25 items.

Time taken to load the register

Register Page	Time	Total count on the app
Household	60 secs to 3 mins	60
Children	12 sec	219
Sick child	8 secs	133
ANC	5 secs	114
PNC	6 secs	31
FP	5 secs	149
HIV	5 secs	25

Time taken to load profile pages

Profile Page	Time
Household	12 - 17 secs
Children	10 sec
Sick child	8 secs
ANC	8 secs

Optimizations tried/made on the household page

Reduce calls to loadRegister. Calls to loadRegister was happening ~3 times instead of once whenever the app was openned. The method RegisterViewModel.retrieveRegisterUiState() seems to be calling paginateRegisterData() multiple times
Use count queries instead of loading the related resources that needed to be counted. The task fetch queries were the slowest because there are 8K tasks and we need to fetch tasks for each family member. These queries each took around 100 ms
Write raw queries instead of using the FHIR Engine search API. I had to add a method inside FhirEngine to allow raw queries and return a raw Cursor
Rewrite the register queries to cache values such as family member resourceUuid and familyUuid which is the unique identifier on the DB
Create a single normalized table to store the register data only

Time taken after improvements (household register only)

Time taken after database query optimizations ~ 7 seconds
Time taken after using a single RegisterFamilies table ~ 300 ms where the optimized queries update the table in the background taking 7 - 9 seconds. This however completely breaks configurability and would need more time na complexity to work for all the register pages and all registers

Table to store the RegisterFamilies and their data

CREATE TABLE IF NOT EXISTS "RegisterFamilies" (
	"resourceUuid"	BLOB UNIQUE,
	"lastUpdated"	INTEGER,
	"childCount" INTEGER,
	"taskCount" INTEGER,
	"taskStatus" TEXT,
	"pregnantWomenCount" INTEGER,
	"familyName" TEXT,
	"householdNo" TEXT,
	"householdLocation" TEXT,
	PRIMARY KEY("resourceUuid")
);

INDEX TO SPEED UP SORT BY lastUpdated

CREATE INDEX "index_RegisterFamilies_lastUpdated" ON "RegisterFamilies" (
	"lastUpdated"
);

QUERY TO UPDATE THE lastUpdated in RegisterFamilies

INSERT INTO RegisterFamilies (resourceUuid, lastUpdated)
SELECT a.resourceUuid, c.index_from
FROM ResourceEntity a
LEFT JOIN DateIndexEntity b
ON a.resourceType = b.resourceType AND a.resourceUuid = b.resourceUuid AND b.index_name = "_lastUpdated"
LEFT JOIN DateTimeIndexEntity c
ON a.resourceType = c.resourceType AND a.resourceUuid = c.resourceUuid AND c.index_name = "_lastUpdated"
WHERE a.resourceType = "Group"
AND a.resourceUuid IN (
SELECT resourceUuid FROM TokenIndexEntity
WHERE resourceType = "Group" AND index_name = "type" AND (index_value = "person" AND (index_system = "http://hl7.org/fhir/group-type"))
)
AND a.resourceUuid IN (
SELECT resourceUuid FROM TokenIndexEntity
WHERE resourceType = "Group" AND index_name = "code" AND (index_value = "35359004" AND IFNULL(index_system,'') = "https://www.snomed.org")
)
ORDER BY b.index_from DESC, c.index_from DESC

======

SELECT * FROM RegisterFamilies ORDER BY lastUpdated DESC LIMIT 25 OFFSET 0

=======

QUERY TO GET Family Member resourceUuid's

SELECT resourceUuid FROM ResourceEntity WHERE resourceType = "Patient" AND resourceId IN (
SELECT SUBSTR(index_value, 9) FROM ReferenceIndexEntity WHERE index_name = "member" 
AND resourceUuid = (SELECT resourceUuid FROM ResourceEntity WHERE resourceId = ?)
)

Query to get Family resourceUuid

SELECT resourceUuid FROM ResourceEntity WHERE resourceId = ?

QUERY TO GET THE TASK COUNT FOR A PATIENT

SELECT COUNT(*) FROM TokenIndexEntity WHERE resourceType = "Task" AND index_name = "status" 
AND resourceUuid IN (
SELECT resourceUuid FROM ReferenceIndexEntity WHERE resourceType = "Task" AND index_name = "subject" 
AND index_value IN (SELECT index_value FROM ReferenceIndexEntity WHERE resourceUuid = x'$groupUUID' AND index_name = "member")
) 
AND (index_value = "failed" OR index_value = "completed" OR index_value = "cancelled")

QUERY TO GET THE CHILD COUNT IN A FAMILY

SELECT COUNT(*) FROM TokenIndexEntity a JOIN DateIndexEntity b ON a.resourceUuid = b.resourceUuid  
WHERE a.resourceUuid IN ($memberSelector) AND a.index_name = "active" AND a.index_value = "true" 
AND b.index_name = "birthdate" AND b.index_from >= ?

QUERY TO GET THE NUMBER OF PREGNANT WOMEN IN A FAMILY

SELECT COUNT(*) FROM TokenIndexEntity WHERE resourceType = "Condition" 
AND index_name = "code" AND index_system = "http://snomed.info/sct" AND index_value = "77386006" 
AND resourceUuid IN (SELECT resourceUuid FROM ReferenceIndexEntity WHERE resourceType = "Condition" 
AND index_name = "subject" AND index_value IN (SELECT index_value FROM ReferenceIndexEntity WHERE index_name = "member" AND resourceUuid = x'$groupUUID') )

=== SPEEDUP FOR FETCH MEMBERS QUERY ===

CREATE INDEX `index_ResourceEntity_resourceId` ON `ResourceEntity` (`resourceId`)

ekigamba · 2023-03-17T08:42:16Z

On measuring performance on CI, we have a number of options that we can evaluate

Android Macro and micro-benchmarks that allows us to write macro and micro benchmarks that run on devices. We can then evaluate these results in a different script and fail the CI or provide the results in the PR. Here are the links https://developer.android.com/topic/performance/benchmarking/macrobenchmark-metrics, https://circleci.com/blog/benchmarking-android/
Hugo which allows us to log method execution times during runtime can be modified to provide the results and possibly evaluate these results or provide the results in the PR. It only requires adding an annotation

We can get more ideas on this before commiting to an idea

ekigamba · 2023-03-27T12:42:39Z

With the optimisations done by the team @ellykits and @ndegwamartin , performance has improved by 50%

Time taken to load the register

Register Page	Before	After
Household	60 secs to 3 mins	24 secs
Children	12 sec	4 sec
Sick child	8 secs	3 sec
ANC	5 secs	3 secs
PNC	6 secs	3 secs
FP	5 secs	3 secs
HIV	5 secs	1 secs

These times were taken with the device wifi turned off meaning background syncing jobs were not running. This also used the previous ECBIS preview instance data used during the first benchmark

pld · 2023-03-27T13:48:17Z

@ekigamba is this from the latest RC? what branch/PR are these results based on?

ekigamba · 2023-03-27T13:59:34Z

@pld This is based on the latest ~~master~~ main. The performance numbers with FHIR staging data are even better, 5 - 11 seconds for the household register

ekigamba · 2023-05-04T13:19:33Z

Results after revInclude integration

Looks good for the profile pages, general improvement.

Household profile ~2 sec
Child profile = ~3 sec,
sick child profile = ~ 3 sec

No change in the performance of the household, children and sick child register. Respectively times are 24 sec, 3 sec and 3 sec.

The bug fixes done yesterday by @kitoto did fix the performance degradation on child and sick child registers

ekigamba · 2023-05-09T09:30:19Z

Results after forward include integration into FHIR Core

Household register = ~20 secs
Child register = ~2.5 secs
Sick child register = ~ 2.5 secs

Household profile = ~ 2 secs
Child profile = ~ 2 secs
sick child profile = ~ 3 secs

cc @ellykits @ndegwamartin

jingtang10 · 2023-05-12T13:11:11Z

@ekigamba any more insight into why household register is slow? how can we help now that we have tried rev and forward include?

aditya's gonna merge those two changes soon.

ellykits · 2023-05-12T15:19:04Z

@jingtang10 We are actively working on improving the household register performance. We identified a few issues with the code used to fetch the register data that we are optimizing leveraging the recent functionalities from search APIs.

Aggregating all counts for household members retrieved via forward include. At the moment we run count for each member then sum all in code.
Retrieve all household members accompanying resources all at once, we'd been handling this recursively for each member before forward include implementation.

We'll share updated stats after the above issues are addressed.

Aditya mentioned:

"A plan to refactor the api's to be just search and have include and revInclude just like has in the current search api.". Current implementation requires different calls for forward and reverse include. (Internally from our last discussion both functionalities run two queries one to fetch the primary resources and then another query to retrieve the accompanying resources)
Parsing JSON to the corresponding FHIR representation Java model takes longer during processing of the query results, any details on this?

Do we have any plans to support filters on accompanying resources via rev/forward to reduce the amount of data to process? I understand this may not be available on the server _include/ _revinclude which the inspiration for the client implementation.

jingtang10 · 2023-05-13T04:05:03Z

Thanks Elly. I'm identifying 3 asks from your update: 1) implement filter in forward and reverse included 2) implement basic aggregate functions in search 3) optimise parsing For 1) you're right, this is not included in the fhir spec. But as I have said, our search API is inspired by the fhir restful search API, but does not have to follow it to the letter (and we don't). The reason is that our search is on-device and not a client-server search and has different requirements and constraints. So in principle I think we can definitely provide filter in the forward and rev include. Aditya and I discussed this briefly and please share your exact query and create an issue so we can design this in detail. For 2) this again is a reasonable request :) We will need to take a look after the search result refactoring following Aditya's rev include pr. This is because the aggregate result need to be included in a search result wrapper. I discussed this with Bashir as well and will pull him in where necessary. 3) possibly quite hard to actually improve the performance of the actual parsing. But I wonder if we can parse lazily as an optimisation. As the moment we parse everything eagerly as soon as they're returned from the DB. I've not tried myself but I'd like to explore.

…

On Sat, 13 May 2023, 00:19 Elly Kitoto, ***@***.***> wrote: @jingtang10 <https://github.com/jingtang10> We are actively working on improving the household register performance. We identified a few issues with the code used to fetch the register data that we are optimizing leveraging the recent functionalities from search APIs. 1. Aggregating all counts for household members retrieved via forward include. At the moment we run count for each member then sum all in code. 2. Retrieve all household members accompanying resources all at once, we'd been handling this recursively for each member before forward include implementation. We'll share updated stats after the above issues are addressed. Aditya mentioned: 1. "A plan to refactor the api's to be just search and have include and revInclude just like has in the current search api.". Current implementation requires different calls for forward and reverse include. (Internally from our last discussion both functionalities run two queries one to fetch the primary resources and then another query to retrieve the accompanying resources) 2. Parsing resources taking longer any details on this do we expect any improvements in future? Do we have any plans to support filters on accompanying resources via rev/forward to reduce the amount of data to process? I understand this may not be available on the server _include/ _revinclude which the inspiration for the client implementation. — Reply to this email directly, view it on GitHub <#2066 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AG3M2XG3DPVS53OAYLIJ5VDXFZIHFANCNFSM6AAAAAAVCKEEUU> . You are receiving this because you were mentioned.Message ID: ***@***.***>

ellykits · 2023-05-16T11:26:22Z

Thanks for your response @jingtang10

In agreement with you for the asks (priority in the order they are listed).

Based on Aditya's finding (which I confirmed) on this Add index to DateTimeIndexEntity table index_from column google/android-fhir#1964 about the sort functionality not using the _lastUpdated index from the DateTimeIndexEntity table. Is it possible to investigate why the index is not used, could be related to the query generation. The query I shared on Aditya's PR was generated by the SDK:
See section 1.2. Temporary Sorting B-Trees
Another ask, possiblity to limit the amount of data extracted from the resources esp. the ones that are neither used in search nor sorting. E.g: Patient.address-use is something we do not use in our queries but it is extracted into the TokenIndexEntity table anyways. Our demo server has over 219K entries in that table alone (of course with data extracted from other resource types)

pld · 2023-07-07T15:18:08Z

@ekigamba let's a add a perf test to close this out with

pld · 2023-07-17T12:27:22Z

thinking same pattern here, which looks in line w/the draft pr, #2067 (comment)

f-odhiambo · 2023-07-24T14:18:53Z

@ekigamba Whats the ETA for this ticket ?

ekigamba · 2023-07-24T14:36:04Z

The remaining fixes might take 1 - 2 FTE days. I'll create issues any unresolved and potential issues originating from this ticket

ekigamba · 2023-07-25T22:50:36Z

Summary of performance testing implementation

Tools used

Jetpack microbenchmark - it uses best practices for profiling performance such as getting the median of multiple iterations and using metrics only after performance has stabiliised. It also provides trace files of the tested methods. It can be configured to run at optimal
Sample DB containing a lot of data. This allows us to avoid syncing and logging in on the emulator reducing the time it takes to profile the
Gradle task to evaluate the results

Possible issues

Flakiness due to change in Github CI runners and infrastructure. This might cause random improvements or degradations in performance. The changes that can cause this include increased hardware resources or assigned resources for the runner.
Flakiness due to change in emulator configuration. Any changes in the emulator configuration might cause improvements or degradations in performance. Jetpack benchmark library is able to stabilise performance by applying tricks such as lock clocks to lock CPU clock speeds, apply activities to prevent interaction with other apps, run no-work threads to stabilise results for multi-core metrics
Unrealistic results. Given that the Github CI infrastructure is faster than a Blu G60 phone, the metrics are not indicative of how much time it takes on a Blue G60 phone but they allow for us to easily detect degradations or improvements in performance. The threshold is 50% of the current metrics and this can be easily changed in the margin value

Possible improvements

Reduce the hardware configuration for the emulator to fewer cores
Use an emulator running on docker or kubernetes with fewer compute units. This allows us to closely emulate performance of the field devices such as the G60. Possible use on the Hetzner server
Run on rooted emulator/device to allow for full configuration by the library to get more stable results
Post the results on the PR for easier tracking of performance
Run the tests manually on the Blu G60 device during QA before every release
Update the benchmark library to allow running on > API 28 devices. This will require increasing the compile SDK version and related changes. This will also allow running using the official Benchmark runner which configures the emulator/device for more stable performance

...

pld · 2023-07-26T08:29:09Z

Cool, so I am 100% fine w/all of those Possible issues, those are all expected given the infrastructure we're using. From the Possible improvements if we choose to do any of these they can come later, (5.) seems reasonable to request, (4.) maybe.

I guess my question is, when you say flakey, how flakey, like are you seeing 10x changes between runs? I think what we can test for is that it's staying within the expected order of magnitude, like if you're seeing 2-5x changes, we can fail the test if we see a 10x change, if you're seeing 20-50x changes, we can fail the test if we see a 100x change -- we're only trying to put a wide envelope on it for now, in follow-up we'll tighten it

ekigamba · 2023-07-26T12:46:20Z

I'd think that I wouldn't expect it to be flaky initially given that the tests are isolated and emulator barely has any other interaction when the tests are running, but any changes in infrastructure that we might not know off will become noticeable. I have set to limit to ~1.5x based on this assumption and we can change this if the assumption is wrong.

The library manages stability of results by running the tests until the results are consistent and then it uses the results from the stable iterations

dubdabasoduba added Discussion This is an open discussion that may or may not lead to actionable points Performance (App or Server) labels Feb 20, 2023

ekigamba assigned ekigamba and unassigned ekigamba Mar 16, 2023

jingtang10 mentioned this issue Apr 3, 2023

Use custom SQL query in FHIR Engine google/android-fhir#1950

Closed

ellykits mentioned this issue Jun 21, 2023

Search method takes more time when load 7k household list google/android-fhir#2040

Closed

pld assigned ekigamba Jul 7, 2023

pld added this to the Sprint 14 (4th July - 17th July) milestone Jul 7, 2023

f-odhiambo modified the milestones: Sprint 14 (4th July - 17th July), Sprint 15 (18th July - 31 July) Jul 10, 2023

pld mentioned this issue Jul 17, 2023

Performance tests for loading register data #2592

Merged

11 tasks

f-odhiambo closed this as completed Jul 24, 2023

f-odhiambo reopened this Jul 24, 2023

pld modified the milestones: Sprint 15 (18th July - 31 July), Sprint 16 (1st Aug - 14th Aug) Jul 31, 2023

pld closed this as completed in #2592 Aug 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[App Performance] Profile the time it takes for the app to load the configs and patient data from the local DB. #2066

[App Performance] Profile the time it takes for the app to load the configs and patient data from the local DB. #2066

dubdabasoduba commented Feb 20, 2023 •

edited

Loading

pld commented Feb 21, 2023

dubdabasoduba commented Feb 22, 2023 •

edited

Loading

pld commented Feb 22, 2023

ndegwamartin commented Mar 14, 2023

pld commented Mar 14, 2023

ekigamba commented Mar 17, 2023 •

edited

Loading

ekigamba commented Mar 17, 2023 •

edited

Loading

ekigamba commented Mar 27, 2023

pld commented Mar 27, 2023

ekigamba commented Mar 27, 2023 •

edited

Loading

ekigamba commented May 4, 2023

ekigamba commented May 9, 2023

jingtang10 commented May 12, 2023

ellykits commented May 12, 2023 •

edited

Loading

jingtang10 commented May 13, 2023 via email

ellykits commented May 16, 2023

pld commented Jul 7, 2023

pld commented Jul 17, 2023

f-odhiambo commented Jul 24, 2023

ekigamba commented Jul 24, 2023

ekigamba commented Jul 25, 2023 •

edited

Loading

pld commented Jul 26, 2023

ekigamba commented Jul 26, 2023 •

edited

Loading

[App Performance] Profile the time it takes for the app to load the configs and patient data from the local DB. #2066

[App Performance] Profile the time it takes for the app to load the configs and patient data from the local DB. #2066

Comments

dubdabasoduba commented Feb 20, 2023 • edited Loading

pld commented Feb 21, 2023

dubdabasoduba commented Feb 22, 2023 • edited Loading

pld commented Feb 22, 2023

ndegwamartin commented Mar 14, 2023

pld commented Mar 14, 2023

ekigamba commented Mar 17, 2023 • edited Loading

Time taken to load the register

Time taken to load profile pages

Optimizations tried/made on the household page

Time taken after improvements (household register only)

Table to store the RegisterFamilies and their data

INDEX TO SPEED UP SORT BY lastUpdated

QUERY TO UPDATE THE lastUpdated in RegisterFamilies

QUERY TO GET Family Member resourceUuid's

Query to get Family resourceUuid

QUERY TO GET THE TASK COUNT FOR A PATIENT

QUERY TO GET THE CHILD COUNT IN A FAMILY

QUERY TO GET THE NUMBER OF PREGNANT WOMEN IN A FAMILY

ekigamba commented Mar 17, 2023 • edited Loading

ekigamba commented Mar 27, 2023

Time taken to load the register

pld commented Mar 27, 2023

ekigamba commented Mar 27, 2023 • edited Loading

ekigamba commented May 4, 2023

Results after revInclude integration

ekigamba commented May 9, 2023

Results after forward include integration into FHIR Core

jingtang10 commented May 12, 2023

ellykits commented May 12, 2023 • edited Loading

jingtang10 commented May 13, 2023 via email

ellykits commented May 16, 2023

pld commented Jul 7, 2023

pld commented Jul 17, 2023

f-odhiambo commented Jul 24, 2023

ekigamba commented Jul 24, 2023

ekigamba commented Jul 25, 2023 • edited Loading

Summary of performance testing implementation

Tools used

Possible issues

Possible improvements

pld commented Jul 26, 2023

ekigamba commented Jul 26, 2023 • edited Loading

dubdabasoduba commented Feb 20, 2023 •

edited

Loading

dubdabasoduba commented Feb 22, 2023 •

edited

Loading

ekigamba commented Mar 17, 2023 •

edited

Loading

ekigamba commented Mar 17, 2023 •

edited

Loading

ekigamba commented Mar 27, 2023 •

edited

Loading

ellykits commented May 12, 2023 •

edited

Loading

ekigamba commented Jul 25, 2023 •

edited

Loading

ekigamba commented Jul 26, 2023 •

edited

Loading