Added the test scripts for resumption #2117

shubham-yb · 2024-12-25T16:18:52Z

Describe the changes in this pull request

Added the test framework for resumption tests for import data file and offline import data
Added the test cases of large sized table and large number of tables for import data file
Added the test case for PG offline import data resumption with datatypes, indexes, partitions, case sensitivity / reserved words, multiple schemas

Describe if there are any user-facing changes

N/A

How was this pull request tested?

Made the changes to the Jenkins pipeline as well.

CLAassistant · 2024-12-25T16:18:58Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

shubham-yb · 2025-01-07T03:06:16Z

Does your PR have changes that can cause upgrade issues?

Component	Breaking changes?
MetaDB	No
Name registry json	No
Data File Descriptor Json	No
Export Snapshot Status Json	No
Import Data State	No
Export Status Json	No
Data .sql files of tables	No
Export and import data queue	No
Schema Dump	No
AssessmentDB	No
Sizing DB	No
Migration Assessment Report Json	No
Callhome Json	No
YugabyteD Tables	No
TargetDB Metadata Tables	No

shubham-yb · 2025-01-07T07:15:09Z

https://jenkins.dev.yugabyte.com/job/users/job/yb-voyager-testing/job/yb-voyager-testing-pipeline-test/1084/

makalaaneesh · 2025-01-07T10:11:33Z

migtests/scripts/resumption.py

+    """
+    Runs the yb-voyager command with support for resumption testing.
+    """
+    for attempt in range(1, resumption['max_restarts'] + 1):


let's get/define all the configs in the beginning. It will make it easier to understand what all configuration options are involved.

max_restarts = resumption['max_restarts'] min_interrupt_seconds = resumption['min_interrupt_seconds'] ...

makalaaneesh · 2025-01-07T10:13:21Z

migtests/scripts/resumption.py

+                    if not output:  # Exit if output is empty (end of process output)
+                        break
+                    full_output += output
+                if time.time() - start_time > 5:


why break ? what is 5? seconds? minutes?

This was written so that we get the output in realtime. Turning back it isn't much helpful in case of automation. Have changed this implementation.

makalaaneesh · 2025-01-07T10:14:48Z

migtests/scripts/resumption.py

+    # Final import retry logic
+    print("\n--- Final attempt to complete the import ---")
+
+    for _ in range(2): 


Why 2 attempts finally?

The two attempts and the sleep were to avoid any intermittent issues or system overload. Have removed it.

makalaaneesh · 2025-01-07T10:16:08Z

migtests/scripts/resumption.py

+        try:
+            print("\nVoyager command output:")
+
+            process = subprocess.Popen(


nit: separate function for starting command (can be called in above for-loop as well)

makalaaneesh · 2025-01-07T10:16:53Z

migtests/scripts/resumption.py

+            )
+
+            # Capture and print output
+            for line in iter(process.stdout.readline, ''):


in the above for-loop, we're reading both stderr and stdout, here we're only reading stdout. Any particular reason? Would be good to be consistent here (call a common function that captures stdout/stderr)

Also till when will you keep reading? How long will the loop run?

The loop was to be run till the command exits and print the output in realtime.
Implemented a separate function for running the command and capturing stdout and stderr.

makalaaneesh · 2025-01-07T10:18:06Z

migtests/scripts/resumption.py

+            for line in iter(process.stderr.readline, ''):
+                print(line.strip())
+                sys.stdout.flush()
+            time.sleep(30)


It was added to avoid intermittent failures / system overload. Have removed it.

makalaaneesh · 2025-01-07T10:18:32Z

migtests/scripts/resumption.py

+        print("Final import failed after 2 attempts.")
+        sys.exit(1)
+
+def validate_row_counts(row_count, export_dir):


note for future: you can create a common python file that has such helper
functions.

makalaaneesh · 2025-01-07T10:21:29Z

migtests/tests/pg/partitions/snapshot.sh

@@ -0,0 +1,133 @@
+#!/bin/bash


Assuming that the ONLY change here is that you're specifying ROW_COUNT and essentially making generate_series dynamic.

Yes correct

makalaaneesh · 2025-01-07T10:26:59Z

migtests/tests/resumption/pg/resumption/config.yaml

+  schema2.Case_Sensitive_Table: 5000000
+  schema2.case: 5000000
+  schema2.Table: 5000000
+  public.boston: 2500000


where is the code that generates data for all these other tables boston/cust/emp/etc? I only see code for table/case/Case_Sensitive_Table

Those are being done via the partitions test schema / data

makalaaneesh

LGTM with a minor comment

makalaaneesh · 2025-01-22T07:46:22Z

migtests/scripts/resumption.py

+                    try:
+                        process.terminate()
+                        process.wait(timeout=10)
+                    except subprocess.TimeoutExpired:


let's put some logs/prints to determine if we terminated the process or killed it

shubham-yb · 2025-01-22T08:32:54Z

https://jenkins.dev.yugabyte.com/job/users/job/yb-voyager-testing/job/resumption-test/20/

Added the test scripts for resumption

045ad30

shubham-yb added 5 commits December 26, 2024 12:30

Updated the large count tables test

78209bf

Renamed test in GH Actions

0485f4c

Merge branch 'main' into shubham/resumption

1b2ee52

Test: Only run GH integration tests

8fbb84f

Added AWS region to large table test

0e5bcd0

shubham-yb marked this pull request as ready for review December 27, 2024 11:45

Merge branch 'main' into shubham/resumption

986a51e

shubham-yb requested review from makalaaneesh, sanyamsinghal, ShivanshGahlot and priyanshi-yb December 27, 2024 11:47

shubham-yb added 9 commits December 27, 2024 12:16

Cleanup

d5b7aaa

Reduced time between each retry for the large table test

e8bd070

Merge branch 'main' into shubham/resumption

18a38ab

Merge branch 'main' into shubham/resumption

61c9b1d

Added import data resumption test framework and PG test case

d72c841

Increased the table sizes for the PG test

b38a7f5

Increased the table sizes for the PG test

a14fa6b

Added conditional check while dropping the database

20c2715

Row count optimsation and cleanup

9a6cf4c

makalaaneesh reviewed Jan 7, 2025

View reviewed changes

shubham-yb added 5 commits January 7, 2025 12:26

Merge branch 'main' into shubham/resumption

b42f922

Addressed review comments

2d9749d

Added better error handling

6751621

Merge branch 'main' into shubham/resumption

c19acf9

Test

b5f7c9e

shubham-yb added 5 commits January 20, 2025 11:57

Test

ec1f782

Merge branch 'main' into shubham/resumption

5e00018

Test fix for deadlock issue

fd8c456

Cleanup and misc changes

288ab0f

Cleanup

3b9e19a

makalaaneesh approved these changes Jan 22, 2025

View reviewed changes

Added prints to determine if the process was terminated or killed

5d609e0

shubham-yb merged commit a6dd701 into main Jan 22, 2025
66 of 67 checks passed

shubham-yb deleted the shubham/resumption branch January 22, 2025 09:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added the test scripts for resumption #2117

Added the test scripts for resumption #2117

shubham-yb commented Dec 25, 2024 •

edited

Loading

CLAassistant commented Dec 25, 2024

shubham-yb commented Jan 7, 2025

shubham-yb commented Jan 7, 2025

makalaaneesh Jan 7, 2025

makalaaneesh Jan 7, 2025

shubham-yb Jan 20, 2025

makalaaneesh Jan 7, 2025

shubham-yb Jan 20, 2025

makalaaneesh Jan 7, 2025

makalaaneesh Jan 7, 2025

makalaaneesh Jan 7, 2025

shubham-yb Jan 20, 2025

makalaaneesh Jan 7, 2025

shubham-yb Jan 20, 2025

makalaaneesh Jan 7, 2025

makalaaneesh Jan 7, 2025

shubham-yb Jan 16, 2025

makalaaneesh Jan 7, 2025 •

edited

Loading

shubham-yb Jan 16, 2025

makalaaneesh left a comment

makalaaneesh Jan 22, 2025

shubham-yb commented Jan 22, 2025

Added the test scripts for resumption #2117

Added the test scripts for resumption #2117

Conversation

shubham-yb commented Dec 25, 2024 • edited Loading

Describe the changes in this pull request

Describe if there are any user-facing changes

How was this pull request tested?

CLAassistant commented Dec 25, 2024

shubham-yb commented Jan 7, 2025

Does your PR have changes that can cause upgrade issues?

shubham-yb commented Jan 7, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

makalaaneesh Jan 7, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

makalaaneesh left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

shubham-yb commented Jan 22, 2025

shubham-yb commented Dec 25, 2024 •

edited

Loading

makalaaneesh Jan 7, 2025 •

edited

Loading