We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SNAPSHOT-20231201
openjdk version "11.0.21" 2023-10-17
Linux
Redshift bulk loader fails when trying to write an stream to S3 (CSV) and COPY into Redshift.
COPY
2023/12/01 05:59:00 - Hop - Pipeline opened. 2023/12/01 05:59:00 - Hop - Launching pipeline [redshift_empty_table]... 2023/12/01 05:59:00 - Hop - Started the pipeline execution. 2023/12/01 05:59:00 - redshift_empty_table - Executing this pipeline using the Local Pipeline Engine with run configuration 'local' 2023/12/01 05:59:00 - redshift_empty_table - Execution started for pipeline [redshift_empty_table] 2023/12/01 05:59:03 - S3 File - Part size less than minimum of 5MB, set to minimum 2023/12/01 05:59:03 - S3 File - Part size less than minimum of 5MB, set to minimum 2023/12/01 05:59:04 - load empty_table.0 - Connected to database dwh 2023/12/01 05:59:04 - load empty_table.0 - ERROR: Unexpected error 2023/12/01 05:59:04 - load empty_table.0 - ERROR: org.apache.hop.core.exception.HopDatabaseException: 2023/12/01 05:59:04 - load empty_table.0 - Error executing COPY statements 2023/12/01 05:59:04 - load empty_table.0 - ERROR: The specified S3 prefix 'main/empty_table.csv' does not exist 2023/12/01 05:59:04 - load empty_table.0 - Detail: 2023/12/01 05:59:04 - load empty_table.0 - ----------------------------------------------- 2023/12/01 05:59:04 - load empty_table.0 - error: The specified S3 prefix 'main/empty_table.csv' does not exist 2023/12/01 05:59:04 - load empty_table.0 - code: 8001 2023/12/01 05:59:04 - load empty_table.0 - context: 2023/12/01 05:59:04 - load empty_table.0 - query: 161874653[child_sequence:1] 2023/12/01 05:59:04 - load empty_table.0 - location: s3_utility.cpp:708 2023/12/01 05:59:04 - load empty_table.0 - process: padbmaster [pid=1073963716] 2023/12/01 05:59:04 - load empty_table.0 - ----------------------------------------------- 2023/12/01 05:59:04 - load empty_table.0 - 2023/12/01 05:59:04 - load empty_table.0 - 2023/12/01 05:59:04 - load empty_table.0 - at org.apache.hop.pipeline.transforms.redshift.bulkloader.RedshiftBulkLoader.processRow(RedshiftBulkLoader.java:124) 2023/12/01 05:59:04 - load empty_table.0 - at org.apache.hop.pipeline.transform.RunThread.run(RunThread.java:55) 2023/12/01 05:59:04 - load empty_table.0 - at java.base/java.lang.Thread.run(Thread.java:829) 2023/12/01 05:59:04 - load empty_table.0 - Caused by: com.amazon.redshift.util.RedshiftException: ERROR: The specified S3 prefix 'main/empty_table.csv' does not exist 2023/12/01 05:59:04 - load empty_table.0 - Detail: 2023/12/01 05:59:04 - load empty_table.0 - ----------------------------------------------- 2023/12/01 05:59:04 - load empty_table.0 - error: The specified S3 prefix 'main/empty_table.csv' does not exist 2023/12/01 05:59:04 - load empty_table.0 - code: 8001 2023/12/01 05:59:04 - load empty_table.0 - context: 2023/12/01 05:59:04 - load empty_table.0 - query: 161874653[child_sequence:1] 2023/12/01 05:59:04 - load empty_table.0 - location: s3_utility.cpp:708 2023/12/01 05:59:04 - load empty_table.0 - process: padbmaster [pid=1073963716] 2023/12/01 05:59:04 - load empty_table.0 - ----------------------------------------------- 2023/12/01 05:59:04 - load empty_table.0 - 2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.core.v3.QueryExecutorImpl.receiveErrorResponse(QueryExecutorImpl.java:2608) 2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.core.v3.QueryExecutorImpl.processResultsOnThread(QueryExecutorImpl.java:2276) 2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:1881) 2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:1873) 2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:370) 2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.executeInternal(RedshiftStatementImpl.java:515) 2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.execute(RedshiftStatementImpl.java:436) 2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.executeWithFlags(RedshiftStatementImpl.java:377) 2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.executeCachedSql(RedshiftStatementImpl.java:363) 2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.executeWithFlags(RedshiftStatementImpl.java:340) 2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.executeUpdate(RedshiftStatementImpl.java:298) 2023/12/01 05:59:04 - load empty_table.0 - at org.apache.hop.pipeline.transforms.redshift.bulkloader.RedshiftBulkLoader.processRow(RedshiftBulkLoader.java:116) 2023/12/01 05:59:04 - load empty_table.0 - ... 2 more 2023/12/01 05:59:04 - load empty_table.0 - Finished processing (I=0, O=0, R=0, W=0, U=0, E=1) 2023/12/01 05:59:04 - redshift_empty_table - Pipeline duration : 4.006 seconds [ 4.006" ] 2023/12/01 05:59:04 - redshift_empty_table - Execution finished on a local pipeline engine with run configuration 'local' 2023/12/01 05:59:04 - redshift_empty_table - Pipeline detected one or more transforms with errors. 2023/12/01 05:59:04 - redshift_empty_table - Pipeline is killing the other transforms!
Priority: 2
Component: Database, Component: Transforms
The text was updated successfully, but these errors were encountered:
additional checks to skip writing file + copy statement on empty stre…
b710945
…am. fixes apache#3469
updated documentationUrl. apache#3469
dd055c9
Merge pull request #3470 from bamaer/3469
f59481e
additional checks to skip file + copy on empty stream #3469
bamaer
Successfully merging a pull request may close this issue.
Apache Hop version?
SNAPSHOT-20231201
Java version?
openjdk version "11.0.21" 2023-10-17
Operating system
Linux
What happened?
Redshift bulk loader fails when trying to write an stream to S3 (CSV) and
COPY
into Redshift.2023/12/01 05:59:00 - Hop - Pipeline opened.
2023/12/01 05:59:00 - Hop - Launching pipeline [redshift_empty_table]...
2023/12/01 05:59:00 - Hop - Started the pipeline execution.
2023/12/01 05:59:00 - redshift_empty_table - Executing this pipeline using the Local Pipeline Engine with run configuration 'local'
2023/12/01 05:59:00 - redshift_empty_table - Execution started for pipeline [redshift_empty_table]
2023/12/01 05:59:03 - S3 File - Part size less than minimum of 5MB, set to minimum
2023/12/01 05:59:03 - S3 File - Part size less than minimum of 5MB, set to minimum
2023/12/01 05:59:04 - load empty_table.0 - Connected to database dwh
2023/12/01 05:59:04 - load empty_table.0 - ERROR: Unexpected error
2023/12/01 05:59:04 - load empty_table.0 - ERROR: org.apache.hop.core.exception.HopDatabaseException:
2023/12/01 05:59:04 - load empty_table.0 - Error executing COPY statements
2023/12/01 05:59:04 - load empty_table.0 - ERROR: The specified S3 prefix 'main/empty_table.csv' does not exist
2023/12/01 05:59:04 - load empty_table.0 - Detail:
2023/12/01 05:59:04 - load empty_table.0 - -----------------------------------------------
2023/12/01 05:59:04 - load empty_table.0 - error: The specified S3 prefix 'main/empty_table.csv' does not exist
2023/12/01 05:59:04 - load empty_table.0 - code: 8001
2023/12/01 05:59:04 - load empty_table.0 - context:
2023/12/01 05:59:04 - load empty_table.0 - query: 161874653[child_sequence:1]
2023/12/01 05:59:04 - load empty_table.0 - location: s3_utility.cpp:708
2023/12/01 05:59:04 - load empty_table.0 - process: padbmaster [pid=1073963716]
2023/12/01 05:59:04 - load empty_table.0 - -----------------------------------------------
2023/12/01 05:59:04 - load empty_table.0 -
2023/12/01 05:59:04 - load empty_table.0 -
2023/12/01 05:59:04 - load empty_table.0 - at org.apache.hop.pipeline.transforms.redshift.bulkloader.RedshiftBulkLoader.processRow(RedshiftBulkLoader.java:124)
2023/12/01 05:59:04 - load empty_table.0 - at org.apache.hop.pipeline.transform.RunThread.run(RunThread.java:55)
2023/12/01 05:59:04 - load empty_table.0 - at java.base/java.lang.Thread.run(Thread.java:829)
2023/12/01 05:59:04 - load empty_table.0 - Caused by: com.amazon.redshift.util.RedshiftException: ERROR: The specified S3 prefix 'main/empty_table.csv' does not exist
2023/12/01 05:59:04 - load empty_table.0 - Detail:
2023/12/01 05:59:04 - load empty_table.0 - -----------------------------------------------
2023/12/01 05:59:04 - load empty_table.0 - error: The specified S3 prefix 'main/empty_table.csv' does not exist
2023/12/01 05:59:04 - load empty_table.0 - code: 8001
2023/12/01 05:59:04 - load empty_table.0 - context:
2023/12/01 05:59:04 - load empty_table.0 - query: 161874653[child_sequence:1]
2023/12/01 05:59:04 - load empty_table.0 - location: s3_utility.cpp:708
2023/12/01 05:59:04 - load empty_table.0 - process: padbmaster [pid=1073963716]
2023/12/01 05:59:04 - load empty_table.0 - -----------------------------------------------
2023/12/01 05:59:04 - load empty_table.0 -
2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.core.v3.QueryExecutorImpl.receiveErrorResponse(QueryExecutorImpl.java:2608)
2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.core.v3.QueryExecutorImpl.processResultsOnThread(QueryExecutorImpl.java:2276)
2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:1881)
2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.core.v3.QueryExecutorImpl.processResults(QueryExecutorImpl.java:1873)
2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.core.v3.QueryExecutorImpl.execute(QueryExecutorImpl.java:370)
2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.executeInternal(RedshiftStatementImpl.java:515)
2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.execute(RedshiftStatementImpl.java:436)
2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.executeWithFlags(RedshiftStatementImpl.java:377)
2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.executeCachedSql(RedshiftStatementImpl.java:363)
2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.executeWithFlags(RedshiftStatementImpl.java:340)
2023/12/01 05:59:04 - load empty_table.0 - at com.amazon.redshift.jdbc.RedshiftStatementImpl.executeUpdate(RedshiftStatementImpl.java:298)
2023/12/01 05:59:04 - load empty_table.0 - at org.apache.hop.pipeline.transforms.redshift.bulkloader.RedshiftBulkLoader.processRow(RedshiftBulkLoader.java:116)
2023/12/01 05:59:04 - load empty_table.0 - ... 2 more
2023/12/01 05:59:04 - load empty_table.0 - Finished processing (I=0, O=0, R=0, W=0, U=0, E=1)
2023/12/01 05:59:04 - redshift_empty_table - Pipeline duration : 4.006 seconds [ 4.006" ]
2023/12/01 05:59:04 - redshift_empty_table - Execution finished on a local pipeline engine with run configuration 'local'
2023/12/01 05:59:04 - redshift_empty_table - Pipeline detected one or more transforms with errors.
2023/12/01 05:59:04 - redshift_empty_table - Pipeline is killing the other transforms!
Issue Priority
Priority: 2
Issue Component
Component: Database, Component: Transforms
The text was updated successfully, but these errors were encountered: