-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Improve][Doc] Add file_filter_pattern
example to doc
#7922
Conversation
1、When the ClickHouse connector is set to multi parallelism, the task extraction is completed but cannot be stopped normally [(#7897)](#7897) 2、Added E2E test cases for this issue [(#7897)](#7897) 3、Local developers want to observe **Job Progress Information** in a timely manner, Need to modify the following configuration.The configuration in config is invalid ``` seatunnel engine/seatunnel-engineer-common/src/main/resources/seatunnely.yaml ```
1、When the ClickHouse connector is set to multi parallelism, the task extraction is completed but cannot be stopped normally [(#7897)](#7897) 2、Added E2E test cases for this issue [(#7897)](#7897) 3、Local developers want to observe **Job Progress Information** in a timely manner, Need to modify the following configuration.The configuration in config is invalid ``` seatunnel engine/seatunnel-engineer-common/src/main/resources/seatunnely.yaml ```
1、When the ClickHouse connector is set to multi parallelism, the task extraction is completed but cannot be stopped normally [(#7897)](#7897) 2、Added E2E test cases for this issue [(#7897)](#7897) 3、Local developers want to observe **Job Progress Information** in a timely manner, Need to modify the following configuration.The configuration in config is invalid ``` seatunnel engine/seatunnel-engineer-common/src/main/resources/seatunnely.yaml ```
This reverts commit 2e9162d.
This reverts commit e64b8a6.
This reverts commit 1b80667.
This reverts commit 4276681.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @YOMO-Lee ! I left some comments.
@@ -254,6 +254,72 @@ Specifies Whether to process data using the tag attribute format. | |||
|
|||
Filter pattern, which used for filtering files. | |||
|
|||
The filtering format is similar to wildcard matching file names in Linux. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We cannot tell users about the ambiguous conclusion. Please tell users directly that we use Java regular expressions.
| Wildcard | Meaning | Example | | ||
|--------------|--------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------| | ||
| * | Match 0 or more characters | f*     Any file starting with f<br/>b*.txt   Any file starting with b, any character in the middle, and ending with. txt | | ||
| [] | Match a single character in parentheses | [abc]*   A file that starts with any one of the characters a, b, or c | | ||
| ? | Match any single character | f?.txt   Any file starting with 'f' followed by a character and ending with '. txt' | | ||
| [!] | Match any single character not in parentheses | [!abc]*   Any file that does not start with abc | | ||
| [a-z] | Match any single character from a to z | [a-z]*   Any file starting with a to z | | ||
| {a,b,c}/a..z | When separated by commas, it represents individual characters<br/>When separated by two dots, represents continuous characters | {a,b,c}*   Files starting with any character from abc<br/>{a..Z}*    Files starting with any character from a to z | | ||
|
||
However, it should be noted that unlike Linux wildcard characters, when encountering file suffixes, the middle dot cannot be omitted. | ||
|
||
For example, `abc20241022.csv`, the normal Linux wildcard `abc*` is sufficient, but here we need to use `abc*.*` , Pay attention to a point in the middle. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please replace this part to link to https://en.wikipedia.org/wiki/Regular_expression. Let user to learn regular itself.
report.txt | ||
notes.txt | ||
input.csv | ||
abch20241022.csv | ||
abcw20241022.csv | ||
abcx20241022.csv | ||
abcq20241022.csv | ||
abcg20241022.csv | ||
abcv20241022.csv | ||
abcb20241022.csv | ||
old_data.csv | ||
logo.png | ||
script.sh | ||
helpers.sh |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add some file path, not only match file name.
Optimize the describe about Regex
@@ -254,6 +254,54 @@ Specifies Whether to process data using the tag attribute format. | |||
|
|||
Filter pattern, which used for filtering files. | |||
|
|||
The pattern follows standard regular expressions. For details, please refer to https://en.wikipedia.org/wiki/Regular_expression. learn it |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The pattern follows standard regular expressions. For details, please refer to https://en.wikipedia.org/wiki/Regular_expression. learn it | |
The pattern follows standard regular expressions. For details, please refer to https://en.wikipedia.org/wiki/Regular_expression. |
@@ -254,6 +254,54 @@ Specifies Whether to process data using the tag attribute format. | |||
|
|||
Filter pattern, which used for filtering files. | |||
|
|||
The pattern follows standard regular expressions. For details, please refer to https://en.wikipedia.org/wiki/Regular_expression. learn it | |||
|
|||
File Structure Example: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
File Structure Example: | |
There are some examples. | |
File Structure Example: |
Optimize document structure
Optimize document structure
@Hisoka-X Please review this |
Please provide a description of all connectors that support the file_filter_pattern parameter
Added the following file connector description about file_filter_pattern: CosFile(en)、OssFile(en)、OssJindoFile(en)、HdfsFile(en)
Added the following file connector description about file_filter_pattern: FtpFile(en)、SftpFile(en)、S3File(en)、HdfsFile(zh)
file_filter_pattern
example to doc
@zhilinli123 please review |
Added file filtering instructions to the localfile connector documentation