From e4d9ed7f09ed2de1ab21522bf4ff89e7a795d2ac Mon Sep 17 00:00:00 2001
From: Iain-S <25081046+Iain-S@users.noreply.github.com>
Date: Fri, 11 Aug 2023 16:50:57 +0100
Subject: [PATCH 1/6] Add SNS metadata spec and update some of ours.

https://docs.smartnoise.org/sql/metadata.html#yaml-format
---
 docs/source/_static/config_schema.html      |  2 +-
 sqlsynthgen/json_schemas/config_schema.json | 41 ++++++++++++++++++---
 tests/examples/example_config.yaml          | 13 ++++++-
 3 files changed, 47 insertions(+), 9 deletions(-)

diff --git a/docs/source/_static/config_schema.html b/docs/source/_static/config_schema.html
index d81eb9c..da965c5 100644
--- a/docs/source/_static/config_schema.html
+++ b/docs/source/_static/config_schema.html
@@ -1 +1 @@
-
A SQLSynthGen configuration YAML file
No Additional Properties
All properties whose name matches the following regular expression must respect the following conditions
Property name regular expression: ^(?!censor_dims).*$
All properties whose name matches the following regular expression must respect the following conditions
Property name regular expression: .*
A SQLSynthGen configuration YAML file
No Additional Properties
All properties whose name matches the following regular expression must respect the following conditions
Property name regular expression: ^(?!(max_ids|row_privacy|sample_max_ids|censor_dims|clamp_counts|clamp_columns|use_dpsu)).*$
All properties whose name matches the following regular expression must respect the following conditions
Property name regular expression: .*
A SQLSynthGen configuration YAML file
No Additional Properties
Run source-statistics queries using asyncpg.
The name of a local Python module of row generators (excluding .py).
The name of a local Python module of story generators (excluding .py).
An array of source statistics queries.
A name for the query.
A SQL query.
A SmartNoise SQL query.
The differential privacy epsilon value for the DP query.
The differential privacy delta value for the DP query.
See https://docs.smartnoise.org/sql/metadata.html#yaml-format.
All properties whose name matches the following regular expression must respect the following conditions
Property name regular expression: ^(?!(max_ids|row_privacy|sample_max_ids|censor_dims|clamp_counts|clamp_columns|use_dpsu)).*$
An array of story generators.
todo.
todo.
Table configurations.
All properties whose name matches the following regular expression must respect the following conditions
Property name regular expression: .*
A table configuration.
No Additional Properties
Whether to export the table data.
The number of rows to generate per pass.
An array of row generators to create column values.
The name of a (built-in or custom) function.
The arguments to pass to the function.
The keyword arguments to pass to the function.
One or more columns to assign the return value to.
A SQLSynthGen configuration YAML file
No Additional Properties
Run source-statistics queries using asyncpg.
The name of a local Python module of row generators (excluding .py).
The name of a local Python module of story generators (excluding .py).
An array of source statistics queries.
A name for the query, which will be used in the stats file.
A SQL query.
A SmartNoise SQL query.
The differential privacy epsilon value for the DP query.
The differential privacy delta value for the DP query.
See https://docs.smartnoise.org/sql/metadata.html#yaml-format.
All properties whose name matches the following regular expression must respect the following conditions
Property name regular expression: ^(?!(max_ids|row_privacy|sample_max_ids|censor_dims|clamp_counts|clamp_columns|use_dpsu)).*$
An array of story generators.
The full name of a story generator (e.g. mystorygenerators.short_story).
Positional arguments to pass to the story generator.
Keyword arguments to pass to the story generator.
todo
todo.
Table configurations.
All properties whose name matches the following regular expression must respect the following conditions
Property name regular expression: .*
A table configuration.
No Additional Properties
Whether to export the table data.
The number of rows to generate per pass.
An array of row generators to create column values.
The name of a (built-in or custom) function (e.g. max or myrowgenerators.my_gen).
Positional arguments to pass to the function.
Keyword arguments to pass to the function.
One or more columns to assign the return value to.
A SQLSynthGen configuration YAML file
No Additional Properties
Run source-statistics queries using asyncpg.
The name of a local Python module of row generators (excluding .py).
The name of a local Python module of story generators (excluding .py).
An array of source statistics queries.
A name for the query, which will be used in the stats file.
A SQL query.
A SmartNoise SQL query.
The differential privacy epsilon value for the DP query.
The differential privacy delta value for the DP query.
See https://docs.smartnoise.org/sql/metadata.html#yaml-format.
All properties whose name matches the following regular expression must respect the following conditions
Property name regular expression: ^(?!(max_ids|row_privacy|sample_max_ids|censor_dims|clamp_counts|clamp_columns|use_dpsu)).*$
An array of story generators.
The full name of a story generator (e.g. mystorygenerators.short_story).
Positional arguments to pass to the story generator.
Keyword arguments to pass to the story generator.
The number of times to call the story generator per pass.
The maximum number of tries to respect a uniqueness constraint.
Table configurations.
All properties whose name matches the following regular expression must respect the following conditions
Property name regular expression: .*
A table configuration.
No Additional Properties
Whether to export the table data.
The number of rows to generate per pass.
An array of row generators to create column values.
The name of a (built-in or custom) function (e.g. max or myrowgenerators.my_gen).
Positional arguments to pass to the function.
Keyword arguments to pass to the function.
One or more columns to assign the return value to.
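
Reviewer note: the property-name pattern `^(?!(max_ids|row_privacy|sample_max_ids|censor_dims|clamp_counts|clamp_columns|use_dpsu)).*$` in the SmartNoise metadata section relies on a negative lookahead, and its effect is easy to misread. The sketch below is not part of the patch. It applies the same pattern with Python's `re` module to show which keys it accepts. The helper name `is_pattern_key` and the sample keys are hypothetical, and the reading of the excluded names as SmartNoise table-level options (as opposed to column names) is an assumption based on the linked SmartNoise documentation.

```python
import re

# Pattern copied verbatim from the schema text in this patch.
PATTERN = re.compile(
    r"^(?!(max_ids|row_privacy|sample_max_ids|censor_dims"
    r"|clamp_counts|clamp_columns|use_dpsu)).*$"
)


def is_pattern_key(name: str) -> bool:
    """Return True if `name` is matched by the schema's property-name pattern.

    Hypothetical helper for illustration only; it is not part of SQLSynthGen.
    """
    return PATTERN.match(name) is not None


if __name__ == "__main__":
    # Sample keys are made up for the demonstration.
    for key in ["age", "row_privacy", "max_ids_per_user", "my_max_ids"]:
        print(f"{key!r}: {is_pattern_key(key)}")
```

Because the lookahead has no end anchor of its own, the pattern also rejects any key that merely starts with one of the listed names (such as the made-up `max_ids_per_user`), while a key that contains a listed name later in the string still matches. If only exact names should be excluded, the lookahead would need a trailing `$` inside it; whether that is intended here is not stated in the patch.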