All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- "String length exceeds the maximum length" error when querying large GIS data in Livy (#10)
- Updated Rust toolchain and minor dependencies
- Upgraded Spark version to
3.5.0
- Enabled ANSI mode by default
- Building a multi-platform image for
arm64
architectures
- Coalesce output of
RawQuery
to avoid a crash on multiple partitions
- Upgraded to unified ODF schema
- Escape special symbols in input view names
- Removed deprecated Spark-only ingest
- Upgraded to new ODF engine protocol
- Upgraded to new ODF schemas
- Transform engine will now read the list of input files as specified in the request instead of reading the entire directory of Parquet files
- Configuring Spark to write timestamps in
Timestamp(MILLIS,true)
logical Parquet format instead ofint96
that is considered deprecated and causing compatibility issues
- Removed ingest checkpoints
- Unique naming of temporary output directories
- Support for reading Apache Parquet files directly
- Updated to latest ODF schemas
- Adapter will now tar/untar checkpoints to have them managed as files
- Ignore
offset
column in merge strategies
- Handle Shapefiles with subdirectories
- Helpful error messages for invalid event time column upon ingest
- Upgraded to Spark 3.1.2 and latest Sedona
- Fixed data hashing when dataframe is empty
- Upgraded to Spark 3.0.1 and latest Sedona (GeoSpark)
- Upgraded to ODF manifests
- Upgraded to ODF manifests
- Windows support improvements
- Minimizing use of Hadoop FS
- Support watermarking in ingest and transform
- Further engine interface improvements
- Event time column is now required
- System time column is not considered when computing the hash
- Result files will be named using system time
- Ensuring data is sorted in Parquet files based on event time
- Naming output files using system time
- Major refactoring and combined with the ingress
- Separated engine build from
kamu-cli