TrinityLake

An Open Lakehouse Format for Big Data Analytics, ML & AI

Introduction

TrinityLake is an Open Lakehouse Format for Big Data Analytics, ML & AI. It defines a storage layout on top of objects such as Apache Iceberg tables and Substrait views to form a complete, storage-only lakehouse.

It offers the following key features:

A storage-only lakehouse solution that works exactly the same way locally, on premises, and in the cloud
Multi-object multi-statement transactions with standard SQL BEGIN and COMMIT semantics
Consistent time travel and snapshot export across all objects in the lakehouse
Distributed transactions that span multiple engines, for complex write-audit-publish workflows
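
To make the transaction and time-travel features above more concrete, here is a minimal in-memory sketch in Python. It is not TrinityLake's API or storage layout: `ToyLakehouse`, `ToyTransaction`, and all of their methods are hypothetical names invented for illustration. The sketch assumes that every commit publishes one new immutable lakehouse version, and that a commit is rejected if another transaction committed first. Those are simplifying assumptions, not statements about how TrinityLake resolves conflicts.

```python
from types import MappingProxyType


class ToyLakehouse:
    """Hypothetical model: a list of immutable versions, each mapping
    object names (tables, views, ...) to their state."""

    def __init__(self):
        # Version 0 is the empty lakehouse.
        self._versions = [MappingProxyType({})]

    @property
    def latest_version(self):
        return len(self._versions) - 1

    def snapshot(self, version=None):
        """Read-only view of every object at one version (time travel)."""
        if version is None:
            version = self.latest_version
        return self._versions[version]

    def begin(self):
        """Start a transaction pinned to the current version."""
        return ToyTransaction(self, self.latest_version)

    def _commit(self, base_version, changes):
        # Assumed rule: fail if someone else committed since we began.
        if base_version != self.latest_version:
            raise RuntimeError("conflict: lakehouse changed since BEGIN")
        merged = dict(self._versions[base_version])
        merged.update(changes)
        # All changed objects become visible together in one new version.
        self._versions.append(MappingProxyType(merged))
        return self.latest_version


class ToyTransaction:
    """Buffers writes to several objects until commit()."""

    def __init__(self, lake, base_version):
        self._lake = lake
        self._base = base_version
        self._changes = {}

    def put(self, name, state):
        self._changes[name] = state

    def get(self, name):
        # Reads see this transaction's own writes, then the pinned snapshot.
        if name in self._changes:
            return self._changes[name]
        return self._lake.snapshot(self._base)[name]

    def commit(self):
        return self._lake._commit(self._base, self._changes)


# Usage: change a table and a view in one transaction, then time travel.
lake = ToyLakehouse()
txn = lake.begin()                      # BEGIN
txn.put("orders", "orders-v1")
txn.put("orders_view", "view-over-orders-v1")
v1 = txn.commit()                       # COMMIT: both objects appear at once

txn = lake.begin()
txn.put("orders", "orders-v2")
v2 = txn.commit()

old = lake.snapshot(v1)                 # consistent view across all objects
new = lake.snapshot(v2)
```

Because each version is an immutable mapping over all objects, reading `lake.snapshot(v1)` returns the table and the view exactly as they were committed together, regardless of later changes.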

For more details, please visit trinitylake.io.


trinitylake-io/trinitylake
