View wxl24life's full-sized avatar
🚩
Focusing


A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,380 813 Updated Mar 27, 2025

The Java gRPC implementation. HTTP/2 based RPC

Java 11,661 3,888 Updated Mar 26, 2025

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,745 1,948 Updated Mar 28, 2025

Free Docker eBooks

68 30 Updated Apr 24, 2018
C++ 9 13 Updated May 16, 2024

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

Java 938 386 Updated Mar 27, 2025

ByConity is an open source cloud data warehouse

C++ 2,172 306 Updated Mar 24, 2025

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 11,041 3,148 Updated Mar 28, 2025

Official electron build of draw.io

JavaScript 53,517 5,203 Updated Mar 27, 2025

A composable and fully extensible C++ execution engine library for data management systems.

C++ 3,677 1,218 Updated Mar 28, 2025

光 HikariCP・A solid, high-performance, JDBC connection pool at last.

Java 20,341 2,971 Updated Mar 24, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,911 1,799 Updated Mar 27, 2025

Readings in Databases

7,805 912 Updated Sep 9, 2024

This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Scala 1,269 761 Updated Jan 28, 2025

Spark: The Definitive Guide's Code Repository

Scala 2,939 2,826 Updated Aug 26, 2020

Notes talking about the design and implementation of Apache Spark

5,309 1,838 Updated Apr 2, 2024

120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.

Python 30,084 4,521 Updated May 8, 2024

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 294,685 49,010 Updated Dec 2, 2024

A complete computer science study plan to become a software engineer.

313,773 78,274 Updated Dec 5, 2024
Kotlin 1 Updated Apr 16, 2019

Kafka Docker for development. Kafka, Zookeeper, Schema Registry, Kafka-Connect, Landoop Tools, 20+ connectors

Shell 1 Updated Jun 18, 2019

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,829 28,474 Updated Mar 28, 2025

Scalable datastore for metrics, events, and real-time analytics

Rust 29,734 3,594 Updated Mar 27, 2025

A curated list of awesome big data frameworks, resources and other awesomeness.

13,525 2,565 Updated Feb 14, 2025

Benchmark comparing serialization libraries on the JVM

Java 3,293 562 Updated Oct 7, 2023

Reversible conversions between types

Scala 658 123 Updated Nov 22, 2024

Web tool for Avro Schema Registry

JavaScript 422 113 Updated Feb 13, 2024

Snippets and small examples demonstrating Kafka features and configs

Java 648 383 Updated Jul 1, 2022

Iglu is a machine-readable, open-source schema repository for JSON Schema from the team at Snowplow

Shell 209 45 Updated Mar 12, 2025