Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

Java 998 165 Updated Mar 11, 2025

dataflint / spark

Performance Observability for Apache Spark

TypeScript 231 24 Updated Mar 10, 2025

ZachGoldberg / Startup-CTO-Handbook

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

10,339 501 Updated May 5, 2024

akuity / awesome-argo

A curated list of awesome projects and resources related to Argo (a CNCF graduated project)

2,127 168 Updated Jan 30, 2025

github / opensource.guide

📚 Community guides for open source creators

HTML 14,383 14,722 Updated Mar 11, 2025

awslabs / mountpoint-s3

A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.

Rust 4,965 189 Updated Mar 11, 2025

kuchin / awesome-cto

A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups

30,217 1,846 Updated Mar 24, 2024

astronomer / astronomer-cosmos

Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code

Python 873 201 Updated Mar 11, 2025

apache / celeborn

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

Java 934 385 Updated Mar 11, 2025

open-metadata / OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 6,241 1,170 Updated Mar 11, 2025

datahub-project / datahub

The Metadata Platform for your Data and AI Stack

Java 10,390 3,067 Updated Mar 11, 2025

apache / seatunnel

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

Java 8,338 1,921 Updated Mar 11, 2025

apache / superset

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 64,903 14,643 Updated Mar 11, 2025

apache / uniffle

Uniffle is a high performance, general purpose Remote Shuffle Service.

Java 402 154 Updated Mar 11, 2025

minio / minio

MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.

Go 50,701 5,698 Updated Mar 11, 2025

datafold / data-diff

Compare tables within or across databases

Python 2,963 275 Updated May 17, 2024

prestodb / presto

The official home of the Presto distributed SQL query engine for big data

Java 16,241 5,428 Updated Mar 11, 2025

apache / kyuubi

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,155 938 Updated Mar 10, 2025

airbytehq / airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 17,461 4,363 Updated Mar 11, 2025

argoproj / argo-helm

ArgoProj Helm Charts

Mustache 1,871 1,921 Updated Mar 11, 2025

apache / iceberg

Apache Iceberg

Java 7,011 2,419 Updated Mar 11, 2025

kubeflow / spark-operator

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Go 2,878 1,401 Updated Mar 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Denis Krivenko dnskr

Achievements

Achievements

Block or report dnskr

Stars

dlt-hub / dlt

apache / spark-kubernetes-operator

ShaneCurcuru / fosssustainability

apache / ozone-helm-charts

apache / ozone

unitycatalog / unitycatalog

sdg-1 / consulting-handbook

prestodb / presto-helm-charts

apache / incubator-xtable