Apache Superset is a Data Visualization and Data Exploration Platform
-
Updated
Dec 5, 2024 - TypeScript
Apache Superset is a Data Visualization and Data Exploration Platform
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Turns Data and AI algorithms into production-ready web applications in no time.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Workflow Engine for Kubernetes
The Data Engineering Cookbook
Roadmap to becoming a data engineer in 2021
An orchestration platform for the development, production, and observation of data assets.
Always know what to expect from your data.
🐚 Python-powered shell. Full-featured and cross-platform.
Fancy stream processing made operationally mundane
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time ETL pipelines, and feature stores in minutes. Unified streaming and batch. PostgreSQL compatible.
Open Source Feature Flagging and A/B Testing Platform
The open source high performance ELT framework powered by Apache Arrow
The Open Source Feature Store for Machine Learning
Add a description, image, and links to the data-engineering topic page so that developers can more easily learn about it.
To associate your repository with the data-engineering topic, visit your repo's landing page and select "manage topics."