data-ingestion

Orbital automates integration between data sources (APIs, Databases, Queues and Functions). BFF's, API Composition and ETL pipelines that adapt as your specs change.

kotlin java microservices typescript integration etl api-management api-gateway rest-api data-engineering data-ingestion bff api-integration data-piplines bff-api taxiql semantic-integration

Updated Sep 9, 2024
TypeScript

cuebook / cuelake

Star

Use SQL to build ELT pipelines on a data lakehouse.

sql apache-spark etl pipelines data-engineering data-lake data-transfer delta data-integration upsert elt data-pipeline datalake data-ingestion spark-sql zeppelin-notebook apache-iceberg lakehouse incremental-updates

Updated May 25, 2022
JavaScript

merantix-momentum / squirrel-core

Star

A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way 🌰

Updated Nov 25, 2024
Python

thedataengineeringbook / thedataengineeringbook

Star

The Data Engineering Book - หนังสือวิศวกรรมข้อมูล ของคนไทย เพื่อคนไทย

data book data-engineering data-integration hacktoberfest data-pipeline data-engineer data-ingestion data-infrastructure

Updated Oct 28, 2023
JavaScript

apache / paimon-rust

Star

Apache Paimon Rust The rust implementation of Apache Paimon.

rust big-data real-time-analytics data-ingestion table-store paimon streaming-datalake

Updated Oct 1, 2024
Rust

jgperrin / net.jgp.labs.spark

Star

Apache Spark examples exclusively in Java

java spark ingestion udf dataframe data-ingestion

Updated Apr 21, 2023
Java

XavientInformationSystems / Data-Ingestion-Platform

Star

spark storm apex flink batch-processing data-ingestion dip samza

Updated Feb 11, 2020
Java

merantix-momentum / squirrel-datasets-core

Star

Squirrel dataset hub

python data-science machine-learning natural-language-processing ai computer-vision deep-learning tensorflow cv ml collaboration pytorch distributed dataops cloud-computing datasets npl data-ingestion data-mesh

Updated Sep 7, 2023
Python

aws-samples / amazon-kinesis-data-processor-aws-fargate

Star

Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate

containers amazon-kinesis data-ingestion data-processor amazon-kcl kinesis-data-streams scalable-data-stream

Updated Jul 15, 2024
Python

Dynatrace / openkit-java

Star

OpenKit Java Reference Implementation

sdk dynatrace data-ingestion dev-program

Updated Aug 2, 2024
Java

Dynatrace / OneAgent-SDK-for-Java

Star

Enables custom tracing of Java applications in Dynatrace

agent sdk apm dynatrace data-ingestion sdk-java oneagent dev-program

Updated Sep 3, 2024
Java

fremantle-industries / history

Star

Download and warehouse historical trading data

data-science elixir trading data-visualization data-warehouse cryptocurrency trading-algorithms data-ingestion

Updated Mar 17, 2023
Elixir

tuannd89 / elasticsearch-full-text-search

Star

search-engine elasticsearch lucene data-ingestion

Updated Sep 7, 2020

linkedin / data-integration-library

Star

The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and egress.

data-integration data-ingestion gobblin data-ingest data-egress

Updated Jul 25, 2024
Java

Improve this page

Add a description, image, and links to the data-ingestion topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-ingestion topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data-ingestion

Here are 156 public repositories matching this topic...

apache / seatunnel

bruin-data / ingestr

apache / paimon

dashbitco / broadway

pravega / pravega

CrunchyData / pg_parquet

orbitalapi / orbital

cuebook / cuelake

merantix-momentum / squirrel-core

thedataengineeringbook / thedataengineeringbook

apache / paimon-rust

jgperrin / net.jgp.labs.spark

XavientInformationSystems / Data-Ingestion-Platform

merantix-momentum / squirrel-datasets-core

aws-samples / amazon-kinesis-data-processor-aws-fargate

Dynatrace / openkit-java

Dynatrace / OneAgent-SDK-for-Java

fremantle-industries / history

tuannd89 / elasticsearch-full-text-search

linkedin / data-integration-library

Improve this page

Add this topic to your repo