683 skills found · Page 1 of 23
airbytehq / AirbyteThe leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Unstructured-IO / UnstructuredConvert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
pentaho / Pentaho KettlePentaho Data Integration ( ETL ) a.k.a Kettle
Zipstack / UnstractLLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows
ucbepic / DocetlA system for agentic LLM-powered data processing and ETL
blockchain-etl / Ethereum EtlPython scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
datachain-ai / DatachainAnalytics, Versioning and ETL for multimodal data: video, audio, PDFs, images
thbar / KibaData processing & ETL framework for Ruby
compose / TransporterSync data between persistence engines, like ETL only not stodgy
wgzhao / AddaxA fast and versatile ETL tool that can transfer data between RDBMS and NoSQL seamlessly
PatMartin / DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
paillave / Etl.NetMass processing data with a complete ETL for .net developers
digitalocean / FireboltGolang framework for streaming ETL, observability data pipeline, and event processing apps
elastic / ElandPython Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
DataWithBaraa / Sql Data Warehouse ProjectA comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
PhantomInsights / Baby Names AnalysisData ETL & Analysis on the dataset 'Baby Names from Social Security Card Applications - National Data'.
flow-php / EtlPHP - ETL (Extract Transform Load) data processing library
long2ice / SynchSync data from the other DB to ClickHouse(cluster)
orbitalapi / OrbitalOrbital automates integration between data sources (APIs, Databases, Queues and Functions). BFF's, API Composition and ETL pipelines that adapt as your specs change.
renatootescu / ETL PipelineEducational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.