Dozer
Dozer is a real-time data movement tool that leverages CDC from various sources and moves data into various sinks.
Overview
Dozer is a real-time data movement tool that uses change data capture (CDC) to move data from various sources to multiple sinks.
Dozer is orders of magnitude faster than Debezium+Kafka and natively supports stateless transformations. It is primarily used for moving data into warehouses. In our own application, we move data to ClickHouse and build data APIs and integrations with LLMs.
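To make the terms above concrete, here is a minimal conceptual sketch of CDC with a stateless transformation. It is not Dozer's implementation or API: the `ChangeEvent` shape, the `uppercase_name` transformation, and the in-memory dict standing in for a sink table are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class ChangeEvent:
    """One row-level change, as a CDC source might emit it (hypothetical shape)."""
    op: str               # "insert", "update", or "delete"
    key: int              # primary key of the affected row
    row: Optional[dict]   # new row contents; None for deletes


def uppercase_name(row: dict) -> dict:
    """A stateless transformation: the output depends only on the input row."""
    return {**row, "name": row["name"].upper()}


def apply_events(events, sink: dict, transform: Callable[[dict], dict]) -> dict:
    """Replay change events, in order, into an in-memory stand-in for a sink table."""
    for event in events:
        if event.op == "delete":
            sink.pop(event.key, None)
        elif event.op in ("insert", "update"):
            sink[event.key] = transform(event.row)
        else:
            raise ValueError(f"unknown operation: {event.op}")
    return sink


events = [
    ChangeEvent("insert", 1, {"id": 1, "name": "ada"}),
    ChangeEvent("insert", 2, {"id": 2, "name": "bob"}),
    ChangeEvent("update", 1, {"id": 1, "name": "ada l."}),
    ChangeEvent("delete", 2, None),
]
customers = apply_events(events, {}, uppercase_name)
print(customers)
```

Because the transformation keeps no state between rows, each event can be processed independently of the ones before it; only the order of application to the sink matters.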
How to use it
Dozer runs with a single configuration file like the following:
app_name: dozer-bench
version: 1
connections:
  - name: pg_1
    config: !Postgres
      user: user
      password: postgres
      host: localhost
      port: 5432
      database: customers
sinks:
  - name: customers
    config: !Dummy
      table_name: customers
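Before running a configuration like this, it can help to confirm that the Postgres connection values actually work. The sketch below assembles the same fields into a standard PostgreSQL connection URI, which can then be tried with a client such as `psql`. The `postgres_uri` helper is hypothetical and is not part of Dozer; how Dozer itself builds its connection is not shown in this README.

```python
from urllib.parse import quote

# Values copied from the `pg_1` connection in the example configuration.
connection = {
    "user": "user",
    "password": "postgres",
    "host": "localhost",
    "port": 5432,
    "database": "customers",
}


def postgres_uri(cfg: dict) -> str:
    """Build a postgresql:// URI, percent-encoding fields that may hold special characters."""
    return "postgresql://{user}:{password}@{host}:{port}/{database}".format(
        user=quote(cfg["user"], safe=""),
        password=quote(cfg["password"], safe=""),
        host=cfg["host"],
        port=cfg["port"],
        database=quote(cfg["database"], safe=""),
    )


uri = postgres_uri(connection)
print(uri)
```

The resulting string can be passed directly to `psql` as its connection argument to check the credentials and database name.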
Full documentation can be found here
Supported Sources
| Connector            | Extraction | Resuming | Enterprise      |
| -------------------- | ---------- | -------- | --------------- |
| Postgres             | ✅         | ✅       | ✅              |
| MySQL                | ✅         | ✅       | ✅              |
| Snowflake            | ✅         | ✅       | ✅              |
| Kafka                | ✅         | 🚧       | ✅              |
| MongoDB              | ✅         | 🎯       | ✅              |
| Amazon S3            | ✅         | 🎯       | ✅              |
| Google Cloud Storage | ✅         | 🎯       | ✅              |
| Oracle               | ✅         | ✅       | Enterprise Only |
| Aerospike            | ✅         | ✅       | Enterprise Only |
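The README does not define the Resuming column. Assuming it means that a connector can continue from its last processed position after a restart, the idea can be sketched as follows. This is a conceptual illustration only: `run_pipeline`, the list-based change log, and the integer checkpoint are hypothetical and do not reflect how Dozer stores its progress.

```python
def run_pipeline(log, sink, checkpoint, stop_after=None):
    """Apply log entries newer than `checkpoint`; return the updated checkpoint.

    `stop_after` limits how many entries are applied, to simulate an interruption.
    """
    processed = 0
    for position, (key, value) in enumerate(log, start=1):
        if position <= checkpoint:
            continue  # already applied in an earlier run
        if stop_after is not None and processed >= stop_after:
            break  # simulated interruption
        sink[key] = value
        checkpoint = position
        processed += 1
    return checkpoint


log = [("a", 1), ("b", 2), ("a", 3), ("c", 4)]
sink = {}

# First run is interrupted after two entries.
first_checkpoint = run_pipeline(log, sink, checkpoint=0, stop_after=2)

# Second run resumes from the saved position instead of starting over.
final_checkpoint = run_pipeline(log, sink, checkpoint=first_checkpoint)
print(first_checkpoint, final_checkpoint, sink)
```

Without a saved position, the second run would have to re-extract the source from the beginning.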
Supported Sinks
| Database   | Connectivity | Enterprise      |
| ---------- | ------------ | --------------- |
| ClickHouse | ✅           |                 |
| Postgres   | ✅           |                 |
| MySQL      | ✅           |                 |
| BigQuery   | ✅           |                 |
| Oracle     | ✅           | Enterprise Only |
| Aerospike  | ✅           | Enterprise Only |
