118 skills found · Page 1 of 4
aws / Aws SDK Pandaspandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
alanchn31 / Data Engineering ProjectsPersonal Data Engineering Projects
awslabs / Aws Lambda Redshift LoaderAmazon Redshift Database Loader implemented in AWS Lambda
tokern / PiicatcherScan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
xnuinside / Simple Ddl ParserLightweight SQL DDL parser for extracting tables, columns, and schema metadata with broad multi-dialect support (HQL, TSQL, AWS Redshift, BigQuery, Snowflake and other dialects)
aws / Amazon Redshift Python DriverRedshift Python Connector. It supports Python Database API Specification v2.0.
airscholar / RedditDataEngineeringThis project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and services including Apache Airflow, Celery, PostgreSQL, Amazon S3, AWS Glue, Amazon Athena, and Amazon Redshift.
aws-samples / Serverless Data AnalyticsCloudFormation templates and scripts to setup the AWS services for the workshop, Athena & Redshift Spectrum queries
heroku / AwsdetailedbillingA toolkit for importing AWS detailed billing reports into Redshift
alanchn31 / Movalytics Data WarehouseData pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
iam-mhaseeb / Skytrax Data WarehouseA full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
Wittline / Uber Expenses TrackingThe goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such as Apache Airflow, AWS Redshift and Power BI.
tsaol / Web3 Serverless Analytics On Aws🔗 Serverless blockchain analytics pipeline on AWS - Extract, process and visualize Ethereum data using Kinesis, Lambda, Redshift Serverless & QuickSight
HariSekhon / DevOps Perl Tools25+ DevOps CLI Tools - Anonymizer, SQL ReCaser (MySQL, PostgreSQL, AWS Redshift, Snowflake, Apache Drill, Hive, Impala, Cassandra CQL, Microsoft SQL Server, Oracle, Couchbase N1QL, Dockerfiles), Hadoop HDFS & Hive tools, Solr/SolrCloud CLI, Nginx stats & HTTP(S) URL watchers for load-balanced web farms, Linux tools etc.
shravan-kuchkula / Udacity Data Eng Proj 1Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3
terraform-aws-modules / Terraform Aws RedshiftTerraform module to create AWS Redshift resources 🇺🇦
aws-solutions-library-samples / Guidance For Clickstream Analytics On AwsGuidance for Clickstream Analytics on AWS source code
feast-dev / Feast Aws Credit Scoring TutorialFeast AWS guide using Redshift / Spectrum / DynamoDB to build a credit scoring model
KentHsu / Udacity Data Engineering NanodgreeUdacity Data Engineering Nanodegree Program
frankfarrell / Terraform Provider RedshiftProvider for AWS Redshift entities, eg Users, Groups, Permissions, Schemas, Databases