Results for "aws-glue"

Claude Code Claude Desktop GitHub Copilot Cursor Windsurf Cline Zed JetBrains

📄SKILL.md 🤖CLAUDE.md ⚡Claude Commands 📐.cursorrules 📐Cursor Rules 🕹️AGENTS.md 🧬codex.md 🏄.windsurfrules 🔧.clinerules 🧑‍✈️Copilot Instructions

All Development Operations Data Product Marketing Customer Design Sales

265 skills found · Page 1 of 9

aws / Aws SDK Pandas

4.1k

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

universal

amazon-athenaamazon-sagemaker-notebookapache-arrow+17

Updated 1d ago

aws-samples / Aws Glue Samples

1.5k

AWS Glue code samples

universal

Updated 16d ago

awslabs / Aws Glue Libs

697

AWS Glue Libraries are additions and enhancements to Spark for ETL operations.

universal

Updated 4d ago

dgomesbr / Awesome Aws Workshops

412

(Unofficial) curated list of awesome workshops found around in the internet. As we all have been there, finding that workshop that you have just attended shouldn't be hard. The idea is to provide an easy central repository, in a collaborative way.

universal

amazon-eks-workshopamazon-sagemakeramazon-sagemaker-workshop+13

Updated 4mo ago

tokern / Piicatcher

338

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub

universal

aws-athenaaws-glueaws-redshift+8

Updated 1mo ago

streamthoughts / Jikkou

280

The Open source Resource as Code framework for Apache Kafka. Jikkou helps you implement GitOps for Kafka at scale!

universal

apache-kafkaautomationaws-glue+13

Updated 3d ago

data-dot-all / Dataall

251

A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.

universal

awsaws-glueaws-lake-formation+7

Updated 1d ago

awslabs / Aws Glue Data Catalog Client For Apache Hive Metastore

227

The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions

universal

Updated 3mo ago

airscholar / RedditDataEngineering

210

This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and services including Apache Airflow, Celery, PostgreSQL, Amazon S3, AWS Glue, Amazon Athena, and Amazon Redshift.

universal

apache-airflowawscelery+3

Updated 6d ago

awsdocs / Aws Glue Developer Guide

201

The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by making proposed changes & submitting a pull request.

universal

Updated 3mo ago

aws-samples / Data Lake As Code

173

Data Lake as Code, featuring ChEMBL and OpenTargets

universal

awsaws-cdkaws-cdk-constructs+2

Updated 5mo ago

awslabs / Aws Glue Schema Registry

149

AWS Glue Schema Registry Client library provides serializers / de-serializers for applications to integrate with AWS Glue Schema Registry Service. The library currently supports Avro, JSON and Protobuf data formats. See https://docs.aws.amazon.com/glue/latest/dg/schema-registry.html to get started.

universal

Updated 4d ago

awslabs / Athena Glue Service Logs

140

Glue scripts for converting AWS Service Logs for use in Athena

universal

alb-logsathenaaws-glue+7

Updated 5mo ago

aws-samples / Amazon Deequ Glue

Automated data quality suggestions and analysis with Deequ on AWS Glue

universal

awsaws-gluedata-quality+1

Updated 1mo ago

aws-samples / Cloud Experiments

Open innovation with 60 minute cloud experiments on AWS

universal

amazon-athenaamazon-comprehendamazon-rekognition+7

Updated 1mo ago

aws-samples / Streamlit Application Deployment On Aws

Streamlit EDA Dashboard Powered by AWS Cloud

universal

awsaws-athenaaws-cloudformation+4

Updated 7mo ago

aws-samples / Aws Glue Data Catalog Replication Utility

Replication utility for AWS Glue Data Catalog

universal

Updated 3mo ago

Ditectrev / Amazon Web Services Certified AWS Certified Machine Learning MLS C01 Practice Tests Exams Question

⛳️ PASS: Amazon Web Services Certified (AWS Certified) Machine Learning Specialty (MLS-C01) by learning based on our Questions & Answers (Q&A) Practice Tests Exams.

universal

amazon-athenaamazon-cloudwatchamazon-comprehend+17

Updated 7d ago

awslabs / Aws Glue Blueprint Libs

No description available

universal

Updated 3mo ago

aws-samples / Aws Ml Data Lake Workshop

As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges is getting visibility into their data for feature engineering and data format conversions for using AWS SageMaker. In this workshop, we demonstrate best practices and build data pipelines for training data using Amazon Kinesis Data Firehose, AWS Glue, and Amazon SageMaker, and then we use Amazon SageMaker for inference.

universal

Updated 11mo ago