Results for "data-parallel"

587 skills found · Page 1 of 20

halide / Halide

6.6k

a language for fast, portable data-parallel computation

universal

compiler · dsl · gpu · +4

Updated 7h ago

diku-dk / Futhark

2.7k

A data-parallel functional programming language

universal

boom · compiler · cuda · +7

Updated 10h ago

numaproj / Numaflow

2.4k

Kubernetes-native platform to run massively parallel data/streaming jobs

universal

data-processing · hacktoberfest · k8s · +4

Updated 1d ago

VcDevel / Vc

1.5k

SIMD Vector Classes for C++

universal

avx · avx2 · avx512 · +16

Updated 6d ago

tilo / Smarter Csv

1.5k

Fastest end-to-end CSV ingestion for Ruby (with C acceleration). SmarterCSV auto-detects formats, applies smart defaults, and returns Rails-ready hashes for seamless use with ActiveRecord, Sidekiq, parallel jobs, and S3 pipelines — even for messy user-uploaded real-world data.

universal

csv · csv-converter · csv-export · +14

Updated 1d ago

functime-org / Functime

1.2k

Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.

universal

feature-engineering · forecasting · machine-learning · +4

Updated 3d ago

yandex / YaFSDP

986

YaFSDP: Yet another Fully Sharded Data Parallel

universal

Updated 6h ago

Tiramisu-Compiler / Tiramisu

958

A polyhedral compiler for expressing fast and portable data parallel algorithms

universal

code-generation · compiler · deep-neural-networks · +6

Updated 7d ago

GoogleCloudPlatform / DataflowJavaSDK

851

Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.

universal

big-data · data-analysis · data-mining · +3

Updated 1mo ago

tuplex / Tuplex

814

Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tuplex has similar Python APIs to Apache Spark or Dask, but rather than invoking the Python interpreter, Tuplex generates optimized LLVM bytecode for the given pipeline and input data set.

zed

Updated 2d ago

hpcc-systems / HPCC Platform

612

HPCC Systems (High Performance Computing Cluster) is an open source, massive parallel-processing computing platform for big data processing and analytics.

universal

Updated 1d ago

DiskFrame / Disk.frame

597

Fast Disk-Based Parallelized Data Manipulation Framework for Larger-than-RAM Data

zed

data · data-science · large-dataset · +3

Updated 1mo ago

binpash / Pash

592

PaSh: Light-touch Data-Parallel Shell Processing

universal

bash · bash-scripting · data-analysis · +4

Updated 1d ago

spcl / Dace

581

DaCe - Data Centric Parallel Programming

universal

cuda · fpga · high-level-synthesis · +3

Updated 1d ago

leimao / Voice Converter CycleGAN

531

Voice Converter Using CycleGAN and Non-Parallel Data

universal

cyclegan · speech · voice-conversion

Updated 1mo ago

MicrosoftResearch / Naiad

527

The Naiad system provides fast incremental and iterative computation for data-parallel workloads

universal

Updated 9d ago

ufora / Ufora

489

Compiled, automatically parallel Python for data science

universal

Updated 3mo ago

cudpp / Cudpp

437

CUDA Data Parallel Primitives Library

universal

Updated 1d ago

timescale / Timescaledb Parallel Copy

432

A binary for parallel copying of CSV data into a TimescaleDB hypertable

universal

Updated 4d ago

lithops-cloud / Lithops

361

A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀

universal

big-data · big-data-analytics · cloud-computing · +11

Updated 11d ago