122 skills found · Page 1 of 5
prestodb / Presto: The official home of the Presto distributed SQL query engine for big data
trinodb / Trino: Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
orientechnologies / Orientdb: OrientDB is the most versatile DBMS supporting Graph, Document, Reactive, Full-Text and Geospatial models in one Multi-Model product. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing and Reactive Queries.
m3db / M3: M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platform
crate / Crate: CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.
eclipse-zenoh / Zenoh: zenoh unifies data in motion, data in use, data at rest, and computations. It carefully blends traditional pub/sub with geo-distributed storages, queries and computations, while retaining a level of time and space efficiency that is well beyond any of the mainstream stacks.
apache / Drill: Apache Drill is a distributed MPP query layer for self-describing data
apache / Datafusion Ballista: Apache DataFusion Ballista Distributed Query Engine
apache / Auron: The Auron accelerator for distributed computing frameworks (e.g., Spark) leverages native vectorized execution to accelerate query processing
polardb / Polardbx Sql: PolarDB-X is a cloud-native distributed SQL database designed for high-concurrency, massive-storage, and complex-query scenarios.
eventql / Eventql: Distributed "massively parallel" SQL query engine
opencypher / Morpheus: Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.
treasure-data / Prestogres: PostgreSQL protocol gateway for Presto distributed SQL query engine
apisyouwonthate / Book Build Apis 2: Learn how to build and leverage the latest technology in APIs, services, microservices, distributed systems, HATEOAS, query languages, circuit breakers, gateways, discombobulators, flux capacitors, spider monkeys...
datafusion-contrib / Ray Sql: Distributed SQL Query Engine in Python using Ray
ModeShape / Modeshape: ModeShape is a distributed, hierarchical, transactional, and consistent data store with support for queries, full-text search, events, versioning, references, and flexible and dynamic schemas. It is very fast, highly available, extremely scalable, and it is 100% open source and written in Java. Clients use the JSR-283 standard Java API for content repositories (aka, JCR) or ModeShape's REST API, and can query content through JDBC and SQL.
firebolt-db / Firebolt Core: Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performance data warehousing capabilities that can be deployed anywhere from a single laptop to enterprise datacenters.
SJTU-IPADS / Wukong: A graph-based distributed in-memory store that leverages efficient graph exploration to provide highly concurrent and low-latency queries over big linked data
traceloop / Opentelemetry MCP Server: Unified MCP server for querying OpenTelemetry traces across multiple backends (Jaeger, Tempo, Traceloop, etc.), enabling AI agents to analyze distributed traces for automated debugging and observability.
LiveRamp / Hank: (DEPRECATED. This project is no longer used or maintained at LiveRamp.) Hank is a high performance distributed key-value NoSQL database that we built and use at LiveRamp. It is designed for very large data stores that dwarf the amount of available main memory and for randomly distributed read/write workloads that far exceed the capacity of memory-based caches. More specifically, it is optimized for very low latency random read queries and for very high throughput incremental batch writes.