38 skills found · Page 1 of 2
VictoriaMetrics / VictoriaLogsFast and easy to use database for logs, which can efficiently handle terabytes of logs
NVIDIA-Merlin / NVTabularNVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
infochimps-labs / WukongRuby on Hadoop: Efficient, effective Hadoop streaming & bulk data processing. Write micro scripts for terabyte-scale data
glassflow / Clickhouse EtlGlassFlow OSS: Purpose-built for running any terabyte-scale transformations in Kafka-to-ClickHouse pipelines
VictoriaMetrics / VictoriaTracesFast and easy to use database for traces, which can efficiently handle terabytes of trace spans.
alexandres / Terashufterashuf shuffles multi-terabyte text files using limited memory
zettadb / KunlunKunlunBase is a distributed relational database management system(RDBMS) with complete NewSQL capabilities and robust transaction ACID guarantees and is compatible with standard SQL. Applications which used PostgreSQL or MySQL can work with KunlunBase as-is without any code change or rebuild because KunlunBase supports both PostgreSQL and MySQL connection protocols and DML SQL grammars. MySQL DBAs can quickly work on a KunlunBase cluster because we use MySQL as storage nodes of KunlunBase. KunlunBase can elastically scale out as needed, and guarantees transaction ACID under error conditions, and KunlunBase fully passes TPC-C, TPC-H and TPC-DS test suites, so it not only support OLTP workloads but also OLAP workloads. Application developers can use KunlunBase to build IT systems that handles terabytes of data, without any effort on their part to implement data sharding, distributed transaction processing, distributed query processing, crash safety, high availability, strong consistency, horizontal scalability. All these powerful features are provided by KunlunBase. KunlunBase supports powerful and user friendly cluster management, monitor and provision features, can be readily used as DBaaS.
sean-t-smith / Extreme Breach MasksA set of prioritized Hashcat .hcmask files intelligently developed from terabytes of password breach datasets and organized by run time.
cparthiv / Annas TorrentsHelp preserve humanity's knowledge! This is a program to download torrent files given an amount of terabytes you would like to help contribute with.
Kawwabi / TerabyteTweakerTerabyte Tweaker is a program written in batch that allows your PC to run at better speeds, it transforms a "Bad PC" onto a "Medium PC", and turns a "Medium PC" into a monster.
VUKOZ-OEL / 3d ForestVisualization, processing and analysis of Lidar point clouds, mainly focused on forest environment. New version of 3D Forest. Process files with terabytes of data. Edit new point attributes. Simple addition of new features by plugins.
vgel / Flickr FuseTake advantage of Flickr's new terabyte storage limit by turning it into a bad network filesystem with FUSE
zhaoxiaofei / BindashFast and precise comparison of genomes and metagenomes (in the order of terabytes) on a typical personal laptop
ddiazdom / LcgEfficient, parallel compression for terabyte-scale data
unum-cloud / UcsbWide NoSQL benchmark for RocksDB, LevelDB, Redis, WiredTiger and MongoDB extending the Yahoo Cloud Serving Benchmark
arttumiettinen / Pi2C++ library and command-line software for processing and analysis of terabyte-scale volume images locally or on a computing cluster.
Terabyte17 / Terabyte17No description available
Baron-von-Riedesel / HimemSXeXtended Memory Manager (XMM) that can manage memory beyond the 4 GB barrier, up to 1 terabyte.
kijungs / DcubeD-Cube: Dense-Block Detection in Terabyte-Scale Tensors (WSDM'17 & Frontiers in Big Data'21)
multifacet / 0sim WorkspaceTools and experiments for 0sim. Simulate system software behavior on machines with terabytes of main memory from your desktop.