33 skills found · Page 1 of 2
apache / ArrowApache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
vortex-data / VortexAn extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.
scikit-hep / AwkwardManipulate JSON-like data with NumPy-like idioms.
facebookincubator / NimbleNew and extensible file format for storage of large columnar datasets.
parquet-go / Parquet GoHigh-performance Go package to read and write Parquet files
shunfei / IndexrAn open-source columnar data format designed for fast & realtime analytic with big data.
jcrobak / Parquet Pythonpython implementation of the parquet columnar file format.
fraugster / Parquet GoGo package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Presto and AWS Athena.
hardwood-hq / HardwoodA fast minimal dependency implementation of Apache Parquet
qubole / RubixCache File System optimized for columnar formats and object stores
man-group / SparrowC++20 idiomatic APIs for the Apache Arrow Columnar Format
JuliaIO / Parquet.jlJulia implementation of Parquet columnar file format reader
thulab / TsfileTHIS REPO HAS MOVED TO https://github.com/apache/incubator-iotdb. TsFile is a columnar file format designed for time-series data, which supports efficient compression and query. It is easy to integrate TsFile with your IOT big data processing frameworks.
yahoojapan / YosegiYosegi is a Schema-less columnar storage format. Provide flexible representation like JSON and efficient reading similar to other columnar storage formats.
embryo-labs / EvaluationOfColumnarFormats[VLDB 2023 Vol 17] "An Empirical Evaluation of Columnar Storage Formats"
dask-contrib / Dask AwkwardNative Dask collection for awkward arrays, and the library to use it.
reproio / ColumnifyMake record oriented data to columnar format.
tideworks / Arvo2parquetExample program that writes Parquet formatted data to plain files (i.e., not Hadoop hdfs); Parquet is a columnar storage format.
voltrondata-labs / 2024 Arrow Format TutorialRepository for the Arrow Columnar Format Tutorial for PyCon DE 2024
antiguru / Column RsColumnar type support for Rust