HipDF
hipDF - GPU DataFrame Library
Install / Use
/learn @ROCm-DS/HipDFREADME
<div align="left"> hipDF - GPU DataFrames on AMD GPUs</div>
[!NOTE] This README is derived from the original RAPIDSAI project's README. More care is necessary to remove/modify parts that are only applicable to the original version.
[!NOTE] This ROCm™ port is derived work based on the NVIDIA RAPIDS® cuDF project (version 25.02). It aims to follow the latter's directory structure and API naming as closely as possible to minimize porting friction for users that are interested in using both projects.
Resources
RAPIDS resources:
<!-- - [Try cudf.pandas now](https://nvda.ws/rapids-cudf): Explore `cudf.pandas` on a free GPU enabled instance on Google Colab! - [Install](https://docs.rapids.ai/install): Instructions for installing cuDF and other [RAPIDS](https://rapids.ai) libraries. -->- cudf (Python) documentation
- libcudf (C++/CUDA) documentation
- RAPIDS Community: Get help, contribute, and collaborate.
Overview
Built based on the Apache Arrow columnar memory format, hipDF is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data.
hipDF provides a pandas-like API that will be familiar to data engineers & data scientists, so they can use it to easily accelerate their workflows without going into the details of HIP programming.
For example, the following snippet downloads a CSV, then uses the GPU to parse it into rows and columns and run calculations:
import cudf
tips_df = cudf.read_csv("https://github.com/plotly/datasets/raw/master/tips.csv")
tips_df["tip_percentage"] = tips_df["tip"] / tips_df["total_bill"] * 100
# display average tip by dining party size
print(tips_df.groupby("size").tip_percentage.mean())
Or, you can use hipDF as a no-code-change accelerator for pandas, using
cudf.pandas.
cudf.pandas supports 100% of the pandas API, utilizing hipDF for
supported operations and falling back to pandas when needed:
%load_ext cudf.pandas # pandas operations now use the GPU!
import pandas as pd
tips_df = pd.read_csv("https://github.com/plotly/datasets/raw/master/tips.csv")
tips_df["tip_percentage"] = tips_df["tip"] / tips_df["total_bill"] * 100
# display average tip by dining party size
print(tips_df.groupby("size").tip_percentage.mean())
Quick Start
[!NOTE] Currently, a docker image is not available for AMD GPUs.
Installation
[!CAUTION] Incompatibility notice: Mixing RAPIDS and ROCmDS packages/installations is not supported. To avoid conflicts, strictly separate and isolated environments must be maintained if it is required to install both RAPIDS and ROCm-DS packages on the same system.
[!NOTE] We support only AMD GPUs. Use the RAPIDS package for NVIDIA GPUs.
ROCm/GPU Requirements
- ROCm HIP SDK compilers version 7.0.2
- Build requirements:
rocthrust-dev,rocm-llvm-dev,hipcub(Ubuntu) - Runtime requirements:
rocm-llvm-dev(Ubuntu) - Officially supported architecture (gfx90a, gfx942).
- Ubuntu 22.04+
- Python 3.10 or later
Install from AMD PyPI
See install instructions.
Build/Install from Source
See build instructions.
Open GPU Data Science
The ROCm-DS suite of open source software libraries aims to enable execution of end-to-end data science and analytics pipelines entirely on AMD GPUs. It relies on ROCm HIP primitives for low-level compute optimization, but exposing that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.
Apache Arrow on GPU
The GPU version of Apache Arrow is a common API that enables efficient interchange of tabular data between processes running on the GPU. End-to-end computation on the GPU avoids unnecessary copying and converting of data off the GPU, reducing compute time and cost for high-performance analytics common in artificial intelligence workloads. As the name implies, hipDF uses the Apache Arrow columnar data format on the GPU. Currently, a subset of the features in Apache Arrow are supported.
Related Skills
feishu-drive
339.5k|
things-mac
339.5kManage Things 3 via the `things` CLI on macOS (add/update projects+todos via URL scheme; read/search/list from the local Things database)
clawhub
339.5kUse the ClawHub CLI to search, install, update, and publish agent skills from clawhub.com
yu-ai-agent
2.0k编程导航 2025 年 AI 开发实战新项目,基于 Spring Boot 3 + Java 21 + Spring AI 构建 AI 恋爱大师应用和 ReAct 模式自主规划智能体YuManus,覆盖 AI 大模型接入、Spring AI 核心特性、Prompt 工程和优化、RAG 检索增强、向量数据库、Tool Calling 工具调用、MCP 模型上下文协议、AI Agent 开发(Manas Java 实现)、Cursor AI 工具等核心知识。用一套教程将程序员必知必会的 AI 技术一网打尽,帮你成为 AI 时代企业的香饽饽,给你的简历和求职大幅增加竞争力。
