Harpy
Single-cell spatial omics analysis that makes you happy!
💫 If you find Harpy useful, please give us a ⭐! It helps others discover the project and supports continued development.
Why Harpy?
Harpy is a spatial omics analysis library for spatial transcriptomics and proteomics. Within the scverse stack, it bridges SpatialData and downstream analysis tools such as AnnData, Scanpy, and Squidpy. It provides scalable, image- and geometry-aware computation to transform raw spatial data into analysis-ready representations, with a strong emphasis on interoperability and large-scale workflows.
In practice, Harpy offers fast, out-of-core image preprocessing and tiled segmentation, along with efficient aggregation workflows that generate AnnData tables and compute per-cell features from images, segmentation masks, and transcript coordinates. It also supports deep feature extraction, pixel- and cell-level clustering, and the construction of single-cell representations from highly multiplexed images.
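To make the idea of aggregation concrete, here is a minimal NumPy sketch of how transcript coordinates and a segmentation mask can be turned into a cell-by-gene count matrix. It does not use Harpy's API: the function name `count_transcripts`, the toy data, and the assumption that coordinates are already integer pixel indices are all illustrative.

```python
import numpy as np

def count_transcripts(labels, ys, xs, gene_ids, n_genes):
    """Count transcripts per cell and gene (hypothetical helper, not Harpy API).

    labels   : (Y, X) integer mask, 0 = background
    ys, xs   : integer pixel coordinates of each transcript (assumed in bounds)
    gene_ids : integer gene index of each transcript
    """
    cell_of_transcript = labels[ys, xs]   # mask label under each transcript
    inside = cell_of_transcript > 0       # drop transcripts on background
    cell_ids, row = np.unique(cell_of_transcript[inside], return_inverse=True)
    counts = np.zeros((cell_ids.size, n_genes), dtype=np.int64)
    # np.add.at accumulates correctly when (row, gene) pairs repeat
    np.add.at(counts, (row, gene_ids[inside]), 1)
    return cell_ids, counts

# Toy example: two cells, two genes, five transcripts (one on background)
labels = np.array([[1, 1, 0],
                   [2, 2, 0]])
ys = np.array([0, 0, 1, 1, 0])
xs = np.array([0, 1, 0, 1, 2])
gene_ids = np.array([0, 1, 1, 1, 0])

cell_ids, counts = count_transcripts(labels, ys, xs, gene_ids, n_genes=2)
print(cell_ids)
print(counts)
```

Cells that contain no transcripts do not appear in `cell_ids` in this sketch. A matrix like `counts`, with cells as rows and genes as columns, has the layout that an AnnData table stores in `X`.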
- Multi-platform support for spatial transcriptomics and proteomics data.
- Interoperable outputs built on SpatialData.
- Scales to (very) large images: tiled workflows with Dask; optional GPU acceleration with CuPy and PyTorch.
- Scalable computational building blocks for segmentation, feature extraction, clustering, and spatial analysis.
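The tiled approach mentioned above can be illustrated with plain Dask, independent of Harpy. The sketch below applies a 3x3 mean filter tile by tile with a one-pixel overlap, so that tile borders match what a whole-image computation would produce. The filter `box_blur`, the image size, and the chunk size are assumptions chosen for illustration; only `dask.array.from_array` and `map_overlap` are real Dask calls.

```python
import numpy as np
import dask.array as da

def box_blur(tile):
    """3x3 mean filter on one tile (plain NumPy, edge-padded)."""
    h, w = tile.shape
    padded = np.pad(tile, 1, mode="edge")
    out = np.zeros((h, w), dtype=np.float32)
    for dy in range(3):
        for dx in range(3):
            out += padded[dy:dy + h, dx:dx + w]
    return out / 9.0

# Synthetic stand-in for a large image
rng = np.random.default_rng(0)
image_np = rng.random((512, 512), dtype=np.float32)

# Split into 128x128 tiles; each tile is processed with a 1-pixel halo
image = da.from_array(image_np, chunks=(128, 128))
blurred = image.map_overlap(box_blur, depth=1, boundary="nearest", dtype=np.float32)

result = blurred.compute()  # tiles are only evaluated here
print(result.shape)
```

The halo depth should be at least the radius of the operation applied per tile (1 pixel for a 3x3 filter); otherwise seams appear at tile borders.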
Installation
pip install harpy-analysis
With extras
pip install "harpy-analysis[extra]"
[extra] installs optional dependencies for:
- Segmentation: cellpose
- OpenCV support: opencv-python-headless
- FlowSOM Clustering: flowsom
- Notebook workflows: ipywidgets, tqdm, bokeh, textalloc, joypy, supervenn, nbconvert, ipython
- CLI workflows: hydra-core
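If you are unsure which optional packages are present in your environment, a small standard-library check like the one below can help. It is a sketch, not part of Harpy: the helper `missing_extras` is hypothetical, and the mapping covers only a few of the packages listed above, with their import names (for example, opencv-python-headless is imported as `cv2` and hydra-core as `hydra`).

```python
from importlib.util import find_spec

# pip name -> import name (a subset of the optional dependencies listed above)
OPTIONAL_MODULES = {
    "cellpose": "cellpose",
    "opencv-python-headless": "cv2",
    "flowsom": "flowsom",
    "hydra-core": "hydra",
}

def missing_extras(modules=OPTIONAL_MODULES):
    """Return the pip names whose import module cannot be found."""
    return [pip_name for pip_name, module in modules.items()
            if find_spec(module) is None]

missing = missing_extras()
print("missing optional packages:", ", ".join(missing) if missing else "none")
```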
With extras and napari
pip install "harpy-analysis[extra,napari]"
[napari] adds:
- napari[all]
- napari-spatialdata
For developers only: clone this repository locally, install the .[dev] dependencies instead of [extra], and read the contribution guide.
# Clone repository from GitHub
uv venv --python=3.12 # create venv, set python version (>=3.11)
source .venv/bin/activate # activate the virtual environment
uv pip install -e '.[dev]' # editable install with dev tooling
python -c 'import harpy; print(harpy.__version__)' # check if the package is installed
# make changes
python -m pytest # run the tests
Harpy can also be installed with Anaconda, although we recommend uv; see the installation guide.
Quickstart
See the short, runnable guide.
🧭 Tutorials and Guides
Explore how to use Harpy for segmentation, shallow and deep feature extraction, clustering, and spatial analysis of gigapixel-scale multiplexed data with these step-by-step notebooks:
-
🚀 Basic Usage of Harpy
Learn how to read in data, perform tiled segmentation using Cellpose and Dask-CUDA, extract features, perform QC, and analyze results downstream with Scanpy and Squidpy. 👉 Tutorial: image-based transcriptomics, Human Ovarian Cancer, Xenium 10x Genomics
-
🔧 Technology-specific advice
Learn which technologies Harpy supports. 👉 Tutorial
-
🧩 Pixel and Cell Clustering
Learn how to perform unsupervised pixel- and cell-level clustering using Harpy together with FlowSOM. 👉 Tutorial
-
✂️ Cell Segmentation
Explore segmentation workflows in Harpy using different tools.
💡 Want us to add support for another segmentation method? 👉 Open an issue and let us know!
-
🧪 Single-cell representations from highly multiplexed images and downstream use with PyTorch
Learn how single-cell representations can be generated from highly multiplexed images. These representations can then be used downstream to train classifiers in PyTorch. 👉 Tutorial
-
🧠 Deep Feature Extraction
Discover how Harpy enables fast, scalable extraction of deep, cell-level features from multiplex imaging data with the KRONOS foundation model for proteomics. 👉 Tutorial
💡 Want us to add support for another deep feature extraction method? 👉 Open an issue and let us know!
-
🔬 Shallow Feature Extraction
Learn to extract shallow features, such as the mean, median, and standard deviation of intensities, from multiplex imaging data with Harpy. 👉 Tutorial
-
🧬 Spatial Transcriptomics
Learn how to analyze spatial transcriptomics data with Harpy.
-
🌐 Multiple samples and coordinate systems
Learn how to work with multiple samples, intrinsic and micron coordinates. 👉 Tutorial
-
📐 Rasterize and vectorize labels and shapes
Learn how to convert a segmentation mask (array) into its vectorized form, and segmentation boundaries (polygons) into their rasterized equivalents. This conversion is useful, for example, when integrating annotations (e.g., from QuPath) into downstream spatial omics analysis. 👉 Tutorial
📚 For a complete list of tutorials, visit the Harpy documentation.
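As a rough illustration of what "shallow features" means in the tutorials above, the following NumPy sketch computes the mean, median, and standard deviation of each channel inside each cell of a segmentation mask. It is not Harpy's implementation: the function `shallow_features` and the toy arrays are hypothetical, and the loop over cells is written for clarity rather than speed.

```python
import numpy as np

def shallow_features(image, labels):
    """Per-cell intensity statistics (hypothetical helper, not Harpy API).

    image  : (C, Y, X) array of channel intensities
    labels : (Y, X) integer mask, 0 = background
    Returns {cell_id: {"mean": (C,), "median": (C,), "std": (C,)}}.
    """
    features = {}
    for cell_id in np.unique(labels):
        if cell_id == 0:
            continue  # skip background
        pixels = image[:, labels == cell_id]  # shape (C, n_pixels_in_cell)
        features[int(cell_id)] = {
            "mean": pixels.mean(axis=1),
            "median": np.median(pixels, axis=1),
            "std": pixels.std(axis=1),
        }
    return features

# Toy example: 2 channels, 2x2 image, two cells of two pixels each
image = np.array([[[1.0, 2.0], [3.0, 4.0]],
                  [[10.0, 10.0], [20.0, 40.0]]])
labels = np.array([[1, 1],
                   [2, 2]])

feats = shallow_features(image, labels)
print(feats[1]["mean"], feats[2]["std"])
```

For gigapixel images, the same statistics would normally be computed tile by tile rather than with a Python loop over every cell.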
Computational benchmark
Explore the benchmark pe
