
<a href="https://kornia.readthedocs.io">Docs</a> • <a href="https://colab.sandbox.google.com/github/kornia/tutorials/blob/master/nbs/hello_world_tutorial.ipynb">Try it Now</a> • <a href="https://kornia.github.io/tutorials/">Tutorials</a> • <a href="https://github.com/kornia/kornia-examples">Examples</a> • <a href="https://kornia.github.io//kornia-blog">Blog</a> • <a href="https://discord.gg/HfnywwpBnD">Community</a>

</div>

Kornia is a differentiable computer vision library that provides a rich set of differentiable image processing and geometric vision algorithms. Built on top of PyTorch, Kornia integrates seamlessly into existing AI workflows, allowing you to leverage powerful batch transformations, auto-differentiation and GPU acceleration. Whether you're working on image transformations, augmentations, or AI-driven image processing, Kornia equips you with the tools you need to bring your ideas to life.

📢 Announcement: Kornia is shifting towards end-to-end vision models. We are focusing on integrating state-of-the-art Vision Language Models (VLM) and Vision Language Agents (VLA) to provide comprehensive end-to-end vision solutions.

Key Components

Differentiable Image Processing

Kornia provides a comprehensive suite of image processing operators, all differentiable and ready to integrate into deep learning pipelines.
- Filters: Gaussian, Sobel, Median, Box Blur, etc.
- Transformations: Affine, Homography, Perspective, etc.
- Enhancements: Histogram Equalization, CLAHE, Gamma Correction, etc.
- Edge Detection: Canny, Laplacian, Sobel, etc.
- ... check our docs for more.

Advanced Augmentations

Perform powerful data augmentation with Kornia’s built-in functions, ideal for training AI models with complex augmentation pipelines.
- Augmentation Pipeline: AugmentationSequential, PatchSequential, VideoSequential, etc.
- Automatic Augmentation: AutoAugment, RandAugment, TrivialAugment.

AI Models

Leverage pre-trained AI models optimized for a variety of vision tasks, all within the Kornia ecosystem.
- Face Detection: YuNet
- Feature Matching: LoFTR, LightGlue
- Feature Descriptor: DISK, DeDoDe, SOLD2
- Segmentation: SAM
- Classification: MobileViT, VisionTransformer.

<details> <summary>See here for some of the methods that we support! (>500 ops in total!)</summary>

| Category | Methods/Models |
|----------|----------------|
| Image Processing | - Color conversions (RGB, Grayscale, HSV, etc.)<br>- Geometric transformations (Affine, Homography, Resizing, etc.)<br>- Filtering (Gaussian blur, Median blur, etc.)<br>- Edge detection (Sobel, Canny, etc.)<br>- Morphological operations (Erosion, Dilation, etc.) |
| Augmentation | - Random cropping, Erasing<br>- Random geometric transformations (Affine, flipping, Fish Eye, Perspective, Thin plate spline, Elastic)<br>- Random noises (Gaussian, Median, Motion, Box, Rain, Snow, Salt and Pepper)<br>- Random color jittering (Contrast, Brightness, CLAHE, Equalize, Gamma, Hue, Invert, JPEG, Plasma, Posterize, Saturation, Sharpness, Solarize)<br>- Random MixUp, CutMix, Mosaic, Transplantation, etc. |
| Feature Detection | - Detector (Harris, GFTT, Hessian, DoG, KeyNet, DISK and DeDoDe)<br>- Descriptor (SIFT, HardNet, TFeat, HyNet, SOSNet, and LAFDescriptor)<br>- Matching (nearest neighbor, mutual nearest neighbor, geometrically aware matching, AdaLAM, LightGlue, and LoFTR) |
| Geometry | - Camera models and calibration<br>- Stereo vision (epipolar geometry, disparity, etc.)<br>- Homography estimation<br>- Depth estimation from disparity<br>- 3D transformations |
| Deep Learning Layers | - Custom convolution layers<br>- Recurrent layers for vision tasks<br>- Loss functions (e.g., SSIM, PSNR, etc.)<br>- Vision-specific optimizers |
| Photometric Functions | - Photometric loss functions<br>- Photometric augmentations |
| Filtering | - Bilateral filtering<br>- DexiNed<br>- Dissolving<br>- Guided Blur<br>- Laplacian<br>- Gaussian<br>- Non-local means<br>- Sobel<br>- Unsharp masking |
| Color | - Color space conversions<br>- Brightness/contrast adjustment<br>- Gamma correction |
| Stereo Vision | - Disparity estimation<br>- Depth estimation<br>- Rectification |
| Image Registration | - Affine and homography-based registration<br>- Image alignment using feature matching |
| Pose Estimation | - Essential and Fundamental matrix estimation<br>- PnP problem solvers<br>- Pose refinement |
| Optical Flow | - Farneback optical flow<br>- Dense optical flow<br>- Sparse optical flow |
| 3D Vision | - Depth estimation<br>- Point cloud operations |
| Image Denoising | - Gaussian noise removal<br>- Poisson noise removal |
| Edge Detection | - Sobel operator<br>- Canny edge detection |
| Transformations | - Rotation<br>- Translation<br>- Scaling<br>- Shearing |
| Loss Functions | - SSIM (Structural Similarity Index Measure)<br>- PSNR (Peak Signal-to-Noise Ratio)<br>- Cauchy<br>- Charbonnier<br>- Depth Smooth<br>- Dice<br>- Hausdorff<br>- Tversky<br>- Welsch |
| Morphological Operations | - Dilation<br>- Erosion<br>- Opening<br>- Closing |

</details>

Half-Precision Support

| Module | float16 | bfloat16 | Notes |
|--------|:-------:|:--------:|-------|
| `kornia.color` | ⚠️ | ⚠️ | Most conversions work for both; FFT-based ops may fail |
| `kornia.filters` | ⚠️ | ⚠️ | Basic filters work; FFT-based ops may fail on CUDA |
| `kornia.enhance` | ⚠️ | ⚠️ | Histogram eq / gamma / ZCA work (linalg ops use cast helpers) |
| `kornia.morphology` | ✅ | ✅ | Pure conv/pool ops; no dtype restrictions |
| `kornia.augmentation` | ⚠️ | ⚠️ | Most ops work; precision-sensitive transforms may be inaccurate |
| `kornia.geometry.transform` | ⚠️ | ⚠️ | Affine/warp/resize work via cast helpers; thin-plate spline may fail |
| `kornia.geometry.camera` | ⚠️ | ⚠️ | Pinhole model and most camera ops work; StereoCamera accepts both |
| `kornia.geometry.calibration` | ❌ | ❌ | Explicitly accepts float32/float64 only (PnP solver) |
| `kornia.geometry.epipolar` | ⚠️ | ⚠️ | SVD/inverse use cast helpers; both dtypes work |
| `kornia.geometry.homography` | ⚠️ | ⚠️ | Uses `_torch_svd_cast`; both dtypes work via casting |
| `kornia.geometry.liegroup` | ⚠️ | ⚠️ | Most ops work via cast helpers; some linalg paths may fail |
| `kornia.geometry.solvers` | ⚠️ | ⚠️ | Uses `_torch_solve_cast`; both dtypes work via casting |
| `kornia.geometry.subpix` | ⚠️ | ⚠️ | Soft-argmax works; precision-sensitive ops may be inaccurate |
| `kornia.losses` | ⚠️ | ⚠️ | Photometric losses work; linalg-based losses may not |
| `kornia.feature` | ⚠️ | ⚠️ | Detectors/descriptors work; matching uses manual cdist fallback |
| `kornia.metrics` | ⚠️ | ⚠️ | Pixel-level metrics work; linalg-based metrics may not |
| `kornia.models` | ⚠️ | ⚠️ | Conv-based models work; attention-based models may have dtype mismatches |

✅ Supported ⚠️ Partial ❌ Not supported
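
Several notes in the table mention "cast helpers". The general idea can be sketched in plain PyTorch as below: run the linear-algebra step in float32 and cast the results back to the caller's dtype. The function `svd_with_cast` is a hypothetical name written for this example; it is not Kornia's `_torch_svd_cast` and may differ from it in detail.

```python
import torch

def svd_with_cast(a: torch.Tensor):
    """Hypothetical helper: compute the SVD in float32 and return the
    factors in the dtype of the input (e.g. float16 or bfloat16)."""
    dtype = a.dtype
    # Half-precision inputs are upcast; float32/float64 are used as-is.
    work = a if dtype in (torch.float32, torch.float64) else a.to(torch.float32)
    u, s, vh = torch.linalg.svd(work, full_matrices=False)
    return u.to(dtype), s.to(dtype), vh.to(dtype)

torch.manual_seed(0)
a = torch.rand(2, 3, 3).to(torch.float16)

u, s, vh = svd_with_cast(a)

# Rebuild the matrices in float32 to see how much precision the round trip costs.
recon = (u.float() * s.float().unsqueeze(-2)) @ vh.float()
max_err = (recon - a.float()).abs().max().item()
print(u.dtype, max_err)
```

The factors come back in half precision, so downstream code keeps a single dtype, at the cost of a small reconstruction error from the final cast.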

Test results (commit 6131e98, 2026-03-21):

| Run | Passed | Failed | Skipped | Pass% |
|-----|-------:|-------:|--------:|------:|
| CPU float32 (baseline) | 7647 | 3 | 3269 | 99.9% |
| CUDA float32 (baseline) | 7634 | 3 | 3280 | 99.9% |
| CPU float16 | 6866 | 747 | 3306 | 90.1% |
| CPU bfloat16 | 6838 | 812 | 3269 | **
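
The reported Pass% values appear consistent with `passed / (passed + failed)`, with skipped tests excluded and the result truncated (not rounded) to one decimal. That formula is an inference from the numbers, not something the table states. The short check below applies it to the three rows whose Pass% is fully visible; the CPU bfloat16 row is left out because its percentage is cut off in the source.

```python
import math

# (passed, failed, skipped, reported Pass%) copied from the rows with a complete Pass%.
runs = {
    "CPU float32 (baseline)": (7647, 3, 3269, 99.9),
    "CUDA float32 (baseline)": (7634, 3, 3280, 99.9),
    "CPU float16": (6866, 747, 3306, 90.1),
}

def pass_rate(passed: int, failed: int) -> float:
    """Share of executed (non-skipped) tests that passed, as a percentage
    truncated to one decimal place. Assumed formula, see the text above."""
    return math.floor(1000 * passed / (passed + failed)) / 10

for name, (passed, failed, skipped, reported) in runs.items():
    computed = pass_rate(passed, failed)
    print(f"{name}: computed {computed}%, reported {reported}%")
```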
