SkillAgentSearch skills...

Learnopencv

Learn OpenCV : C++ and Python Examples

Install / Use

/learn @spmallick/Learnopencv

README

LearnOpenCV

This repository contains code for Computer Vision, Deep learning, and AI research articles shared on our blog LearnOpenCV.com.

Want to become an expert in AI? AI Courses by OpenCV is a great place to start.

<a href="https://opencv.org/courses/"> <p align="center"> <img src="https://learnopencv.com/wp-content/uploads/2023/01/AI-Courses-By-OpenCV-Github.png"> </p> </a>

List of Blog Posts

| Blog Post | Code| | ------------- |:-------------| | YOLO26 Instance Segmentation: Pixel-Perfect AI at Real-Time Speed | Code | | Multi-Object Tracking with Roboflow Trackers and OpenCV | Code | | Real-Time Face Blur and Pixelation with OpenCV YuNet | Code | | Breaking the Bottleneck: Achieving Native NMS-Free Inference with YOLO26 | Code | | YOLOv26: An Object Detector Built for Real-Time Deployment | Code | | Beyond Transformers: A Deep Dive into HOPE | | | Serving SGLang: Launch a Production-Style Server | | |Deployment on Edge: LLM Serving on Jetson using vLLM|Code| |Nested Learning: Is Deep Learning Architecture an Illusion?|| | How to Build a GitHub Code-Analyser Agent for Developer Productivity | Code | | The Existential Problems in LLM Serving | | | SAM 3D: Foundation Model for Single-Image 3D Reconstruction | | | SAM-3: What’s New, How It Works, and Why It Matters | Code | | Image-GS: Adaptive Image Reconstruction using 2D Gaussians | Code | | Ultimate Guide to Vector Databases and RAG Pipeline | Code | |What Makes DeepSeek OCR So Powerful|Code| | 2D Gaussian Splatting: Geometrically Accurate Radiance Field Reconstruction | Code | | TRM: Tiny Recursive Models | Code | |Deploying ML Models on Arduino: From Blink to Think|Code| | VideoRAG: Redefining Long-Context Video Comprehension | | | AI Agent in Action: Automating Desktop Tasks with VLMs | Code | | Top VLM Evaluation Metrics for Optimal Performance Analysis | Code | |Getting Started with VLM on Jetson Nano|Code| | VLM on Edge: Worth the Hype or Just a Novelty? | Code | | AnomalyCLIP : Harnessing CLIP for Weakly-Supervised Video Anomaly Recognition | Code | | AI_for_Video_Understanding_From_Content_Moderation_to_Summarization | Code | | Video-RAG: Training-Free Retrieval for Long-Video LVLMs | Code | | Object Detection and Spatial Understanding with VLMs ft. Qwen2.5-VL | Code | | LangGraph: Building Self-Correcting RAG Agent for Code Generation | Code | | Inside Sinusoidal Position Embeddings: A Sense of Order | Code | | Inside RoPE: Rotary Magic into Position Embeddings | Code | | SimLingo-Vision-Language-Action-Model-for-Autonomous-Driving | Code | | FineTuning Gemma 3n for Medical VQA on ROCOv2 | Code | | SmolLM3 Blueprint: SOTA 3B-Parameter LLM | | | LangGraph-A-Visual-Automation-and-Summarization-Pipeline | Code | | Fine-Tuning AnomalyCLIP: Class-Agnostic Zero-Shot Anomaly Detection | Code | | SigLIP 2: DeepMind’s Multilingual Vision-Language Model | | | MedGemma: Google’s Medico VLM for Clinical QA, Imaging, and More | Code | | Nanonets-OCR-s: Enabling Rich, Structured Markdown for Document Understanding | | | Optimizing VJEPA-2: Tackling Latency & Context in Real-Time Video Classification Scripts | Code | | V-JEPA 2: Meta’s Breakthrough in AI for the Physical World | Code | | NVIDIA Cosmos Reason1: Video Understanding | Code | | GR00T N1.5 Explained | | | LLaVA | Code | | SmolVLA: Affordable & Efficient VLA Robotics on Consumer GPUs | Code | | Fine-Tuning Grounding DINO: Open-Vocabulary Object Detection | Code | | Getting Started with Qwen3 – The Thinking Expert | Code | | [Inside the GPU: A Comprehensive Guide to Modern Graphics Architecture](https://learnopencv.com/modern-gpu-archit

View on GitHub
GitHub Stars22.9k
CategoryEducation
Updated5h ago
Forks11.7k

Languages

Jupyter Notebook

Security Score

85/100

Audited on Apr 2, 2026

No findings