Nos
Module that automatically maximizes the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas. Effortless optimization at its finest!
Nebuly Operating System (nos)

Documentation: <a href="https://nebuly-ai.github.io/nos/overview" target="_blank"> docs.nebuly.com/nos/overview </a>
If you like the project, please support it by leaving a star ✨
nos is the open-source module for running AI workloads efficiently on Kubernetes:
it increases GPU utilization, cuts infrastructure costs, and improves workload performance.
Currently, the available features are:
- Dynamic GPU partitioning: lets you schedule Pods that request fractions of a GPU. Partitioning is performed automatically and in real time, based on the Pods pending and running in the cluster, so that Pods can request only the resources they strictly need and GPUs stay fully utilized.
- Elastic Resource Quota management: increases the number of Pods running on the cluster by allowing namespaces to borrow reserved resource quotas from other namespaces for as long as those namespaces are not using them.
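
The borrowing idea behind elastic quotas can be illustrated with a little arithmetic. The sketch below is a simplified model and not the nos implementation: the function name `borrowable_quota` and the numbers are hypothetical, and it only assumes that whatever part of a reservation a namespace leaves idle is what other namespaces could borrow.

```shell
# Illustrative only: a simplified model of quota borrowing, not the nos implementation.
# Prints how much of a namespace's reserved quota is idle and could be lent out.
borrowable_quota() {
  reserved="$1"   # units reserved for the lending namespace (hypothetical value)
  used="$2"       # units that namespace is currently using
  idle=$((reserved - used))
  # A namespace already running over its reservation has nothing to lend.
  [ "$idle" -gt 0 ] || idle=0
  echo "$idle"
}

# Example: team-a reserves 4 GPUs and uses 1, so up to 3 could be borrowed.
borrowable_quota 4 1   # prints 3
```

In this model, borrowed capacity is only available while the lending namespace leaves it unused, which matches the "as long as they are not using them" condition above.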

Getting started
Prerequisites
- Kubernetes v1.23 or newer
- GPU support must be enabled
- Nebuly k8s-device-plugin (optional, required only if you want to enable MPS partitioning)
- cert-manager (optional, but recommended)
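
As a rough way to check the first prerequisite, the sketch below compares a Kubernetes version string against the v1.23 minimum listed above. The helper name `k8s_version_ok` is hypothetical, and the commented usage assumes that `kubectl` and `jq` are installed and that `kubectl` can reach the cluster.

```shell
# Returns success (0) when the given Kubernetes version is at least 1.23.
# Accepts strings such as "v1.23.4" or "1.27.0-gke.100".
k8s_version_ok() {
  version="${1#v}"            # strip the leading "v", if any
  major="${version%%.*}"      # text before the first dot
  rest="${version#*.}"        # text after the first dot
  minor="${rest%%[!0-9]*}"    # leading digits of the minor component
  [ "$major" -gt 1 ] || { [ "$major" -eq 1 ] && [ "$minor" -ge 23 ]; }
}

# Usage sketch (assumes kubectl access to the cluster and jq):
# server_version="$(kubectl version --output=json | jq -r '.serverVersion.gitVersion')"
# k8s_version_ok "$server_version" && echo "Kubernetes version is supported"
```

The check covers only the Kubernetes version; GPU support and the optional components still need to be verified separately.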
Installation
You can install nos using Helm 3 (recommended).
You can find all the available configuration values in the Chart documentation.
helm install oci://ghcr.io/nebuly-ai/helm-charts/nos \
  --version 0.1.2 \
  --namespace nebuly-nos \
  --generate-name \
  --create-namespace
Alternatively, you can use Kustomize by cloning the repository and running make deploy.
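
After a Helm installation like the one above, a quick sanity check might look like the following sketch. It assumes the `nebuly-nos` namespace from the Helm command; the function name `check_nos_install` is hypothetical, and the block only defines the helper so that nothing runs against the cluster until you call it.

```shell
# Namespace passed to --namespace in the Helm command above.
NOS_NAMESPACE="nebuly-nos"

# Hypothetical helper: lists the Helm release and the Pods it created.
check_nos_install() {
  # --generate-name picks the release name, so list releases to find it.
  helm list --namespace "$NOS_NAMESPACE"
  # The Pods should eventually report a Running status.
  kubectl get pods --namespace "$NOS_NAMESPACE"
}
```

Run `check_nos_install` once `helm` and `kubectl` point at the target cluster. If you deployed with Kustomize instead, the namespace may differ, so adjust `NOS_NAMESPACE` accordingly.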
