Cortex
Production infrastructure for machine learning at scale
Install / Use
/learn @cortexlabs/CortexREADME
Note: This project is no longer actively maintained by its original authors.
Production infrastructure for machine learning at scale
Deploy, manage, and scale machine learning models in production.
<br>Serverless workloads
Realtime - respond to requests in real-time and autoscale based on in-flight request volumes.
Async - process requests asynchronously and autoscale based on request queue length.
Batch - run distributed and fault-tolerant batch processing jobs on-demand.
<br>Automated cluster management
Autoscaling - elastically scale clusters with CPU and GPU instances.
Spot instances - run workloads on spot instances with automated on-demand backups.
Environments - create multiple clusters with different configurations.
<br>CI/CD and observability integrations
Provisioning - provision clusters with declarative configuration or a Terraform provider.
Metrics - send metrics to any monitoring tool or use pre-built Grafana dashboards.
Logs - stream logs to any log management tool or use the pre-built CloudWatch integration.
<br>Built for AWS
EKS - Cortex runs on top of EKS to scale workloads reliably and cost-effectively.
VPC - deploy clusters into a VPC on your AWS account to keep your data private.
IAM - integrate with IAM for authentication and authorization workflows.
Related Skills
tmux
353.3kRemote-control tmux sessions for interactive CLIs by sending keystrokes and scraping pane output.
diffs
353.3kUse the diffs tool to produce real, shareable diffs (viewer URL, file artifact, or both) instead of manual edit summaries.
blogwatcher
353.3kMonitor blogs and RSS/Atom feeds for updates using the blogwatcher CLI.
product
Cloud-agnostic Kubernetes infrastructure with Terraform & Helm for homelabs, edge, and production clusters.
