45 skills found · Page 1 of 2
horovod / HorovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
tencentmusic / Cube Studiocube studio开源云原生一站式机器学习/深度学习/大模型AI平台,mlops算法链路全流程,算力租赁平台,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU虚拟化,边缘计算,标注平台自动化标注,deepseek等大模型sft微调/奖励模型/强化学习训练,vllm/ollama/mindie大模型多机推理,私有知识库,AI模型市场,支持国产cpu/gpu/npu 昇腾生态,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/ray/volcano等分布式
tony-framework / TonYTonY is a framework to natively run deep learning frameworks on Apache Hadoop.
kubeflow / Mpi OperatorKubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
tensorlayer / Awesome TensorlayerA curated list of dedicated resources and applications
jzlianglu / Pykaldi2Yet another speech toolkit based on Kaldi and PyTorch
guotong1988 / BERT Pre Trainingmulti-gpu pre-training in one machine for BERT without horovod (Data Parallelism)
Photon-AI-Research / NeuralSolversNeural network based solvers for partial differential equations and inverse problems :milky_way:. Implementation of physics-informed neural networks in pytorch.
open-ce / Open CeThis repository provides the Open-CE environment files and version definitions for each Open-CE release.
horovod / TutorialsTutorials for Horovod
saforem2 / L2hmc QcdApplication of the L2HMC algorithm to simulations in lattice QCD.
polyaxon / Polyaxon ExamplesCode for tutorials and examples
davidrpugh / Horovod Gpu Data Science ProjectTemplate repository for a Python 3-based data science project that uses Horovod.
NUS-HPC-AI-Lab / LARS ImageNet PyTorchAccuracy 77%. Large batch deep learning optimizer LARS for ImageNet with PyTorch and ResNet, using Horovod for distribution. Optional accumulated gradient and NVIDIA DALI dataloader.
ShomyLiu / Torch Ddp ExamplesA text classification example using ddp horovod and accelerate
heyfey / VodaschedulerGPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)
aws-samples / Sagemaker Horovod Distributed TrainingDistributed training with SageMaker's script mode using Horovod distributed deep learning framework
Qznan / QizNLPQuick run NLP in many task 快速运行分类、序列标注、匹配、生成等NLP任务的Tensorflow框架 (中文 NLP 支持分布式)
ankurhanda / Tf Unettensorflow version of unet
msalvaris / BatchAIHorovodBenchmarkBenchmarking Horovod and TF on Batch AI