EMI-Group / Evox: Distributed GPU-Accelerated Framework for Evolutionary Computation. Comprehensive Library of Evolutionary Algorithms & Benchmark Problems.
openproblems-bio / Openproblems: Formalizing and benchmarking open problems in single-cell genomics
power-grid-lib / Pglib Opf: Benchmarks for the Optimal Power Flow Problem
android-bench / Android Bench: Android Bench is a framework for benchmarking Large Language Models (LLMs) on Android development tasks. It evaluates an AI model's ability to understand mobile codebases, generate accurate patches, and solve Android-specific engineering problems.
Aider-AI / Polyglot Benchmark: Coding problems used in aider's polyglot benchmark
robin-shaun / Multi UAV Task Assignment Benchmark: A Benchmark for Multi-UAV Task Allocation of an Extended Team Orienteering Problem
OpenBMB / OlympiadBench: [ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.
scicode-bench / SciCode: A benchmark that challenges language models to code solutions for scientific problems
ai-for-decision-making-tue / Job Shop Scheduling Benchmark Environments And Instances: A benchmarking repo with various solution methods to various machine scheduling problems
FrontierCS / Frontier CS: A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.
thieu1995 / Opfunu: A collection of Benchmark functions for numerical optimization problems
automl / HPOBench: Collection of hyperparameter optimization benchmark problems
NVIDIA / SOL ExecBench: A benchmark of real-world DL kernel problems
aks2203 / Poisoning Benchmark: A unified benchmark problem for data poisoning attacks
logicchains / LPATHBench: Benchmarks of the longest path problem in various languages
GeminiLight / Virne: [ICLR '26 - Virne] A simulator & benchmark for resource allocation (RA) problems in network function virtualization (NFV), i.e., NFV-RA, including virtual network embedding, service function chain deployment, network slicing, etc.
apexrl / GCRL Collection: This repo relates to the survey paper "Goal-Conditioned Reinforcement Learning: Problems and Solutions". We collect widely used benchmark environments and summarize a series of research works on goal-conditioned reinforcement learning (GCRL).
EMI-Group / Evoxbench: Transforming Neural Architecture Search (NAS) into multi-objective optimization problems. A benchmark suite for testing evolutionary algorithms in deep learning.
tamy0612 / JSPLIB: Benchmark instances for job-shop scheduling problem
google / Ceviche Challenges: A suite of photonic inverse design challenge problems for topology optimization benchmarking