StonyBrookNLP / Appworld: 🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource Paper.
TalEliyahu / Awesome CISO Maturity Models: Maturity models help integrate traditionally separate organizational functions, set process improvement goals and priorities, provide guidance for quality processes, and provide a benchmark for appraising current process outcomes.
Compaile / Ctrack: A lightweight, high-performance C++ benchmarking and tracking library for effortless function profiling in both development and production environments. Features single-header integration, minimal overhead, multi-threaded support, customizable output, and advanced metrics for quick bottleneck detection in complex codebases.
lobocv / Pyperform: An easy and convenient way to performance test Python code.
lif314 / X KANeRF: X-KANeRF [KANeRF-benchmarking]: KAN based NeRF with various basis functions like B-Splines, Fourier, Gaussians, Wavelets, Polynomials, etc.
thieu1995 / Opfunu: A collection of Benchmark functions for numerical optimization problems
chengzhengxin / Groupsoftmax Simpledet: GroupSoftmax cross entropy loss function for training with multiple different benchmark datasets
zai-org / ComplexFuncBench: Complex Function Calling Benchmark.
GeminiLight / Virne: [ICLR '26 - Virne] A simulator & benchmark for resource allocation (RA) problems in network function virtualization (NFV), i.e., NFV-RA, including virtual network embedding, service function chain deployment, network slicing, etc.
philschmid / AI Agent Benchmark Compendium: Compendium of over 50 benchmarks for evaluating AI agents, categorized into Function Calling & Tool Use, General Assistant & Reasoning, Coding & Software Engineering, and Computer Interaction.
jamboree / CxxFunctionBenchmark: Benchmark for various C++ function implementations; focus on invocation time
ComposioHQ / Composio Function Calling Benchmark: Function Calling Benchmark & Testing
cameron314 / Microbench: A lightweight (3 file, single function) library for running micro-benchmarks on C++ code
tsingke / CEC Benchmark Functions: CEC (International Congress on Evolutionary Computation) test functions. CEC Benchmark Functions
mazhar-ansari-ardeh / BenchmarkFcns: A Python and MATLAB implementation of mathematical test functions for benchmarking optimization algorithms.
multimodal-interpretability / FIND: Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents
sigopt / Evalset: Benchmark suite of test functions suitable for evaluating black-box optimization strategies
fredrikwidlund / Hash Function Benchmark: Benchmark of common hash functions
roberto-trani / Mphf Benchmark: A Benchmark of Minimal Perfect Hash Function Algorithms.
jungtaekkim / Bayeso Benchmarks: Benchmark functions for Bayesian optimization