871 skills found · Page 1 of 30
LearningCircuit / Local Deep ResearchLocal Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini). Supports local and cloud LLMs (Ollama, Google, Anthropic, ...). Searches 10+ sources - arXiv, PubMed, web, and your private documents. Everything Local & Encrypted.
denji / Awesome Http BenchmarkHTTP(S) benchmark tools, testing/debugging, & restAPI (RESTful)
minitest / Minitestminitest provides a complete suite of testing facilities supporting TDD, BDD, and benchmarking.
bojand / GhzSimple gRPC benchmarking and load testing tool
phoronix-test-suite / Phoronix Test SuiteThe Phoronix Test Suite open-source, cross-platform automated testing/benchmarking software.
joedicastro / Vps ComparisonA comparison between some VPS providers. It uses Ansible to perform a series of automated benchmark tests over the VPS servers that you specify. It allows the reproducibility of those tests by anyone that wanted to compare these results to their own. All the tests results are available in order to provide independence and transparency.
kubernetes / Perf TestsPerformance tests and benchmarks
HewlettPackard / NetperfNetperf is a benchmark that can be used to measure the performance of many different types of networking. It provides tests for both unidirectional throughput, and end-to-end latency.
n-st / NenchVPS benchmark script — based on the popular bench.sh, plus CPU and ioping tests, and dual-stack IPv4 and v6 speedtests by default
microsoft / WindowsAgentArenaWindows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
maxim-saplin / CrossPlatformDiskTestWindows, macOS and Android storage (HDD, SSD, RAM) speed testing/performance benchmarking app
OWASP-Benchmark / BenchmarkJavaOWASP Benchmark is a test suite designed to verify the speed and accuracy of software vulnerability detection tools. A fully runnable web app written in Java, it supports analysis by Static (SAST), Dynamic (DAST), and Runtime (IAST) tools that support Java. The idea is that since it is fully runnable and all the vulnerabilities are actually exploitable, it’s a fair test for any kind of vulnerability detection tool. For more details on this project, please see the OWASP Benchmark Project home page.
dotnet / PerformanceThis repo contains benchmarks used for testing the performance of all .NET Runtimes
howardjohn / Gateway Api BenchGateway API Benchmarks provides a common set of tests to evaluate a Gateway API implementation.
SanMuzZzZz / LuaN1aoAgentLuaN1aoAgent is a cognitive-driven AI hacker. It is a fully autonomous AI penetration testing agent powered by DeepSeek V3.2. Using dual-graph reasoning, LuaN1ao achieves a success rate of over 90% on the XBOW Benchmark, with a median exploit cost of just $0.09.
ServiceNow / AgentLabAgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.
petabridge / NBenchPerformance benchmarking and testing framework for .NET applications :chart_with_upwards_trend:
mrdbourke / M1 Machine Learning TestCode for testing various M1 Chip benchmarks with TensorFlow.
Voultapher / Sort Research RsTest and benchmark suite for sort implementations.
alipay / Ant Application Security Testing BenchmarkxAST评价体系,让安全工具不再“黑盒”. The xAST evaluation benchmark makes security tools no longer a "black box".