17 skills found
google-gemini / behavioral-evalsGuidance for creating, running, fixing, and promoting behavioral evaluations
ValueCell-ai / valuecellValueCell is a community-driven, multi-agent platform for financial applications.
google / adk-goAn open-source, code-first Go toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
Kiln-AI / KilnBuild, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, and more.
evalstate / fast-agentCode, Build and Evaluate agents - excellent Model and Skills/MCP/ACP Support
Tencent / AI-Infra-GuardA full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP scan, AI Infra scan and LLM jailbreak evaluation.
sahibzada-allahyar / YC-KillerA library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.
refreshdotdev / web-eval-agentAn MCP server that autonomously evaluates web applications.
Devin-AXIS / A2VA2V: Next-Gen AI Value Compute Protocol.
luongnv89 / claude-howtoA visual, example-driven guide to Claude Code — from basic concepts to advanced agents, with copy-paste templates that bring immediate value.
VoAPI / VoAPI🎉 全新下一代高颜值、高性能、高扩展的智能AI大模型API聚合分发系统 | A new next-generation high-value, high-performance, and highly scalable intelligent AI large-model API aggregation and distribution
oxbshw / LLM-Agents-Ecosystem-HandbookOne-stop handbook for building, deploying, and understanding LLM agents with 60+ skeletons, tutorials, ecosystem guides, and evaluation tools.
dedalus-labs / dedalus-mcp-pythonA simple and performant Model Context Protocol framework for Python.
MCP server providing semantic Java code analysis for AI agents. Built on Eclipse JDT with tools for navigation, refactoring, search, and metrics.
dannySubsense / youtube-mcp-serverA comprehensive Model Context Protocol (MCP) server providing real-time YouTube Data API access for AI assistants. Features 14 functions including intelligent content evaluation with technology freshness scoring for knowledge base curation.
agentic-community / openapi-to-mcpTransform OpenAPI specifications into production-ready MCP servers with AI-powered evaluation and enhancement. Leverages LLMs to analyze, improve, and generate Model Context Protocol implementations from your existing API documentation.
Fuenfgeld / pydantic-ai-skillsProduction-ready Claude Code skills for building AI agents with Pydantic AI. Includes dependency injection, tools, validators, streaming, multi-agent orchestration, and evaluation framework patterns.