8 skills found
xlang-ai / Xlang Paper ReadingPaper collection on building and evaluating language model agents via executable language grounding
VisualWebBench / VisualWebBenchEvaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
MaxiDonkey / DelphiGeminiDelphi wrapper for the Google Gemini API: stateless generation and agent workflows with multimodal, streaming, persistent interactions, structured outputs, vector search, batch, image/video generation, and grounding (Search/Maps).
ru4ls / ComfyUI Nano BananaA set of ComfyUI NanoBanana 2 and NanoBanana Pro custom nodes for Text-to-Image, Image-to-Image, and Image Fusion generation using both Gemini API and VertexAI with multi-image, grounding with web and image search, and multiturn chat features
carlyrichmond / Webdevcon Grounding Rag Applications WorkshopGrounding RAG Applications with JavaScript, Langchain and Elasticsearch @ Webdevcon NL
Princeton-AI2-Lab / Avenir WebOfficial implementation of the paper: "Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts"
KiwiGaze / OmniSearchesOmniSearches: A free, open-source AI search engine. Get AI-powered answers with web sources, powered by Gemini 2.0 Flash, Deepseek, and real-time Google Search grounding. Image search via Wikimedia Commons.
santanaraphael / Grounding WebsiteA open-source software designed to design earthing grids on AC substations. Made by an engineer to engineers.