Results for "llm-crawler"

30 skills found

unclecode / Crawl4ai

62.6k

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

universal

Updated just now

any4ai / AnyCrawl

2.8k

AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/Baidu/etc. Native multi-threading for bulk processing.

universal

ai-scraping, aitools, crawl (+7 more)

Updated 8h ago

oxylabs / Oxylabs AI Studio Py

2.6k

Structured data gathering from any website using an AI-powered scraper, crawler, and browser automation. Scraping and crawling with natural language prompts. Equip your LLM agents with fresh data. AI Studio Python SDK for intelligent web data gathering.

universal

ai-crawler, ai-scraper, ai-scraping (+9 more)

Updated 7h ago

watercrawl / WaterCrawl

1.8k

Transform Web Content into LLM-Ready Data

universal

aicrawler, crawl4ai, crawler (+5 more)

Updated 13h ago

paulpierre / Markdown Crawler

435

A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG

universal

html-to-markdown, html-to-markdown-converter, html2md (+9 more)

Updated 18h ago

BrowserCash / Teracrawl

237

High-performance web crawler API optimized for LLMs. Turn any search or website into clean Markdown using remote browsers. Firecrawl alternative

zed, claude code (+1 more)

ai-agents, ai-crawler, ai-scraping (+15 more)

Updated 7d ago

eddyhhlure1Eddy / News Analyzer

160

An open-source RSS crawler with an LLM interface that can use an LLM to analyze news feeds.

universal

Updated 3mo ago

Aavache / LLMWebCrawler

A Web Crawler based on LLMs implemented with Ray and Huggingface. The embeddings are saved into a vector database for fast clustering and retrieval. Use it for your RAG.

universal

api, distributed-computing, fastapi (+15 more)

Updated 22d ago

Sriram-PR / Doc Scraper

Go web crawler to scrape documentation sites and convert content to clean Markdown for LLM ingestion (RAG, training data).

universal

data-preparation, llm, web-scraper

Updated 1d ago

pc8544 / Website Crawler

Extract data from websites in LLM-ready JSON or CSV format. Crawl or scrape an entire website with Website Crawler.

universal

crawler, data, json (+4 more)

Updated 1mo ago

oxylabs / Oxylabs AI Studio Js

universal

ai-crawler, ai-map, ai-scraper (+11 more)

Updated 5d ago

rowyio / LLM Web Crawler

Web Scraper and Crawler for LLM Apps and AI Workflows with NoCode / LowCode. Plug and play with your own logic and customize it flexibly and scalably on BuildShip.

universal

ai, automation, crawler (+7 more)

Updated 2mo ago

hoangsonww / AI Gov Content Curator

💡An end-to-end solution for aggregating, summarizing, and displaying news articles using an AI-powered backend, an automated CRON crawler & newsletter emailer, and a responsive Next.js frontend. It integrates technologies like Express.js, MongoDB, Puppeteer, and GenAI/LLMs to deliver up-to-date, curated content to government staff and other users.

universal

artificial-intelligence, axios, cheerio (+17 more)

Updated 19h ago

eavae / Feilian

LLM-based crawler

universal

Updated 2mo ago

lennyerik / Crawl4ai Proxy

A simple proxy server to integrate crawl4ai with OpenWebUI

universal

adapter, ai, ai-crawler (+11 more)

Updated 3d ago

Kenn3o3 / Easy LLM ArXiv Paper Crawler

A Python tool to crawl historical arXiv papers from specified categories, filter them using a custom LLM prompt via Alibaba Cloud's DashScope API, and export results to a CSV file with paper names and PDF links. Ideal for researchers seeking comprehensive, tailored paper collections.

universal

Updated 5mo ago

GramosoftAI / GcrawlAI

Turn any website into clean, LLM-ready data. Open-source web crawler with stealth mode, distributed crawling, real-time WebSocket progress & Markdown output. Power your AI apps with GcrawlAI.

universal

ai, celery, data-pipeline (+16 more)

Updated 8d ago

us / Crw

⚡Lightweight Firecrawl alternative in Rust — 91.5% coverage, 5x faster, 3MB RAM. Web scraper & crawler with MCP server for Claude, LLM extraction, JS rendering.

claude code, claude desktop (+1 more)

ai, ai-agents, crawler (+13 more)

Updated 10h ago

malvads / Mojo

Non-sucking, cross-platform, extremely fast C++ crawler that converts entire websites into LLM-readable data

universal

crawler, llm, rag

Updated 13d ago

xVc323 / Omnidocs

Automated documentation crawler that generates LLM-friendly Markdown from any docs site. Export as single or multi-file, ready for AI ingestion.

universal

crawler, documentation, llm (+1 more)

Updated 1mo ago