Stormcrawlertest
Stormcrawler with Elasticsearch
Install / Use
/learn @cnf271/StormcrawlertestREADME
Web scraping and indexing with StormCrawler and Elasticsearch

This repository contains elasticsearch configurations for StormCrawler project.
StormCrawler is an open source SDK for building distributed web crawlers based on Apache Storm. Further clarifications on StormCrawler http://stormcrawler.net/
Run the injector using,
storm jar target/stormcrawlertest-1.0-SNAPSHOT.jar org.apache.storm.flux.Flux --local es-injector.flux
Run the crawler using,
storm jar target/stormcrawlertest-1.0-SNAPSHOT.jar org.apache.storm.flux.Flux --local es-crawler.flux
Related Skills
node-connect
341.2kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
84.5kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
341.2kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
commit-push-pr
84.5kCommit, push, and open a PR
