Stormcrawlertest

StormCrawler with Elasticsearch


Web scraping and indexing with StormCrawler and Elasticsearch

Related Medium Article

This repository contains Elasticsearch configurations for a StormCrawler project.

StormCrawler is an open-source SDK for building distributed web crawlers on Apache Storm. For more information about StormCrawler, see http://stormcrawler.net/

Run the injector with:

storm jar target/stormcrawlertest-1.0-SNAPSHOT.jar org.apache.storm.flux.Flux --local es-injector.flux

Run the crawler with:

storm jar target/stormcrawlertest-1.0-SNAPSHOT.jar org.apache.storm.flux.Flux --local es-crawler.flux
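
The two steps can be chained in a small wrapper script. The following is a minimal sketch, not part of this repository: the function names `run_flux` and `run_all` and the `DRY_RUN` and `JAR` variables are hypothetical, and the script assumes the jar has already been built at the path shown above and that the `storm` command is on the PATH.

```shell
#!/bin/sh
# Hypothetical helper: run the injector, then the crawler, in local mode.
# The jar path and .flux file names are taken from the commands above;
# DRY_RUN and JAR are illustrative variables, not Storm or StormCrawler options.
JAR="${JAR:-target/stormcrawlertest-1.0-SNAPSHOT.jar}"

run_flux() {
    # $1 is the Flux topology definition file to run.
    set -- storm jar "$JAR" org.apache.storm.flux.Flux --local "$1"
    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Print the command instead of executing it.
        echo "$*"
    else
        "$@"
    fi
}

run_all() {
    # Start the crawler only if the injector command exited successfully.
    run_flux es-injector.flux && run_flux es-crawler.flux
}
```

After sourcing the file, calling `run_all` with `DRY_RUN=1` set prints the two commands without executing them, which is a quick way to check the jar path before starting a real run.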
