Uscrapper
Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.
Install / Use
/learn @z0m31en7/UscrapperREADME
Uscrapper Vanta:
-
Dark Web Support: Uscrapper Vanta now has the capability to handle .onion or dark web links. This expanded functionality enables users to extract crucial information from previously inaccessible sources, providing a more comprehensive view of the digital landscape.
-
Keyword-Based Scraping: With the introduction of a new model, Uscrapper Vanta now allows users to scrape web pages for specific keywords or a list of keywords. This tailored approach enhances the tool's versatility, enabling users to focus on extracting only the information relevant to their needs.
Uscrapper extracts the following details from the provided website:
- Email Addresses: Displays email addresses found on the website.
- Social Media Links: Displays links to various social media platforms found on the website.
- Author Names: Displays the names of authors associated with the website.
- Geolocations: Displays geolocation information associated with the website.
- Non-Hyperlinked Details: Displays non-hyperlinked details found on the website including email addresses phone numbers and usernames.
- Keyword Based Extraction: Displays relevant data by specifying terms or curating comprehensive keyword lists.
<br><h2>📽 Preview:</h2><br>
<p align="center"><img src="https://github.com/z0m31en7/Uscrapper/blob/main/Images/Preview.png" alt="project-ss"></p><br> <p align="center"><img src="https://github.com/z0m31en7/Uscrapper/blob/main/Images/Preview2.png" alt="project-ss2"></p> <h2>🛠️ Installation Steps:</h2><br>git clone https://github.com/z0m31en7/Uscrapper.git
cd Uscrapper/install/
chmod +x ./install.sh && ./install.sh #For Unix/Linux systems
<br><h2>🔮 Usage:</h2>
<p>To run Uscrapper-vanta, use the following command-line syntax:</p>python Uscrapper-vanta.py [-h] [-u URL] [-O] [-ns] [-c CRAWL] [-t THREADS] [-k KEYWORDS [KEYWORDS ...]] [-f FILE]
<br><b>Arguments:</b>
- -u URL, --url URL (URL of the website)
- -O, --generate-report (Generate a report)
- -ns, --nonstrict (Display non-strict usernames (may show inaccurate results))
- -c CRAWL, --crawl (CRAWL) specify max number of links to Crawl and scrape within the same scope
- -t THREADS, --threads THREADS (Number of threads to utilize while crawling (default=4))
- -k KEYWORDS [KEYWORDS ...], --keywords KEYWORDS [KEYWORDS ...] (Keywords to search for (as space-separated arguments)
- -f FILE, --file FILE (Path to a text file containing keywords)
<br><h2>📜 Note:</h2>
-
Uscrapper relies on web scraping techniques to extract information from websites. Make sure to use it responsibly and in compliance with the website's terms of service and applicable laws.
-
The accuracy and completeness of the extracted details depend on the structure and content of the website being analyzed.
-
To bypass some Anti-Webscrapping methods we have used selenium which can make the overall process slower.
<br><h2>💌 Contribution:</h2><br> <b>Want a new feature to be added?</b><br>
- Make a pull request with all the necessary details and it will be merged after a review.
- You can contribute by making the regular expressions more efficient and accurate, or by suggesting some more features that can be added.
Related Skills
openai-image-gen
329.0kBatch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
claude-opus-4-5-migration
81.1kMigrate prompts and code from Claude Sonnet 4.0, Sonnet 4.5, or Opus 4.1 to Opus 4.5
model-usage
329.0kUse CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
TrendRadar
49.5k⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts.🎯 告别信息过载,你的 AI 舆情监控助手与热点筛选工具!聚合多平台热点 + RSS 订阅,支持关键词精准筛选。AI 智能筛选新闻 + AI 翻译 + AI 分析简报直推手机,也支持接入 MCP 架构,赋能 AI 自然语言对话分析、情感洞察与趋势预测等。支持 Docker ,数据本地/云端自持。集成微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 等渠道智能推送。
