SkillAgentSearch skills...

LinkedinScraper

Python Scrapy project parse people profiles of Linkedin Search and arrange result content in Excel and Json file

Install / Use

/learn @khaleddallah/LinkedinScraper
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

Linkedin Scraper using Scrapy

  • Scrape number of profiles that exist in result of Linkedin searchUrl.
  • Export the content of profiles to Excel and Json files.

Installation

  • Use the package manager pip to install Scrapy.
    (Anaconda Recomended)
cd LinkedinScraperProject     
pip install -r requirements.txt    
  • clone the project
git clone https://github.com/khaleddallah/GoogleImageScrapyDownloader.git

Usage

  • get into the directory of the project:
cd LinkedinScraperProject   
  • to get help :
python LinkedinScraper -h
<pre> <b>usage:</b> python LinkedinScraper [-h] [-n NUM] [-o OUTPUT] [-p] [-f format] [-m excelMode] (searchUrl or profilesUrl) <b>positional arguments:</b> searchUrl URL of Linkedin search URL or Profiles URL <b>optional arguments:</b> -h, --help show this help message and exit -n NUM num of profiles ** the number must be lower or equal of result number 'page' will parse profiles of url page (10 profiles) (Default) -o OUTPUT Output file -p Enable Parse Profiles -f FORMAT json Json output file excel Excel file output all Json and Excel output files -m EXCELMODE 1 to make each profile in Excel file appear in one row m to make each profile in Excel file appear in multi row </pre>

Examples

  • Parse <b>(</b> https://www.linkedin.com/in/khaled-dallah/ and https://www.linkedin.com/in/linustorvalds/ <b>) profiles</b> and export the result content to <b>ABC.xlsx</b> and <b>ABC.json</b> <br>(<b>-p</b>) because of parsing single profiles
python LinkedinScraper -p -o 'ABC' 'https://www.linkedin.com/in/khaled-dallah/' 'https://www.linkedin.com/in/linustorvalds/'
python LinkedinScraper -n 23 'https://www.linkedin.com/search/results/all/?keywords=Robotic&origin=GLOBAL_SEARCH_HEADER'
python LinkedinScraper -n 17 -f excel -m 1 'https://www.linkedin.com/search/results/all/?keywords=Robotic&origin=GLOBAL_SEARCH_HEADER'

Built with

  • Python 3.7
  • Scrapy
  • openpyxl

Author

  • Khaled Dallah - Software Engineer | Python/c++ Developer
    khaled.dallah0@gmail.com

Issues:

Report bugs and feature requests here.

Contribute

Contributions are always welcome!

License

This project is licensed under the LGPL-V3.0 License - see the LICENSE.md file for details

Related Skills

View on GitHub
GitHub Stars6
CategoryContent
Updated1y ago
Forks2

Languages

Python

Security Score

75/100

Audited on Apr 25, 2024

No findings