SkillAgentSearch skills...

Wikifier

Pytorch implementation of a BiLSTM model for the Wikification project.

Install / Use

/learn @LeonardoEmili/Wikifier
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

Wikifier

Wikification is the process of labeling input sentences into concepts from Wikipedia. The repository contains a major script for scraping text from Wikipedia dumps and parsing it into a dataset, the model for annotating sentences and an asynchronous web scraper for generating the dataset dynamically starting from a Wikipedia page used as seed.

Prerequisites

You can install the required dependencies using the Python package manager (pip):

pip3 install aiohttp
pip3 install cchardet
pip3 install aiodns
pip3 install wikipedia
pip3 install requests

Getting Started

First, we need to get the data. Wikiparser is a web scraper that loads dumps from XML files and stores the dataset as a collection of compressed files. You can run the script using the following syntax:

python3 WikiParser.py [OPTION]... URL... [-n NUM]
python3 WikiParser.py [OPTION]... [-n NUM]
python3 WikiParser.py [OPTION]... URL...

Built With

Authors

Related Skills

View on GitHub
GitHub Stars19
CategoryEducation
Updated1y ago
Forks0

Languages

Jupyter Notebook

Security Score

80/100

Audited on Mar 16, 2025

No findings