
ItalianModelExtractor

Extract a model of the Italian language for verb conjugations, PoS and hyponyms/hypernyms, using en.wiktionary, ConceptNet and WordNet


Italian model generator


NOTE

This 8-year-old project is archived because better tools now exist for the job. In particular, Wiktextract provides an excellent source of verb conjugations, PoS tags and, of course, a dictionary.

For the pattern matching part, have a look at spaCy: it is the way to go for NLP tasks in Python, and it supports Italian.


This program uses several sources (WordNet, ConceptNet, and en.wiktionary) to generate different datasets about the Italian language, specifically:

  • A basic English -> Italian dictionary (this is actually just a precondition for the other steps)
  • A list of Italian hyponym/hypernym pairs
  • A list of Italian PoS tags
  • A list of Italian verb conjugations

The data is produced both as TSV files and as a single SQLite database.
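
As a rough illustration of how the TSV output could be consumed, here is a minimal Java sketch that parses conjugation rows and groups the forms by infinitive. The column layout (infinitive, tense label, conjugated form), the tense labels and the class name `ConjugationTsvSketch` are all assumptions made for the example; the README does not document the actual file format, so check the generated files before relying on it.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class ConjugationTsvSketch {

    // ASSUMED row layout (not documented in the README):
    // infinitive <TAB> tense label <TAB> conjugated form
    static final class Conjugation {
        final String infinitive;
        final String tense;
        final String form;

        Conjugation(String infinitive, String tense, String form) {
            this.infinitive = infinitive;
            this.tense = tense;
            this.form = form;
        }
    }

    // Parse tab-separated lines, skipping blank ones and rejecting malformed rows.
    static List<Conjugation> parse(List<String> lines) {
        List<Conjugation> rows = new ArrayList<>();
        for (String line : lines) {
            if (line.trim().isEmpty()) {
                continue;
            }
            String[] cols = line.split("\t", -1);
            if (cols.length != 3) {
                throw new IllegalArgumentException("Expected 3 columns: " + line);
            }
            rows.add(new Conjugation(cols[0], cols[1], cols[2]));
        }
        return rows;
    }

    // Group conjugated forms under their infinitive, preserving input order.
    static Map<String, List<String>> formsByInfinitive(List<Conjugation> rows) {
        Map<String, List<String>> index = new LinkedHashMap<>();
        for (Conjugation c : rows) {
            index.computeIfAbsent(c.infinitive, k -> new ArrayList<>()).add(c.form);
        }
        return index;
    }

    public static void main(String[] args) {
        // Hand-written sample rows, not taken from the generated dataset.
        List<String> sample = List.of(
            "parlare\tpresent-1s\tparlo",
            "parlare\tpresent-2s\tparli",
            "essere\tpresent-1s\tsono");
        System.out.println(formsByInfinitive(parse(sample)));
        // prints {parlare=[parlo, parli], essere=[sono]}
    }
}
```

In real use the lines would come from one of the generated files, for example via `java.nio.file.Files.readAllLines`, instead of the inline sample.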

Languages

Java
