Scienceie17
NTNU at SemEval 2017 Task 10: Extracting Keyphrases and Relations from Scientific Publications
Install / Use
/learn @OC-ScienceIE/Scienceie17README
NTNU at SemEval 2017 Task 10: Extracting Keyphrases and Relations from Scientific Publications
https://scienceie.github.io/
Setup
-
Install Anaconda Python distibution from https://www.continuum.io/downloads
-
Create virtual Python environment
$ conda create -n scienceie17 python=3 scikit-learn
-
Activate environment
$ source activate scienceie17
-
Clone git repos
$ git clone https://github.com/OC-ScienceIE/scienceie17.git
-
Extend Python search path
$ export PYTHONPATH=<.../scienceie17>/bin:<.../scienceie17>/lib/python:$PYTHON
where <.../scienceie17> is the full path to the local repository.
-
Install additional dependencies
$ pip install sklearn-crfsuite $ pip install -U spacy $ python -m spacy.en.download all
-
Preprocess data (run Spacy NLP, derive IOB tags)
$ bin/preproc.py
For subsequent use, step 3 & 5 can also be carried out by running
$ source setup_env.sh
