ChemDataExtractor
=================

.. image:: http://img.shields.io/pypi/v/ChemDataExtractor.svg?style=flat-square
    :target: https://pypi.python.org/pypi/ChemDataExtractor

.. image:: http://img.shields.io/pypi/l/ChemDataExtractor.svg?style=flat-square
    :target: https://github.com/mcs07/ChemDataExtractor/blob/master/LICENSE

.. image:: http://img.shields.io/travis/mcs07/ChemDataExtractor.svg?style=flat-square
    :target: https://travis-ci.org/mcs07/ChemDataExtractor

ChemDataExtractor is a toolkit for extracting chemical information from the scientific literature.

Features
--------

  • HTML, XML and PDF document readers
  • Chemistry-aware natural language processing pipeline
  • Chemical named entity recognition
  • Rule-based parsing grammars for property and spectra extraction
  • Table parser for extracting tabulated data
  • Document processing to resolve data interdependencies
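To give a feel for what rule-based property extraction means, here is a minimal, self-contained sketch that uses only the Python standard library. It is not ChemDataExtractor's API or grammar: the names ``MELTING_POINT_RULE`` and ``extract_melting_points`` are hypothetical, and a single regular expression stands in for the much richer parsing rules a real toolkit would use.

```python
import re
from typing import Dict, List

# Hypothetical toy rule (not ChemDataExtractor's grammar): a melting point
# keyword, an optional linking word, a number or numeric range, then units.
MELTING_POINT_RULE = re.compile(
    r"\b(?:melting point|m\.p\.|mp)\s*(?:of|:|=|was|is)?\s*"
    r"(?P<value>\d+(?:\.\d+)?(?:\s*-\s*\d+(?:\.\d+)?)?)\s*"
    r"(?P<units>°\s*C|K)",
    re.IGNORECASE,
)


def extract_melting_points(text: str) -> List[Dict[str, str]]:
    """Return one record per melting point mention matched by the toy rule."""
    records = []
    for match in MELTING_POINT_RULE.finditer(text):
        records.append({
            "property": "melting_point",
            # Strip internal whitespace so "120 - 122" becomes "120-122".
            "value": re.sub(r"\s+", "", match.group("value")),
            "units": re.sub(r"\s+", "", match.group("units")),
        })
    return records


if __name__ == "__main__":
    sample = (
        "The product was a white solid (m.p. 120-122 °C). "
        "A second batch had a melting point of 395 K."
    )
    for record in extract_melting_points(sample):
        print(record)
```

A pattern like this only captures the value and units; linking each value to the compound it describes is the kind of interdependency the document processing step is meant to resolve.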

Installation
------------

To install ChemDataExtractor, simply run::

    pip install chemdataextractor

Or if you are an Anaconda user, run::

    conda install -c chemdataextractor chemdataextractor

Alternatively, try one of the other `installation options`_.
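After installing, a quick way to confirm that the package is visible to the current Python interpreter is to look it up with the standard library. The sketch below assumes the import name is ``chemdataextractor`` (matching the name used with pip); the helper ``is_installed`` is a hypothetical name introduced here for illustration.

```python
import importlib.util


def is_installed(package_name: str) -> bool:
    """Return True if a top-level package can be found on the current Python path."""
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    name = "chemdataextractor"  # assumed import name
    status = "is installed" if is_installed(name) else "was not found"
    print(f"{name} {status}")
```

This check only locates the package without importing it, so it confirms the install without running any of the package's own code.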

Documentation
-------------

Full documentation is available at http://chemdataextractor.org/docs

License
-------

ChemDataExtractor is licensed under the `MIT license`_, a permissive, business-friendly license for open source software.

.. _installation options: http://chemdataextractor.org/docs/install
.. _MIT license: https://github.com/mcs07/ChemDataExtractor/blob/master/LICENSE
