SkillAgentSearch skills...

Tablassert

Extract knowledge assertions from tabular data into NCATS Translator-compliant KGX NDJSON — declaratively, with entity resolution and quality control built in.

Install / Use

/learn @SkyeAv/Tablassert

README

Tablassert

PyPI Python License Docs

Extract knowledge assertions from tabular data into NCATS Translator-compliant KGX NDJSON — declaratively, with entity resolution and quality control built in.

pip install tablassert
tablassert build-knowledge-graph config.yaml

Full Documentation — installation guides, tutorials, configuration reference, and API docs.

Installation

pip install tablassert

All dependencies (ML, web, Excel support) are included in the base install. An optional extra is available for CPU compatibility:

pip install "tablassert[rtcompat]"  # Polars build for CPUs without required instructions
<details> <summary><strong>Docker</strong></summary>
docker pull ghcr.io/skyeav/tablassert:latest

docker run --rm \
  -v /path/to/config:/data \
  -v /path/to/datassert:/datassert \
  ghcr.io/skyeav/tablassert:latest \
  build-knowledge-graph /data/graph-config.yaml
</details>

Quick Demo

# Build a knowledge graph from a YAML configuration
$ tablassert build-knowledge-graph graph-config.yaml
⠋ Loading table configurations...
⠋ Resolving entities across 16 DuckDB shards...
⠋ Compiling subgraphs...
⠋ Deduplicating nodes and edges...
✓ Done — wrote nodes.ndjson and edges.ndjson to .storassert/

Define your entities and relationships in YAML, point tablassert at your data, and get NCATS Translator-compliant KGX NDJSON out the other side — no code required.

Key Features

  • Declarative Configuration — YAML-based, no code required
  • Entity Resolution — Maps text to biological entities (genes, diseases, chemicals)
  • Quality Control — Three-stage validation (exact → fuzzy → BERT embeddings)
  • KGX Compliance — NCATS Translator-compatible NDJSON output
  • Performance — Lazy evaluation pipelines with Polars and DuckDB-accelerated entity resolution

Contributing

See CONTRIBUTING.md for development setup, code style, and pull request guidelines.

License

Apache License 2.0

Contributors

Skye Lane Goetz — Institute for Systems Biology, CalPoly SLO

Gwênlyn Glusman — Institute for Systems Biology

Jared C. Roach — Institute for Systems Biology

View on GitHub
GitHub Stars4
CategoryDevelopment
Updated5d ago
Forks0

Languages

Python

Security Score

90/100

Audited on Apr 4, 2026

No findings