HiddenSchemaNetworks

Source code for Hidden Schema Network model --- a deep discrete-relational representation learning algorithm for natural language

Generate Convert Improve

Install / Use

/learn @ramsesjsf/HiddenSchemaNetworks

About this skill

Quality Score

0/100

README

==================== HiddenSchemaNetworks

Installation

To install the code and all dependencies first install Python 3.8. Then run in the project directory::

pip install -e .

Experiments

Synthetic Data

Training

To reproduce any of the synthetic data experiments, run::

python scripts/train_synthetic_schema.py -c experiments/synthetic/<config_file>

where <config_file> is one of the .yaml files in experiments/synthetic.

In the config file, you can change a number of parameters and most importantly the path where results and trained models are saved.

Evaluation

To compute metrics for a whole directory of models, run::

python scripts/evaluation_toolbox_synth.py -p results -n synth/<experiment_name>

where <experiment_name> is the name of the .yaml file, e.g. erdos_schema

Real Data

Training

To train GPT2 or Schema models on the PTB, Yahoo, or Yelp data sets, run::

python scripts/train_model.py -c experiments/<path>

with <path> path to a config file, e.g. ptb/gpt2.yaml

For different numbers of symbols, you can modify the option n_symbols in the .yaml file, and for different random walk lengths, change rw_length in the encoder section. You can also change the directory of the saved models here.

Evaluation

To compute metrics for a trained model saved in results/<path>, run::

python scripts/evaluation_toolbox.py -p results -n <path>

Functions for generating text, interpolations, and other graph statistics are all available in evaluation_toolbox.py

Note

This project has been set up using PyScaffold 3.2.1. For details and usage information on PyScaffold see https://pyscaffold.org/.

Related Skills

YC-Killer

2.7k

A library of enterprise-grade AI agents designed to democratize artificial intelligence and provide free, open-source alternatives to overvalued Y Combinator startups. If you are excited about democratizing AI access & AI agents, please star ⭐️ this repository and use the link in the readme to join our open source AI research team.

groundhog

398

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).

sec-edgar-agentkit

AI agent toolkit for accessing and analyzing SEC EDGAR filing data. Build intelligent agents with LangChain, MCP-use, Gradio, Dify, and smolagents to analyze financial statements, insider trading, and company filings.

Kiln

4.7k

Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, and more.