Duorc
A Large-Scale Dataset for Paraphrased Reading Comprehension
Install / Use
/learn @duorc/DuorcREADME
DuoRC
DuoRC contains 186,089 unique question-answer pairs created from a collection of 7680 pairs of movie plots where each pair in the collection reflects two versions of the same movie.
Evaluation
For evaluation, the answers should be stored in a JSON file with the following format:
{
'id1': 'answer1',
'id2': 'NA',
...
}
Note: NA is the expected string for no answer
Next, use the provided evaluation script to calculate the exact match and F1 scores:
python evaluate.py <test-file-path>.json <answer-file-path>.json
See https://duorc.github.io/ for more details
Related Skills
proje
Interactive vocabulary learning platform with smart flashcards and spaced repetition for effective language acquisition.
groundhog
398Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
last30days-skill
17.5kAI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary
sec-edgar-agentkit
10AI agent toolkit for accessing and analyzing SEC EDGAR filing data. Build intelligent agents with LangChain, MCP-use, Gradio, Dify, and smolagents to analyze financial statements, insider trading, and company filings.
