SkillAgentSearch skills...

PushshiftDumps

Example scripts for the pushshift dump files

Install / Use

/learn @Watchful1/PushshiftDumps
About this skill

Quality Score

0/100

Supported Platforms

Universal

README

This repo contains example python scripts for processing the reddit dump files created by pushshift. The files can be torrented from here.

  • single_file.py decompresses and iterates over a single zst compressed file
  • iterate_folder.py does the same, but for all files in a folder
  • combine_folder_multiprocess.py uses separate processes to iterate over multiple files in parallel, writing lines that match the criteria passed in to text files, then combining them into a final zst compressed file

Related Skills

View on GitHub
GitHub Stars481
CategoryDevelopment
Updated9h ago
Forks86

Languages

Python

Security Score

95/100

Audited on Apr 3, 2026

No findings