PushshiftDumps
Example scripts for the pushshift dump files
Install / Use
/learn @Watchful1/PushshiftDumpsREADME
This repo contains example python scripts for processing the reddit dump files created by pushshift. The files can be torrented from here.
single_file.pydecompresses and iterates over a single zst compressed fileiterate_folder.pydoes the same, but for all files in a foldercombine_folder_multiprocess.pyuses separate processes to iterate over multiple files in parallel, writing lines that match the criteria passed in to text files, then combining them into a final zst compressed file
Related Skills
node-connect
347.2kDiagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps
frontend-design
108.0kCreate distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.
openai-whisper-api
347.2kTranscribe audio via OpenAI Audio Transcriptions API (Whisper).
qqbot-media
347.2kQQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
