Sparkonda
Minimalistic utility library to manage conda environments for PySpark jobs on YARN clusters.
Features
Manage conda environments on PySpark executors so that jobs can use specific packages on the remote workers, without asking admins to install the needed software on the Hadoop cluster.
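To illustrate the general idea, here is a minimal sketch of how a packed conda environment can be shipped to YARN using Spark's own configuration keys (`spark.yarn.dist.archives`, `spark.yarn.appMasterEnv.*`, `spark.executorEnv.*`). This is not Sparkonda's API. The helper name `conda_submit_args`, the archive name `sparkonda_env.tar.gz`, the alias `environment`, and the script `job.py` are all hypothetical, and the sketch assumes the environment has already been packed into an archive.

```python
def conda_submit_args(archive, alias="environment", script="job.py"):
    """Build a spark-submit argument list that ships a conda env archive to YARN.

    Hypothetical helper for illustration only; it is not part of Sparkonda.
    """
    # YARN unpacks the archive into a directory named after the alias
    # inside each container's working directory.
    python_path = "./{0}/bin/python".format(alias)
    return [
        "spark-submit",
        "--master", "yarn",
        "--deploy-mode", "cluster",
        # Distribute the archive to every container under the given alias.
        "--conf", "spark.yarn.dist.archives={0}#{1}".format(archive, alias),
        # Point the driver (application master) and the executors at the
        # Python interpreter inside the unpacked environment.
        "--conf", "spark.yarn.appMasterEnv.PYSPARK_PYTHON=" + python_path,
        "--conf", "spark.executorEnv.PYSPARK_PYTHON=" + python_path,
        script,
    ]


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works
    # on a machine without Spark installed.
    print(" ".join(conda_submit_args("sparkonda_env.tar.gz")))
```

Because the interpreter and its packages travel with the job, nothing has to be installed on the worker nodes ahead of time.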
Docs
http://sparkonda.readthedocs.org
