I use pipenv together with pyenv. This works pretty well, including in cron jobs. Just add pipenv run python script.py to the cron table.
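For illustration, a crontab entry along these lines might look like the sketch below. The user name, project directory, schedule, and log file are all made-up placeholders, and the PATH line assumes pyenv lives in its default ~/.pyenv location and pipenv was installed into ~/.local/bin; adjust to wherever they are on your machine. Cron starts jobs with a very minimal environment, so setting PATH (or using absolute paths) and changing into the directory that holds the Pipfile is usually needed.

```crontab
# Hypothetical paths: replace /home/user and myproject with your own.
# cron does not read your shell profile, so make the pyenv shims and pipenv findable.
PATH=/home/user/.pyenv/shims:/home/user/.local/bin:/usr/bin:/bin

# Run every day at 03:00; cd first so pipenv finds the project's Pipfile.
0 3 * * * cd /home/user/myproject && pipenv run python script.py >> /home/user/myproject/cron.log 2>&1
```

Redirecting output to a log file is optional, but it makes it much easier to see why a job failed.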
The data is integrated into the Internet Archive and available e.g. via the Wayback Machine. Not sure if you can get the whole Reddit dataset.
The archive warriors have been downloading Reddit for a while already: 15.6 billion items and counting. You can help too:
Over the internet for file sync. Desktop devices.