
Enhancing Archives Unleashed Toolkit Usability with Spark-Submit

Originally posted here. Over the last month, we have put out several Toolkit releases, focused primarily on firming up and improving spark-submit support. What does this mean? The short answer is that it makes the Toolkit easier to use. Think of the “Let’s move tools towards our users” graphic in my post from a few weeks back, “Cloud-hosted web archive data: The winding path to web archive collections as data.”