Protectr
A Scala / Java / Python library for anonymizing, encrypting and redacting large datasets on Apache Spark.
It is currently maintained by a team of developers from ThoughtWorks.
Our aim is to provide a set of algorithms for encryption and anonymization of very large datasets.