Protectr

A Scala / Java / Python library for anonymization, encryption and redaction operations for large datasets on Apache Spark.

It is currently maintained by a team of developers from ThoughtWorks.

Our aim is to provide a set of algorithms for encription and anonymization for very large data sets.