Percona Live Presents: Gonymizer, A Tool to Anonymize Sensitive PostgreSQL Data Tables for Use in QA and Testing

by Levi Junkert


gonymizer postgres data anonymizer
is a next generation pharmacy benefit platform that is using the latest technology to radically reshape the prescription benefit management industry. To move quickly, we require the ability to iterate and test new versions of our software using production like data without violating Health Information Portability and Accountability Act (HIPAA) regulations.

At Percona Live 2019, we are introducing a project we open sourced to anonymize our sensitive production data for use in rapid QA and testing of our software. The talk will cover:

  • An introduction to HIPAA and Protected Health Information (PHI)
  • Deciding which parts of your data need to be anonymized
  • Column mapping and how to represent relations that need to be anonymized
  • An introduction to the design of the software and how it works
  • Dumping data from a sensitive source
  • Processing the sensitive data to create an anonymized data set
  • Loading of the anonymized data set to a QA environment
  • How SmithRx is using multiple Kubernetes CronJob to reload our Q/A and development environments daily
  • Other examples on how Gonymizer can be used in other scheduling systems such as AWS Lambda
  • What this means for you and how you can contribute

Who’d get the most from the presentation?

This presentation is intended for software engineers that need a quick and easy way to anonymize their data. Intended for middle level database infrastructure (devops), and continuous integration systems. This presentation is also appropriate for Go developers looking to contribute to Β an open source project that is database related. Currently Gonymizer only supports PostgreSQL, but the software has been designed to handle multiple RDBMS in the future so anyone with HIPAA, DISA (Defense Information Systems Agency), or PCI () experience in other RDBMS may find this presentation useful for getting you started on porting Gonymizer to your RDBMS.

Whose presentations are you most looking forward to?

At SmithRx we are currently growing our infrastructure size, automation management, and monitoring systems for our PostgreSQL database tier. There are many presentations we look forward to attending, but the following four talks will be a focus for SmithRx:


Photo by Viktor Talashuk on Unsplash ∎

Levi Junkert

Levi is an infrastructure engineer who works at SmithRx creating HIPAA compliant infrastructure and software for the prescription benefits management industry. He has been building automated systems for over 10 years and working with MySQL and PostgreSQL database management systems for the past 18 years. Previous to SmithRx software that Levi has written can be found at companies such as: Facebook, Cisco, Oracle, JPL-NASA, and other small start-ups in The Bay area.

See all posts by Levi Junkert »


We invite you to our forum for discussion. You are welcome to use the widget below.

✎ Edit this page on GitHub