# TensorFlow Privacy
This repository contains the source code for TensorFlow Privacy, a Python
library that includes implementations of TensorFlow optimizers for training
machine learning models with differential privacy. The library comes with
tutorials and analysis tools for computing the privacy guarantees provided.
The TensorFlow Privacy library is under continual development and contributions
are always welcome. In particular, we welcome help in resolving the issues that
are currently open.
## Latest Updates
2020-12-21: A new
[vectorized version of the TF 2 optimizer](https://github.com/tensorflow/privacy/blob/master/tensorflow_privacy/privacy/optimizers/dp_optimizer_keras_vectorized.py)
is available, which can deliver much faster performance. We recommend trying it
first, and falling back to the original non-vectorized version only if it
fails. We are thankful to the
[authors of this paper](https://arxiv.org/abs/2010.09063) for spurring this
change.
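For reference, a minimal sketch of constructing the vectorized optimizer is
shown below; the hyperparameter values are purely illustrative, and the class
name assumes the `VectorizedDPKerasSGDOptimizer` defined in that module:
```
# Minimal sketch (illustrative hyperparameters): the vectorized optimizer
# accepts the same differential-privacy arguments as the non-vectorized one.
from tensorflow_privacy.privacy.optimizers import dp_optimizer_keras_vectorized

optimizer = dp_optimizer_keras_vectorized.VectorizedDPKerasSGDOptimizer(
    l2_norm_clip=1.0,       # clipping norm for per-example gradients
    noise_multiplier=1.1,   # noise stddev as a multiple of the clipping norm
    num_microbatches=256,   # must evenly divide the batch size
    learning_rate=0.15)
```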
## Setting up TensorFlow Privacy
### Dependencies
This library uses [TensorFlow](https://www.tensorflow.org/) to define machine
learning models. Therefore, installing TensorFlow (>= 1.14) is a prerequisite.
You can find instructions [here](https://www.tensorflow.org/install/). For
better performance, it is also recommended to install TensorFlow with GPU
support (detailed instructions are available in the TensorFlow installation
documentation).
In addition to TensorFlow and its dependencies, other prerequisites are:
* `scipy` >= 0.17
* `mpmath` (for testing)
* `tensorflow_datasets` (for the RNN tutorial `lm_dpsgd_tutorial.py` only)
### Installing TensorFlow Privacy
If you only want to use TensorFlow Privacy as a library, you can simply run
`pip install tensorflow-privacy`.

Otherwise, you can clone this GitHub repository into a directory of your choice:
```
git clone https://github.com/tensorflow/privacy
```
You can then install the local package in "editable" mode in order to add it to
your `PYTHONPATH`:
```
cd privacy
pip install -e .
```
If you'd like to make contributions, we recommend first forking the repository
and then cloning your fork rather than cloning this repository directly.
## Contributing
Contributions are welcome! Bug fixes and new features can be initiated through
GitHub pull requests. To speed the code review process, we ask that:
* When making code contributions to TensorFlow Privacy, you follow the `PEP8
with two spaces` coding style (the same as the one used by TensorFlow) in
your pull requests. In most cases this can be done by running `autopep8 -i
--indent-size 2 <file>` on the files you have edited.
* You also check your code with pylint against TensorFlow's pylint
[configuration file](https://raw.githubusercontent.com/tensorflow/tensorflow/master/tensorflow/tools/ci_build/pylintrc)
by running `pylint --rcfile=/path/to/the/tf/rcfile <edited file.py>`.
* When making your first pull request, you
[sign the Google CLA](https://cla.developers.google.com/clas).
* We do not accept pull requests that add git submodules because of
[the problems that arise when maintaining git submodules](https://medium.com/@porteneuve/mastering-git-submodules-34c65e940407).
## Tutorials directory
To help you get started with the features provided by this library, we
provide a detailed walkthrough [here](tutorials/walkthrough/README.md) that
teaches you how to wrap existing optimizers (e.g., SGD, Adam, ...) in their
differentially private counterparts using TensorFlow (TF) Privacy. You will
also learn how to tune the parameters introduced by differentially private
optimization and how to measure the privacy guarantees provided, using the
analysis tools included in TF Privacy.
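For example, a rough sketch of that analysis step (the module path and the
numeric values below are assumptions for illustration; substitute your own
training configuration) might look like this:
```
# Sketch only: estimate the (epsilon, delta) guarantee of a DP-SGD run.
# All numeric values are placeholders, not recommendations.
from tensorflow_privacy.privacy.analysis import compute_dp_sgd_privacy

eps, opt_order = compute_dp_sgd_privacy.compute_dp_sgd_privacy(
    n=60000,                # number of training examples
    batch_size=256,         # batch size used during training
    noise_multiplier=1.1,   # noise multiplier passed to the DP optimizer
    epochs=60,              # number of training epochs
    delta=1e-5)             # target delta, typically < 1 / n
print('Estimated epsilon at delta=1e-5:', eps)
```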
In addition, the `tutorials/` folder comes with scripts demonstrating how to
use the library features. The available tutorials are listed in the README
included in the tutorials directory.
NOTE: the tutorials are maintained carefully. However, they are not considered
part of the API and they can change at any time without warning. You should not
write third-party code that imports the tutorials and expect the interface to
remain stable.
## Research directory
This folder contains code to reproduce results from research papers related to
privacy in machine learning. It is not maintained as carefully as the tutorials
directory; it is intended instead as a convenient archive.
## TensorFlow 2.x
TensorFlow Privacy now works with TensorFlow 2! You can use the new
Keras-based optimizers found in
`privacy/tensorflow_privacy/privacy/optimizers/dp_optimizer_keras.py`.
For this to work with `tf.keras.Model` and `tf.estimator.Estimator`, however,
you need to install TensorFlow 2.4 or later.
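As a minimal sketch (the model architecture and hyperparameter values below
are placeholders rather than recommendations), a Keras model can be compiled
with the DP optimizer as follows; as in the tutorials, the loss is left
unreduced so that gradients can be clipped per microbatch:
```
# Minimal sketch: compiling a tf.keras model with a DP Keras optimizer.
import tensorflow as tf
from tensorflow_privacy.privacy.optimizers import dp_optimizer_keras

model = tf.keras.Sequential([
    tf.keras.layers.Dense(10, activation='softmax', input_shape=(784,)),
])

optimizer = dp_optimizer_keras.DPKerasSGDOptimizer(
    l2_norm_clip=1.0,
    noise_multiplier=1.1,
    num_microbatches=250,   # must evenly divide the batch size
    learning_rate=0.15)

# Per-example losses (no reduction) let the optimizer clip and noise
# gradients microbatch by microbatch.
loss = tf.keras.losses.SparseCategoricalCrossentropy(
    reduction=tf.keras.losses.Reduction.NONE)

model.compile(optimizer=optimizer, loss=loss, metrics=['accuracy'])
# model.fit(train_images, train_labels, epochs=..., batch_size=250)
```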
## Remarks
The content of this repository supersedes the `research/differential_privacy`
folder of the tensorflow/models
[repository](https://github.com/tensorflow/models/tree/master/research/differential_privacy).
## Contacts
If you have any questions that cannot be addressed by raising an issue, feel
free to contact:
* Galen Andrew (@galenmandrew)
* Steve Chien (@schien1729)
* Nicolas Papernot (@npapernot)
## Copyright
Copyright 2019 - Google LLC