- Добавил: literator
- Дата: 27-01-2023, 00:31
- Комментариев: 0
Название: Practical Data Privacy: Solving Privacy and Security Problems in Your Data Science Workflow (Fifth Early Release)
Автор: Katharine Jarmul
Издательство: O’Reilly Media, Inc.
Год: 2023-01-27
Страниц: 306
Язык: английский
Формат: epub (true), mobi
Размер: 14.2 MB
Between major privacy regulations like the GDPR and CCPA and expensive and notorious data breaches, there has never been so much pressure for data scientists to ensure data privacy. Unfortunately, integrating privacy into your data science workflow is still complicated. This essential guide will give you solid advice and best practices on breakthrough privacy-enhancing technologies such as encrypted learning and differential privacy--as well as a look at emerging technologies and techniques in the field. Federated Learning (FL) and distributed Data Science provide new ways to think about how you do data analysis by keeping data at the edge: on phones, laptops, edge services — or even on-premise architecture or separate cloud architecture when working with partners. The data is not collected or copied to your own cloud or storage before you do analysis or Machine Learning. In this chapter, you’ll learn how this works in practice and determine when this approach is appropriate for a given use case. You’ll also evaluate how to offer privacy via other tools, along with what types of data or engineering problems federated approaches can solve and which are a poor fit. In Data Science, you are almost always using distributed data. Every time you start up a Kubernetes or Hadoop cluster or use a multi-cloud setup for data analysis, your data is de facto distributed.
Автор: Katharine Jarmul
Издательство: O’Reilly Media, Inc.
Год: 2023-01-27
Страниц: 306
Язык: английский
Формат: epub (true), mobi
Размер: 14.2 MB
Between major privacy regulations like the GDPR and CCPA and expensive and notorious data breaches, there has never been so much pressure for data scientists to ensure data privacy. Unfortunately, integrating privacy into your data science workflow is still complicated. This essential guide will give you solid advice and best practices on breakthrough privacy-enhancing technologies such as encrypted learning and differential privacy--as well as a look at emerging technologies and techniques in the field. Federated Learning (FL) and distributed Data Science provide new ways to think about how you do data analysis by keeping data at the edge: on phones, laptops, edge services — or even on-premise architecture or separate cloud architecture when working with partners. The data is not collected or copied to your own cloud or storage before you do analysis or Machine Learning. In this chapter, you’ll learn how this works in practice and determine when this approach is appropriate for a given use case. You’ll also evaluate how to offer privacy via other tools, along with what types of data or engineering problems federated approaches can solve and which are a poor fit. In Data Science, you are almost always using distributed data. Every time you start up a Kubernetes or Hadoop cluster or use a multi-cloud setup for data analysis, your data is de facto distributed.