A presentation at London Web Performance Meetup May 2020 in in London, UK by Emanuil Tolev

GDPR Compliance for Your Datastore Emanuil Tolev @emanuil_tolev
Also known as “not a GDPR expert” I’m good at making moussaka and bean soup though @emanuil_tolev am a EU citizen ^ had a small business
It’s a cultural artefact now @emanuil_tolev Can you recommend a GDPR expert? Yes! Great, can you give me their email address so I can contact them? No.
Joke credit @wardrox https://twitter.com/wardrox/status/988363811479572483 @emanuil_tolev
Questions in chat :) @emanuil_tolev
General Data Protection Regulation Adopted 2016/04/14 Enforceable 2018/05/25 @emanuil_tolev Behind those bulky terms there is an EU regulation that is not a paper tiger. We will dive into where and to who this applies, what is covered, and how you can work with this regulation. Fines: Whatever is greater
Where & Who? EU organizations Services or goods for / monitoring of EU citizens @emanuil_tolev
Fines 2 tiers: up to 10m EUR or 2% of turnover up to 20m EUR or 4% of turnover @emanuil_tolev meaningful fines
of turnover s’up in the UK? Equifax (500k - old DPA law), BA (Jul 2019), Mariott (Jul 2019) Crown Prosecution Service Guess the amount Facebook was fined for Cambridge Analytica! @emanuil_tolev unencrypted DVDs with police recordings lost by CPS Facebook was 500k, because it was under the DPA - max under the old legislation
What? Personal Data Any information relating to an identified or identifiable natural person @emanuil_tolev Personal data means any information relating to an identified or identifiable natural person — name, contact details, IP,… There is also sensitive personal data which includes race, sex, trade union membership where your protections should be stricter. So this will include your server logs up to your marketing campaigns. But what are the actual rights natural persons get?
Rights? to be informed access rectification @emanuil_tolev 8 data protection rights Right to be informed: You must tell individuals how and why youʼre collecting and processing their data Right of access: You must let people know how youʼre using their data and allow them to check youʼre doing it legally Right to rectification: If youʼve made a mistake in someoneʼs data, you must correct it
Rights? erasure (to be forgotten) restrict processing data portability @emanuil_tolev Right to erasure: Also known as the right to be forgotten, in some circumstances an individual can request that you delete data about them Right to restrict processing: You can still store the data but an individual can ask you stop using it Right to data portability: People must be able to get hold of the data you hold on them and then use it elsewhere
Rights? object automatic decision making @emanuil_tolev Right to object: If youʼre using someoneʼs data for marketing or research purposes, they can ask you to stop Rights relating to automatic decision making: This covers automated profiling, machine learning and so on (unless explicitly agreed or required)
Lawful use of data? Informed consent Contractual obligation Legitimate interest @emanuil_tolev 6 ways for lawful use of data Informed consent: The individual explicitly opts-in to the precise way you say that youʼll use their data Contractual obligation: you need to use the data in order to deliver a service the person has asked for, or that theyʼve told you theyʼre considering, and youʼre using only the data needed to fulfil that contract Legitimate interest: Perhaps the vaguest of the lawful bases, this allows you to use data if the legitimate interests of your company require it and you can show that this balances with the rights of the individual
Lawful use of data? Legal obligation Vital interests Public task @emanuil_tolev Legal obligation: this allows you to use data where the law requires you to Vital interests: You need to use the data to save someoneʼs life Public task: This applies most to public authorities and allows for the use of personal data if itʼs in the public interest
Proof Required Right to collect and legally use @emanuil_tolev One of the game changers: You need to prove that you are legally using the data. Rights: When collect Use: Stay within those For every dataprocess that you have
Disclosure Within 72 hours to a member state’s “supervisory body” @emanuil_tolev
Legacy Data Stop, Check, Delete @emanuil_tolev If you find you have data that was collected in a way that doesnʼt comply with the GDPR, destroy it. Similarly, if youʼre using data in a noncompliant way, stop doing so.
What if no legal grounds? @emanuil_tolev Somebody just visits your site. How do you collect any information from them? They didn’t even had a chance to give you their consent, but you also don’t want to burn your monitoring to the ground and be blind.
Can be a site just for reading or an entire service
unroll.me is doing this for example
https://twitter.com/rianjohnson/status/999730569641525248 This might then look like this: Before you can start the film / website, you need to go through this. And you would actually watch this one
Anonymous No information that could potentially identify an individual Not considered Personal Data by GDPR @emanuil_tolev
Pseudonymous Re-identification possible if combined with additional information Without this information, reidentification practically impossible @emanuil_tolev
When? Ingestion time Search time @emanuil_tolev When do you change your data? Let’s assume we want to do it at ingestion time, because it saves us a lot of hassle later on
fingerprint { method => “SHA256” source => [“ip”] key => “${FINGERPRINT_KEY}” } mutate { add_field => { ‘[identities][0][key]’ => “%{fingerprint}” ‘[identities][0][value]’ => “%{ip}” } } mutate { replace => { “ip” => “%{fingerprint}” } } @emanuil_tolev
The service can even enrich data with pther known records. This does not offer enough protection for pseudoanonymization (in my opinion). You need to implement this properly.
Access Control & Encryption @emanuil_tolev
Deletion @emanuil_tolev
“Interesting #GDPR solution for the “right to erasure” : Encrypt all user’s data and when you have to delete it you just get rid of the private key. Will this become the norm?” https://twitter.com/Stephan007/status/985103374118014976 @emanuil_tolev One of the more clever approaches for personal data.
“[…] personal data of our users can only be persisted when it is encrypted. Each user has their own set of keys […] it reduces the impact of leaking a dataset, since the dataset by itself is useless — attackers also need the decryption keys. […] it allows us to control the lifecycle of data for individual users centrally.” https://labs.spotify.com/2018/09/18/scalable-user-privacy/ @emanuil_tolev This is exactly what Spotify is doing. Though this is more of an application feature, so we are not covering it in detail. It helped keeping their microservice architecture simple, since deleting data everywhere becomes a major hassle otherwise. Another option they considered was a central datastore and everything else basically only caches data. Though with various access patterns (email or profile picture) this was deemed too complicated. Article goes into a lot of details around Padlock: a global key-management system
Conclusion @emanuil_tolev
Data Protection The new standard and norm of approaching personal data @emanuil_tolev Even if it sounds difficult for some, this is by design the new standard and way to approach personal data. It’s not an afterthought any more
Special category: racial, ethnic, religious, political, biometric,…
I am not a lawyer @emanuil_tolev I am not a lawyer, sorry.
As a dev agency / consultancy @emanuil_tolev Generally we determined clients were data controllers and we were data processors But when we wanted to run a SaaS service we became data controllers. Even though in practice our (university) clients told us their reqs. I’d err on the side of more responsibility.
Heather Burns https://www.smashingmagazine.com/2018/02/gdpr-forweb-developers/ @emanuil_tolev She’s far better than me and I only read this post today. You should read it.
❤ GDPR and carry on @emanuil_tolev Regulations are everywhere, so don’t panic. Even a coffee cart comes with legal implications: food safety laws, commercial operation laws, municipal laws, administrative laws, employment law,… Generally: Do the right thing and you will be fine
@emanuil_tolev And don’t handle it like zoom.us — yes or yes is not an appropriate way to do this.
Why care? Stick Carrot Godwin @emanuil_tolev I want to be a good person and an upstanding citizen. Why is that so boring sometimes?! Well, it’s all about framing. We all get the stuff about being fined. But why should you spend your limited brain cycles on this? Europe is densely populated and we cannot help but stick our nose in each other’s business. It’s kinda silly to think your local hair salon could expose your email address in a breach, but data protection law comes from a long and sombre line of privacy violations and data gathering in Europe. You’ve probably all heard of the use of census data by the Nazi regime in Germany in the late 1930s. It was processed for storage earlier by IBM who almost certainly didn’t it to be used as it was. Personal opinion time! Sometimes, to make a business decision we must invoke emotion as well as fact. If you want to invoke something other than boredom when thinking of data protection, then invoke a sense of duty towards people’s privacy and build businesses and systems which respect that privacy. Only collect data the data you need. Question if you need pieces of it at the product design stage. Only use data as you need it. Only store data as long as you need it. If you collected something earlier and want to retire the functionality that uses it - drop that data, or archive it with encryption far from your live systems. This, in my opinion as a web dev, is the spirit of the GDPR regulation.
Questions? Emanuil Tolev @emanuil_tolev @emanuil_tolev
The General Data Protection Regulation (GDPR) is changing how you can handle data in Europe. But what does this actually mean? The first part of this talk gives an overview about the implications of GDPR, which affects every software project with a European relation. That includes users’ right to see, edit, and export their data, the right to be forgotten,… The second part takes a look at what this means for actual software projects with the specific use-case of logging. The main focus here is how to stay GDPR compliant while still being able to use the data for security and operation aspects.
This talk does not replace legal advice or a deeper examination of the topic. It gives you an overview and pointers to relevant techniques, but you need to discuss the implementation for your project with your own legal counsel.
