April 18, 2018, 2:45 p.m.
Before releasing a public dataset, practitioners need to tread the balance between utility and protection of individuals. In this talk Felipe moves from theory to real-life while handling massive public datasets, showcasing newly available tools that help with PII detection, and bringing concepts like k-anonymity and l-diversity to a practical realm.
Related research: Considerations for Sensitive Data within Machine Learning Datasets
Watch the Q&A for this session here.
The Civic Tech conference that plugs a gap in debate, networking and research between practitioners, commentators, academics and funders of civic technology.
Your donations keep this site and others like it running
In association with
The William and Flora Hewlett Foundation
mySociety Limited is a project of UK Citizens Online Democracy, a registered charity in England and Wales. For full details visit mysociety.org.