April 18, 2018, 2:45 p.m.
Before releasing a public dataset, practitioners need to strike a balance between utility and the protection of individuals. In this talk, Felipe moves from theory to real-life practice with massive public datasets, showcasing newly available tools that help with PII detection and showing how concepts like k-anonymity and l-diversity apply in practice.
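For readers unfamiliar with the two concepts named in the abstract, here is a minimal Python sketch of how they might be measured on a small table. It is not taken from the talk: the function names, the column names (`zip`, `age`, `diagnosis`) and the sample rows are all hypothetical, and the l-diversity check uses the simplest "distinct values" definition. A dataset is k-anonymous if every combination of quasi-identifier values is shared by at least k rows, and l-diverse if every such group contains at least l distinct sensitive values.

```python
from collections import defaultdict


def k_anonymity(rows, quasi_identifiers):
    """Return the size of the smallest group of rows that share
    the same quasi-identifier values (the dataset's k)."""
    counts = defaultdict(int)
    for row in rows:
        key = tuple(row[q] for q in quasi_identifiers)
        counts[key] += 1
    # Raises ValueError on an empty dataset, where k is undefined.
    return min(counts.values())


def l_diversity(rows, quasi_identifiers, sensitive):
    """Return the smallest number of distinct sensitive values found
    in any quasi-identifier group (distinct l-diversity)."""
    values = defaultdict(set)
    for row in rows:
        key = tuple(row[q] for q in quasi_identifiers)
        values[key].add(row[sensitive])
    return min(len(v) for v in values.values())


# Hypothetical, already-generalised records (illustration only).
rows = [
    {"zip": "941**", "age": "30-39", "diagnosis": "flu"},
    {"zip": "941**", "age": "30-39", "diagnosis": "asthma"},
    {"zip": "941**", "age": "40-49", "diagnosis": "flu"},
    {"zip": "941**", "age": "40-49", "diagnosis": "flu"},
]

k = k_anonymity(rows, ["zip", "age"])
l = l_diversity(rows, ["zip", "age"], "diagnosis")
print(f"k = {k}, l = {l}")
```

In this made-up table each age band contains two rows, so k is 2, but the 40-49 group holds a single diagnosis, so l is 1: anyone known to be in that group has their diagnosis revealed even though no individual row is unique. That gap is why l-diversity is usually checked alongside k-anonymity.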
Related research: Considerations for Sensitive Data within Machine Learning Datasets
Watch the Q&A for this session here.
The Civic Tech conference that plugs a gap in debate, networking and research between practitioners, commentators, academics and funders of civic technology.
mySociety Limited is a project of UK Citizens Online Democracy, a registered charity in England and Wales. For full details visit mysociety.org.