18 Apr 2018, 2:45 p.m.
Before releasing a public dataset, practitioners need to tread the balance between utility and protection of individuals. In this talk Felipe moves from theory to real-life while handling massive public datasets, showcasing newly available tools that help with PII detection, and bringing concepts like k-anonymity and l-diversity to a practical realm.
Related research: Considerations for Sensitive Data within Machine Learning Datasets
Watch the Q&A for this session here.
TICTeC supports the mission of the non-profit mySociety by bringing together practitioners, commentators, academics and funders to debate, network, and share research and knowledge in the civic tech field.
Your donations keep this site and others like it running
is a registered charity in England and Wales (1076346)
and a limited company (03277032). We provide commercial
services through our wholly owned subsidiary