Privacy vs progress: the ethical quandary of big data

10 May 2015

These days, massive volumes of data about us are collected from censuses and surveys, computers and mobile devices, as well as scanning machines and sensors of many kinds. But this data can also reveal personal and sensitive information about us, raising some serious privacy concerns.

Data are routinely collected when we shop, use public transport, visit our GP or access government services in person or online. There are also data from our smartphones and fitness monitoring devices.

These data are generally collected for a purpose, called the “primary purpose”. Examples include having purchased goods delivered, catching a bus from home to work, having a health check, obtaining a Medicare refund, navigating or searching our local area, and logging our fitness regime.

But in addition to being used for such primary purposes, many data are stored and used for other purposes, called “secondary purposes”. This includes research to help inform decision-making and debate within government and the community.

For example, data from Medicare, the Pharmaceutical Benefits Scheme and hospitals can be used to identify potential adverse drug reactions much faster than is currently possible.

What about privacy?

But these data can also reveal highly sensitive information about us, such as our preferences, behaviours and friends, and whether we have a disease.

Given the rapid change in the volume and nature of data in the digital age, it is timely to ask whether the existing ethics frameworks for the secondary use of such data are still adequate. Do they address the right ethical issues associated with research using the data? In particular, how will an individual’s privacy be protected?

There have been two important responses to these issues. A group of researchers, supported by the University of Melbourne and the Carlton Connect Initiative, explored these issues through workshops, desk research and many consultations.

They produced the Guidelines for the Ethical Use of Digital Data in Human Research. It’s a work in progress, requiring ongoing practice and revision, rather than a definitive set of prescriptions.

A team at CSIRO and the Sax Institute also addressed the deeper ethical issue of protecting privacy in the secondary use of health data. This work will be developed into Guidelines for Confidentiality Protection in Public Health Research Results.

Ethical issues for digital data

In the first of the guidelines, five key categories of ethical issues are identified as highly relevant to digital data and as requiring additional consideration when such data are used:

Consent: making sure that participants can make informed decisions about their participation in the research
Privacy and confidentiality: privacy is the control that individuals have over who can access their personal information. Confidentiality is the principle that only authorised persons should have access to information
Ownership and authorship: who has responsibility for the data, and at what point does the individual give up their right to control their personal data?
Data sharing – assessing the social benefits of research: data matching and re-use of data from one source or research project in another
Governance and custodianship: oversight and implementation of the management, organisation, access and preservation of digital data.

The voluntary guidelines were developed to help people conducting research and to assist ethics committees to assess research involving digital data.

Without such guidelines, there is a risk that new ethical issues involving digital data will not be adequately considered and managed by researchers and ethics committees.

Privacy risks from the data

Traditionally, the data custodians responsible for granting access to data sets have sought to protect people’s confidentiality by providing access only to approved researchers. They have also restricted the detail of the data released, for example by replacing age or date of birth with month or year of birth.
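As a rough illustration of restricting detail before release, the sketch below reduces a full date of birth to a year or a year and month. The function name `coarsen_birth_date`, the record layout and the sample values are hypothetical, chosen only for this example; they do not describe any particular custodian's practice.

```python
from datetime import date


def coarsen_birth_date(dob: date, level: str = "year") -> str:
    """Return a less detailed form of a date of birth (hypothetical helper)."""
    if level == "year":
        return f"{dob.year:04d}"
    if level == "month":
        return f"{dob.year:04d}-{dob.month:02d}"
    raise ValueError(f"unknown level: {level!r}")


# Made-up records held by a custodian (illustrative only).
records = [
    {"record": 1, "dob": date(1975, 3, 14)},
    {"record": 2, "dob": date(1975, 11, 2)},
]

# The released version keeps only the year of birth.
released = [
    {"record": r["record"], "birth_year": coarsen_birth_date(r["dob"])}
    for r in records
]
print(released)
```

In the released version both records carry the same year of birth, so that field alone no longer separates one person from the other.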

More recently, data custodians have been asked for highly flexible access to more and more detail about individuals from an expanded range of data collections.

Custodians are responding by developing a new flexible range of access modes or mechanisms, including remote analysis systems and virtual data centres.

Under remote analysis, a researcher does not have access to any of the data but submits queries and receives analysis results through a secure webpage.

A virtual data centre is less restrictive than a remote analysis system. It enables researchers to interact directly with data, submit queries and receive results through a secure interface.

But the results of statistical analysis released by a virtual data centre may still reveal personal information. For example, if a result such as an average is computed on a very small number of people, it can be very close to the value for each of those people, and for a single person it is exactly that person’s value.
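The risk from small groups can be shown with a short sketch. It assumes a simple rule under which an average is withheld when the group is smaller than a threshold. The threshold of 5, the name `safe_mean` and the sample values are assumptions made for this example, not rules taken from the guidelines.

```python
from statistics import mean

# Hypothetical minimum group size; real thresholds are set by the data custodian.
MIN_GROUP_SIZE = 5


def safe_mean(values, min_group_size=MIN_GROUP_SIZE):
    """Return the mean, or None when the group is too small to release safely."""
    if len(values) < min_group_size:
        return None
    return mean(values)


# With one person in the group, the unprotected average is that person's value.
single_patient_cost = [52000]
print(mean(single_patient_cost))       # discloses the individual's value
print(safe_mean(single_patient_cost))  # withheld: group too small

# With a larger group, the average says much less about any one person.
larger_group = [100, 200, 300, 400, 500]
print(safe_mean(larger_group))
```

A size threshold of this kind is only one possible safeguard, and on its own it does not rule out every way of inferring individual values from released results.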

By following voluntary guidelines for confidentiality protection, researchers can maintain confidentiality while ensuring that society can benefit from their work.

The rapid technological advances in our society are creating more and more data archives of many different types. It’s vital that we continue to assess the ethical and privacy risks from secondary use of this data if researchers are to reap the potential benefits from access to the information.

By Christine O’Keefe, Research Director · Digital Productivity at CSIRO

This article was originally published on The Conversation.