Data portability is a concept to protect users from having their data stored in "silos" or "walled gardens" that are incompatible with one another, i.e.
closed platforms, thus subjecting them to
vendor lock-in
In economics, vendor lock-in, also known as proprietary lock-in or customer lockin, makes a customer dependent on a vendor for products, unable to use another vendor without substantial switching costs.
The use of open standards and alternati ...
and making the creation of
data backups or moving accounts between services difficult.
Data portability requires common
technical standard
A technical standard is an established Social norm, norm or requirement for a repeatable technical task which is applied to a common and repeated use of rules, conditions, guidelines or characteristics for products or related processes and producti ...
s to facilitate the transfer from one data controller to another, such as the ability to
export
An export in international trade is a good produced in one country that is sold into another country or a service provided in one country for a national or resident of another country. The seller of such goods or the service provider is a ...
user data into a user-accessible local file, thus promoting
interoperability
Interoperability is a characteristic of a product or system to work with other products or systems. While the term was initially defined for information technology or systems engineering services to allow for information exchange, a broader de ...
, as well as facilitate searchability with sophisticated tools such as
grep
grep is a command-line utility for searching plaintext datasets for lines that match a regular expression. Its name comes from the ed command g/re/p (global regular expression search and print), which has the same effect. grep was originally de ...
.
Data portability applies to personal data. It involves access to personal data without implying data ownership per se.
Development
At the global level, there are proponents who see the protection of digital data as a human right. Thus, in an emerging civil society draft declaration, one finds mention of the following concepts and statutes: Right to Privacy on the Internet, Right to Digital Data Protection, Rights to Consumer Protection on the Internet –
United Nations Guidelines for Consumer Protection.
At the regional level, there are at least three main jurisdictions where data rights are seen differently: China and India, the United States and the European Union. In the latter, personal data was given special protection under the 2018
General Data Protection Regulation
The General Data Protection Regulation (Regulation (EU) 2016/679), abbreviated GDPR, is a European Union regulation on information privacy in the European Union (EU) and the European Economic Area (EEA). The GDPR is an important component of ...
(GDPR).
The GDPR thus became the fifth of the 24 types of legislation listed in Annex 1 Table of existing and proposed European Directives and Regulations in relation to data.
Personal data are the basis for
behavioral advertising
Targeted advertising or data-driven marketing is a form of advertising, including online advertising, that is directed towards an audience with certain traits, based on the product or person the advertiser is promoting.
These traits can either ...
, and early in the 21st century their value began to grow exponentially, at least as measured in the market capitalization of the major platforms holding personal data on their respective users. European Union regulators reacted to this perceived power imbalance between platforms and users, although much still hinges on the terms of consent given by users to the platforms. The concept of data portability comprises an attempt to correct the perceived power imbalance by introducing an element of competition allowing users to choose among platforms.
Online platforms
With the advent of the
General Data Protection Regulation
The General Data Protection Regulation (Regulation (EU) 2016/679), abbreviated GDPR, is a European Union regulation on information privacy in the European Union (EU) and the European Economic Area (EEA). The GDPR is an important component of ...
s (GDPR),
social media platform
Social media are interactive technologies that facilitate the creation, sharing and aggregation of content (such as ideas, interests, and other forms of expression) amongst virtual communities and networks. Common features include:
* Onlin ...
s such as
Twitter
Twitter, officially known as X since 2023, is an American microblogging and social networking service. It is one of the world's largest social media platforms and one of the most-visited websites. Users can share short text messages, image ...
,
Instagram
Instagram is an American photo sharing, photo and Short-form content, short-form video sharing social networking service owned by Meta Platforms. It allows users to upload media that can be edited with Social media camera filter, filters, be ...
,
Snapchat
Snapchat is an American multimedia social media and instant messaging app and service developed by Snap Inc., originally Snapchat Inc. One of the principal features of the app are that pictures and messages, known as "snaps", are usually availa ...
, and the
Wall Street Journal
''The Wall Street Journal'' (''WSJ''), also referred to simply as the ''Journal,'' is an American newspaper based in New York City. The newspaper provides extensive coverage of news, especially business and finance. It operates on a subscriptio ...
online subscriber community have widely adopted the ability to export and download user data into a
ZIP archive file
In computing, an archive file stores the content of one or more files, possibly compressed, with associated metadata such as file name, directory structure, error detection and correction information, commentary, compressed data archives, sto ...
. Other platforms such as
Google
Google LLC (, ) is an American multinational corporation and technology company focusing on online advertising, search engine technology, cloud computing, computer software, quantum computing, e-commerce, consumer electronics, and artificial ...
and
Facebook
Facebook is a social media and social networking service owned by the American technology conglomerate Meta Platforms, Meta. Created in 2004 by Mark Zuckerberg with four other Harvard College students and roommates, Eduardo Saverin, Andre ...
were equipped with export options earlier. Some platforms restrict exports with time delays between each, such as once per 30 days on Twitter, and many platforms lack partial export options.
Other sites such as
Quora
Quora is an American social question-and-answer website and online knowledge market headquartered in Mountain View, California. It was founded on June 25, 2009, and made available to the public on June 21, 2010. Users can post questions, answ ...
and Bumble offer no automated request form, requiring the user to request a copy of their data through personal support
email
Electronic mail (usually shortened to email; alternatively hyphenated e-mail) is a method of transmitting and receiving Digital media, digital messages using electronics, electronic devices over a computer network. It was conceived in the ...
.
Ratings and reviews
Reputation portability refers to the ability of an individual to transfer their reputation or credibility from one context to another. This concept is becoming increasingly important in today's interconnected world, where individuals are involved in multiple online and offline communities.
The idea behind reputation portability is that an individual's reputation should not be tied solely to a single community or platform. Rather, it should be transferable across different contexts, such as professional networks, social media platforms, and online marketplaces. This enables individuals to maintain a consistent reputation across various contexts, which can be beneficial in terms of building trust, and overcoming the so-called "cold-start" problem, and hence mitigating platform lock-in.
Overall, reputation portability is an important concept in today's digital landscape, and research has shown that imported reputation can serve as viable signals for building trust. As technology continues to evolve, it is likely that reputation portability will become increasingly important in shaping how we interact with each other online and offline.
In consumer electronics
Mobile devices
Some
mobile app
A mobile application or app is a computer program or software application designed to run on a mobile device such as a smartphone, phone, tablet computer, tablet, or smartwatch, watch. Mobile applications often stand in contrast to desktop appli ...
s restrict data portability by storing user data in locked
directories while lacking
export
An export in international trade is a good produced in one country that is sold into another country or a service provided in one country for a national or resident of another country. The seller of such goods or the service provider is a ...
options. Such may include
configuration file
A configuration file, a.k.a. config file, is a computer file, file that stores computer data, data used to configure a software system such as an application software, application, a server (computing), server or an operating system.
Some applic ...
s,
digital bookmarks,
browsing history
Web browsing history refers to the list of web pages a user has visited, as well as associated metadata such as page title and time of visit. It is usually stored locally by web browsers in order to provide the user with a history list to go back ...
and
sessions (e.g. list of open
tabs and navigation histories), watch and search histories in multimedia
streaming
Streaming media refers to multimedia delivered through a network for playback using a media player. Media is transferred in a ''stream'' of packets from a server to a client and is rendered in real-time; this contrasts with file downl ...
apps, custom
playlists
A playlist is a list of video or audio files that can be played back on a media player, either sequentially or in a shuffled order. In its most general form, an audio playlist is simply a list of songs that can be played once or in a loop. ...
in
multimedia player software, entries in
note taking and
memorandum
A memorandum (: memorandums or memoranda; from the Latin ''memorandum'', "(that) which is to be remembered"), also known as a briefing note, is a Writing, written message that is typically used in a professional setting. Commonly abbreviation, ...
software, digital
phone books (
contact lists), call logs from the telephone app, and conversations through
SMS and
instant messaging
Instant messaging (IM) technology is a type of synchronous computer-mediated communication involving the immediate ( real-time) transmission of messages between two or more parties over the Internet or another computer network. Originally involv ...
software.
Locked directories are inaccessible to an end-user without extraordinary measures such as so-called
rooting (Android)
Rooting is the process by which users of Android (operating system), Android devices can attain Privilege escalation, privileged control (known as root access) over various subsystems of the device, usually smartphones and tablet computer, tablets ...
or
jailbreaking (iOS)
iOS jailbreaking is the use of a privilege escalation exploit to remove software restrictions imposed by Apple on devices running iOS and iOS-based operating systems. It is typically done through a series of kernel patches. A jailbroken dev ...
.
The former requires the so-called
boot loader
A bootloader, also spelled as boot loader or called bootstrap loader, is a computer program that is responsible for booting a computer and booting an operating system. If it also provides an interactive menu with multiple boot choices then it's o ...
of the device to be in an unlocked state in advance, which it usually is not by default. Toggling that state involves a full erasure of all user data, known as the ''wipe'', making it a
vicious cycle
A vicious circle (or cycle) is a complex chain of events that reinforces itself through a feedback loop, with detrimental results. It is a system with no tendency toward equilibrium (social, economic, ecological, etc.), at least in the short ...
if the user's aim were to access their locked data.
Other mobile apps only allow the creation of user data backups using
proprietary software
Proprietary software is computer software, software that grants its creator, publisher, or other rightsholder or rightsholder partner a legal monopoly by modern copyright and intellectual property law to exclude the recipient from freely sharing t ...
provided by the vendor, lacking the ability to directly export the data to a local file in the mobile device's common user data directory. Such said software requires an external host computer to run on.
Some device vendors offer
cloud storage
Cloud storage is a model of computer data storage in which data, said to be on "the cloud", is stored remotely in logical pools and is accessible to users over a network, typically the Internet. The physical storage spans multiple servers (so ...
and synchronisation services for backing up data. Such services however require registration and depend on internet connection and preferably high internet speeds and data plan limits if used regularly. Some services may only allow moving parts of the data such as text messages and
phone books between locked directories on devices of the same vendor (
vendor lock-in
In economics, vendor lock-in, also known as proprietary lock-in or customer lockin, makes a customer dependent on a vendor for products, unable to use another vendor without substantial switching costs.
The use of open standards and alternati ...
), without the ability to export the information into local files directly accessible by the end user.
Restrictions added in more recent versions of
operating system
An operating system (OS) is system software that manages computer hardware and software resources, and provides common daemon (computing), services for computer programs.
Time-sharing operating systems scheduler (computing), schedule tasks for ...
s, such as ''
scoped storage'', which is claimed to have been implemented with the aim to improve user privacy, compromise both
backwards compatibility
In telecommunications and computing, backward compatibility (or backwards compatibility) is a property of an operating system, software, real-world product, or technology that allows for interoperability with an older legacy system, or with Input ...
to established existing software such as
file manager
A file manager or file browser is a computer program that provides a user interface to manage computer files, files and folder (computing), folders. The most common Computer file#Operations, operations performed on files or groups of files incl ...
s and
FTP server applications, as well as legitimate uses such as cross-app communication and facilitating large
file transfer
File transfer is the transmission of a computer file through a communication channel from one computer system to another. Typically, file transfer is mediated by a communications protocol. In the history of computing, numerous file transfer protoc ...
s and
backup
In information technology, a backup, or data backup is a copy of computer data taken and stored elsewhere so that it may be used to restore the original after a data loss event. The verb form, referring to the process of doing so, is "wikt:back ...
creation.
Further possible restraints on data portability are poor reliability, stability and performance of existing means of data transfer, such as described in .
Digital video recorders
Some
digital video recorder
A digital video recorder (DVR), also referred to as a personal video recorder (PVR) particularly in Canadian and British English, is an electronic device that records video in a digital format to a disk drive, USB flash drive, SD memory card, SS ...
s (DVRs) which store recordings on an internal hard drive lack the ability to back up recordings, forcing a user to delete existing recordings upon exhausted disk space, which is an instance of poor data portability.
Some DVRs have an
operating system
An operating system (OS) is system software that manages computer hardware and software resources, and provides common daemon (computing), services for computer programs.
Time-sharing operating systems scheduler (computing), schedule tasks for ...
that depends on an Internet connection to boot and operate, meaning that recordings stored locally are inaccessible if no internet connection is available. If service for the device gets deprecated by the television service provider, the existing recordings become inaccessible and thus considerably lost.
Other appliances
Cordless
The term cordless is generally used to refer to electrical or electronic devices that are powered by a battery (electricity), battery or battery pack and can operate without a power cord or cable attached to an electrical outlet to provide mains ...
landline telephone
A landline is a physical telephone connection that uses Metal wire, metal wires or optical fiber from the subscriber's premises to the network, allowing multiple phones to operate simultaneously on the same phone number. It is also referred to a ...
units, as well as their associated base stations, which have
firmware
In computing
Computing is any goal-oriented activity requiring, benefiting from, or creating computer, computing machinery. It includes the study and experimentation of algorithmic processes, and the development of both computer hardware, h ...
s with
phone book and
SMS messaging functionality, commonly lack an interface to connect to a computer for backing the data up.
In software
Some software such as the
''Discourse'' forum software offers a built-in ability for users to download their posts into an archive file.
Other software may operate locally, but store user data in a
proprietary format
A proprietary file format is a file format of a company, organization, or individual that contains data that is ordered and stored according to a particular encoding-scheme, such that the decoding and interpretation of this stored data is easily ac ...
, thus causing
vendor lock-in
In economics, vendor lock-in, also known as proprietary lock-in or customer lockin, makes a customer dependent on a vendor for products, unable to use another vendor without substantial switching costs.
The use of open standards and alternati ...
until successfully
reverse-engineered by third party developers.
By country
European Union
The right to data portability was laid down in the European Union's
General Data Protection Regulation
The General Data Protection Regulation (Regulation (EU) 2016/679), abbreviated GDPR, is a European Union regulation on information privacy in the European Union (EU) and the European Economic Area (EEA). The GDPR is an important component of ...
(GDPR) passed in April 2016. The regulation applies to data processors, whether inside or outside the EU, if they process data on individuals who are physically located within an EU member state.
Earlier the
European Data Protection Supervisor had stated that data portability could "let individuals benefit from the value created by the use of their personal data".
The European-level
Article 29 Data Protection Working Party held a consultation on this in English lasting until the end of January 2017.
Their guidelines and FAQ on the right to data portability contain this call for action:
The French national data supervisor
CNIL hosted a discussion in French. Current participants offer opinions on how the legislation provides few benefits for companies, but many for users.
In April 2017, new guidelines were published on the Article 29 Working Party website.
In late 2019 the Data Governance Act was published by the Commission.
In 2021 researchers, many of them French and Finnish, published a 46-page report covering the state-of-the-art.
In 2022 the European Commission published the Data Act.
Although the United Kingdom
voted to withdraw from the EU, it intends to incorporate much of the GDPR in its own legislation, which will include data portability, as "...the GDPR itself contains some noteworthy innovations – for instance… the introduction of a new right to data portability".
In November at the Internet Governance Forum 2019 in Berlin panelists reported that Article 20 GDPR is not actionable, neither legally nor technically. In the UK—ironically post-Brexit—researchers are monitoring developments.
Germany has called to strengthen the European Union's right to data portability using competition law. A commission was set up for the purpose of proposing improvements.
Switzerland
Likewise, in Switzerland, a nation-state that is related to the EU only on a bilateral basis and as an
EFTA
The European Free Trade Association (EFTA) is a regional trade organization and free trade area consisting of four European states: Iceland, Liechtenstein, Norway and Switzerland. The organization operates in parallel with the European Union ...
member state, there has been a trend moving in the same direction. The Swiss view was officially published in March 2018 (as a document in PDF).
An association proposed to have a right to data portability anchored in the constitution of the Swiss Confederation. A law was passed that includes data portability; as described here in German
and here in French. The association partners with a cooperative called MIDATA.coop, which will offer users a place to store their data.
A second association has issued its guideline on the topic.
Over the longer term, the Swiss may have to consider that data portability is in the GDPR. Given that the GDPR will raise compliance costs for EU-based companies, it is unlikely that the EU would tolerate a situation with third-party countries in which Swiss companies would not be held to the same standard in order to keep competition fair. The legal terms involved are adequacy and reciprocity.
United States, California
California has a Consumer Privacy Act (CCPA) of 2018, which introduces data portability to the USA.
Canada
Canada anticipates a law in that it shows Transparency, Portability and Interoperability as Principle No. 4 of its Digital Charter.
India
Data portability is included in the
Personal Data Protection Bill 2019 about to become law as section 26 in chapter VI.
Brazil
Data portability is included in the
Privacy law#Brazil as its Article 18.
Australia
In Australia, a
Consumer Data Right has been proposed.
Thailand
Data portability is included in the new law.
Kenya
A right to data portability is enshrined in the new data protection law under clause 34. However, the intentions behind the new law, its enforcement and relation to the government's new
Identity management
Identity and access management (IAM or IdAM) or Identity management (IdM), is a framework of policies and technologies to ensure that the right users (that are part of the ecosystem connected to or within an enterprise) have the appropriate acce ...
system have already been contested.
Requirements for effective data interoperability
It is always tricky for legislators to regulate at the right level of precision, as everyone understands technology will evolve faster than the law. So far, only the European Union has formalized the expectations around data portability, requiring the data "in a structured, commonly used, machine-readable and interoperable format".
This touches on at least two distinct technical requirements for effective interoperability:
* the need to use file standards that allow for easy reuse (for instance CSV or JSON instead of PDF or even printed paper), encompassed by a "structured, commonly used, machine-readable" format.
* the need (hinging on "interoperable") to consider not only an individual's data release on its own, but also in conjunction with other systems and other individuals' data releases from the same company. This hints at requirements regarding data schemas, versioning and specification of those schemas in case of frequent changes, and generally the absence of efforts on the part of the source data controller to complicate the effective interoperability downstream.
Likewise, European researchers stress that there are both practical and legal gaps that the EU should fill.
Rights of data subjects under the European Union's new GDPR
The list of these rights has grown.
Data portability in relation to the right of access
The data portability right is slightly different from the
Right of access to personal data; see
GDPR
The General Data Protection Regulation (Regulation (EU) 2016/679), abbreviated GDPR, is a European Union regulation on information privacy in the European Union (EU) and the European Economic Area (EEA). The GDPR is an important component of ...
and the seventh item in the list cited immediately above. The right of access only mandates that the data subject gets to see their personal data. The old EU Data Protection Directive used to require explicitly in such cases for the data to be provided in "intelligible" form, which has been interpreted so far as "human readable". This requirement is still somewhat present in the EU's General Data Protection Regulation, but only implicitly in conjunction with
Recital (law). Since the right to portability is mostly concerned with reuse by other services (i.e. most likely automated), it could be that both "human readable" and "raw format" would be inappropriate for effective data portability. Some intermediate level might need to be sought.
In addition, the GDPR limits the scope of data portability to cases where the processing is made on the basis of either consent of the data subject, or the performance of a contract.
Data portability in relation to the right of explanation
The data portability right is related to the "
right to explanation", i.e. when automated decisions are made that have legal effect or significant impact on individual data subjects. How to display an algorithm? One way is through a
decision tree
A decision tree is a decision support system, decision support recursive partitioning structure that uses a Tree (graph theory), tree-like Causal model, model of decisions and their possible consequences, including probability, chance event ou ...
. This right, however, was found to be not very useful in an empirical study.
The right to explanation is related to the "Right to not be evaluated on the basis of automated processing" shown as the last item in the list shown in Gabel / Hickman. This includes decisions based on
profiling. Such a right was included in the EU Data Protection Directive of 1995, but not much enforcement followed. An article in ''Wired'' emphasised the poignancy of the discussion. The issue has been discussed by Bygrave, and by Hildebrandt, who claimed this to be one of the most important transparency rights in the era of machine learning and
big data
Big data primarily refers to data sets that are too large or complex to be dealt with by traditional data processing, data-processing application software, software. Data with many entries (rows) offer greater statistical power, while data with ...
. Contrary to Hildebrandt's high expectations in 2012, four years later, after many revisions to the GDPR, when the text was finalized, three other well-known authors contest whether a right to explanation still exists in the GDPR (see below).
In the United States there was a description of related developments in a seminal book by law professor Frank Pasquale; the relevant passages were reviewed by the Electronic Privacy Information Center (EPIC). Even the U.S. Defense Advanced Research Projects Agency DARPA has an
Explainable AI (XAI) program
cited critically by blogger Artur Kiulian.
Several papers have been published on these topics in 2016, the first of which, by Goodman / Flaxman, outlines the development of the right to explanation. Pasquale does not think the approach goes far enough, as he has stated in a blog entry at the
London School of Economics
The London School of Economics and Political Science (LSE), established in 1895, is a public research university in London, England, and a member institution of the University of London. The school specialises in the social sciences. Founded ...
(LSE). In fact at LSE there is a whole series on Algorithmic Accountability of which that was one entry in Feb. of 2016, and other notable ones were by Joshua Kroll and
Mireille Hildebrandt.
Another 2016 paper, published by Katarinou et al., includes remarks on a right of appeal such that "individuals would have a right to appeal to a machine against a decision made by a human."
A third 2016 paper, one co-authored by Mittelstadt et al., maps the literature and relates it to the GDPR on its pages 13–14.
A fourth paper, one co-authored by Wachter, Mittelstadt and Floridi, refutes the idea that such a right might be included in the GDPR, proposes a limited 'right to be informed' instead and calls for the creation of an agency to implement the transparency requirement. A further paper by Edwards and Veale claims such a right is unlikely to apply in the cases of the 'algorithmic harms' attracting recent media attention, and that insufficient attention has been paid to both the computer science literature on explanation and how other GDPR provisions, such as data protection impact assessments and data portability, might help. Almost two years later a paper appeared that challenges earlier papers, especially Wachter / Mittelstadt / Floridi.
On both sides of the Atlantic, there has been recent activity pertaining to this ongoing debate. Early in 2016 experts on artificial intelligence and UK government officials met during a number of meetings, and developed a Data Science Ethical Framework. On November 7, 2016 an event was held in Brussels, organized by MEP Marietje Schaake in the European Parliament and described by danah Boyd. Only eleven days later at New York University there was a conference on "Fairness, Accountability, and Transparency in Machine Learning " where Principles for Accountable Algorithms and a Social Impact Statement for Algorithms were articulated and placed online for discussion. By mid-December the IEEE came out with a document whose editing was backed up by public comments that were invited by March 2017 on "Ethically Aligned Design".
Later in 2017 data portability was analysed by professors of data protection as a central innovation of the new GDPR.
See also
External wikiGDPR Hub maintained by Max Schrems et al.
*
Data Transfer Project
*
Ethics of artificial intelligence
The ethics of artificial intelligence covers a broad range of topics within AI that are considered to have particular ethical stakes. This includes algorithmic biases, Fairness (machine learning), fairness, automated decision-making, accountabili ...
*
General Data Protection Regulation
The General Data Protection Regulation (Regulation (EU) 2016/679), abbreviated GDPR, is a European Union regulation on information privacy in the European Union (EU) and the European Economic Area (EEA). The GDPR is an important component of ...
Notes
References
{{data
Digital rights
Interoperability