Using Google Takeout to reclaim your data
Everyone knows your data drives the Google machine, but now you can find out exactly what it holds on you
Do you use Google products and services? Given there are just shy of two billion Google accounts, the chances are pretty high that you do. Everyone knows that what drives the Google machine is data; your data.
I’ve written a lot over the years about ways to strengthen your Google security posture and how to lock down your privacy as far as you can while reaping the benefits of Google Workspace (foremerly G Suite). I’m not here to repeat myself, but instead shed light on how to find out what Google knows about you.
Well, more precisely, how to download the data Google holds from your use of services such as Gmail, Calendar, Maps, Pay, among countless others.
What is Google Takeout?
Google Takeout is a data visibility tool developed by in-house engineers collectively named, and I kid you not, the Google Data Liberation Front. It has a multitude of use cases, including making an archive of your contacts and images for example, but also discovering just what personal data Google holds under your account umbrella.
You’ll need to sign into your Google account if you haven't already, and then head to the Google Takeout site to get started. Here you’ll be met with a list of Google products from which you can select the specific data you wish to download. In my case, this meant a staggering 50 product categories to choose from. Your mileage will vary, as will just how much data you will end up downloading.
If you’ve been paying attention and following some of my privacy tips across the years then chances are you’ve already set expiration dates on certain data sets, but even so, be prepared for a large ZIP file. Also, be prepared for a wait before you can download it. Google itself warns this can take “possibly hours or days” to complete; you’ll get a link by email once the report is ready.
This is where tailoring the archive export to your own needs comes in useful. By way of an example that can hugely reduce both download size and report preparation time, you might want to exclude the self-explanatory “access log activity”. If you’re curious, you can customise the download by filtering either by activity (a list of Google services accessed by your devices, such as your phone syncing with Gmail), or devices that have accessed your account over the collection period.
What data can you reclaim from Google?
Just by browsing through the list itself you’ll likely be shocked, or perhaps just have your thoughts on data collection validated. I hope you’re sitting down as the list includes:
- Android Device Configuration Service: Device attributes, account identifiers, etc
- Blogger: Blogs, posts, comments, settings
- Calendar: Data
- Chrome: Bookmarks, autofill data, extensions, browser history, etc
- Cloud Print: History
- Data Shared for Research
- Drive: All your files
- Fit: Workout data, sleep data, daily metrics, etc
- Gmail: Messages and attachments
- Google Account: Activity
- Google Cloud Search: Content metadata ingested as part of the indexing flow
- Google Pay: Activity, transaction history, etc
- Google Store: Purchases, subscriptions, reviews
- Hangouts: Conversation history and attachments
- Home: Device and history data
- Keep: Notes and attachments
- Location: History data
- Maps: Preferences and personal places
- My Activity: Activity data with images and audio attachments
- News: Publications, categories and source data
- Photos and videos
- Play Store: App installs, ratings
- Shopping: Order history, addresses, reviews, loyalty
- Reminders: History
- Search: Ratings, reviews and ‘other contributions’
- Tasks: Open and completed task data
- YouTube/YouTube Music: Watch and search history, videos, comments and content created
I didn’t even include everything that I could choose from within that epic list.
How to export your Google data
Once you’ve chosen the data you want to export, you click a couple of buttons and then wait for that email. Be warned, your ZIP file could be a few GBs in size depending on what you’ve selected and how much data Google has a hold of. Also note that videos and images bump sizes up considerably.
It's worth mentioning, at this point, that you might be restricted by your organisation's internal Google Workspace policies, as far as exporting your personal data is concerned. If that's the case, you'll need to have a conversation with your IT adminstrators.
Unzip the archive and you’ll have folders for each category which contain a combination of .json, .mbox, .html and .csv files. If you want to delete at least some of this data without deleting your Google Account or a particular service, head to the My Activity page where you can do so for web and app activity, location history and YouTube history.
Services you no longer have any use for can be deleted from the Data & Privacy section of your Google account, where you can also delete the account itself along with all data if you’re going for the nuclear option.
Accelerating AI modernisation with data infrastructure
Generate business value from your AI initiativesFree Download
Recommendations for managing AI risks
Integrate your external AI tool findings into your broader security programsFree Download
Modernise your legacy databases in the cloud
An introduction to cloud databasesFree Download
Powering through to innovation
IT agility drive digital transformationFree Download