Information archaeology

While some information is old in minutes, there’s plenty of information in your organisation you need to keep for many years. Take action now, or older file formats could be unreadable.

With your inbox always full to overflowing, you may not look back at older messages and documents very often, but that's where most of the knowledge in your organisation is buried.

Search tools make it easier to find things without digging by hand, but what happens to the older files and the messages you have to delete to stay under your mail quota? It's not just about the storage space: file formats change, and older applications won't always run on the latest version of Windows, even if you can find the software you were using 10 years ago.

It's a major issue for governments, says Natalie Ceeney, the chief executive of the National Archives. "We're used to being able to walk into a business and read documents in Word. We expect our salary to be paid, we expect to get a pension. That means the government needs to keep pension records. We need to keep records on where we bury nuclear waste. We need to make sure censuses taken today are readable in a hundred years. But digital information is inherently more ephemeral than paper and we are living in a world with a ticking time bomb."

The problem has already reached the oil and gas industry. Many of the 'elephant' oil fields discovered recently aren't new; it's just that the analysis and extraction techniques have improved to the point that it's now economical to mine them. But if the survey is 20 years old, is it cheaper to rescue the data or do the survey all over again?

How about a company that closes a research lab and then needs to defend a patent developed by that lab, or restart old research using a new generation of technology? A pharmaceutical company that needs to play video footage from a 20-year-old clinical study has the same problem as a police force that needs to produce video evidence used for a conviction that's appealed a decade later.

Adam Farquhar, head of digital library technology at the British Library, is part of the EU's PLANETS project (Preservation and Long-term Access through Networked Services), which calculates that losing access to documents costs businesses across Europe €3 billion (£2.7 billion) every year.

"Billions and billions of documents representing billions of Euros are at substantial risk," he believes. "This affects everyone who keeps digital media for more than 15 years."

Physical failure and formats

Failure of the media that files are stored on is a familiar problem. CDs often last fewer than 10 years, magnetic tape has to be spooled regularly to keep it readable, and older drives may no longer be available. Even if you do have both the media and the drive, you need drivers for your current operating system and the right connector.

Assuming the files are available, the real issue is the file format. Microsoft Office includes converters for many older formats from Microsoft and other vendors, as does, but subtle formatting changes can cause problems if a document is repaginated and a legal agreement refers to a clause on a specific page, for example.
