[0]Hugh Pickens writes "The chief executive of the British Library, Lynne Brindley, says that our cultural heritage is at risk as the Internet evolves and technologies become obsolete, and that historians and citizens face [1]a 'black hole' in the knowledge base of the 21st century unless urgent action is taken to preserve websites and other digital records. For example, when Barack Obama was inaugurated as US president last week, [2]all traces of George W. Bush disappeared from the White House website. There were more than 150 websites relating to the 2000 Olympics in Sydney that vanished instantly at the end of the games and are now stored only by the National Library of Australia. 'If websites continue to disappear in the same way as those on President Bush and the Sydney Olympics... the memory of the nation disappears too,' says Brindley. ...
1. http://www.guardian.co.uk/technology/2009/jan/25/internet-heritage
CFM >> This is appropriate? The world is growing it's 'document' production systems, some now automated! The exponential growth in reading materials, audio and video records is undeniable.
BUT, The librarian is not tasked with capturing ALL of it !
The librarian is tasked with keeping a representative overview of things, within the budgets available. This forces SELECTION !
We need to select from the vast chatter on the internet and in more traditional media, what we wish to be available to future generations and researchers. This selection process needs also to be matched with a system to facilitate a researcher "finding" and perhaps "translating" the material efficiently, as the quantity of material grows. However, some system of 'valuing' documents is needed, perhaps Oxford and Cambridge university presses did this in times past,along with newspaper editors. Should there be a more democratic valuing system today ?
I think not. I think that various organisations will retain what they feel is important, and perhaps make that available to the internet users, or not. I think this is appropriate. Having everything ever recorded in the internet search engines will dilute the value of the search engine !
Also considering that with vast amounts of material, future researchers will only be able to use automated searches, and pulling the really valuable (and value is a property of the valuer, not the item) documents, which are on target with respect to the query, will become increasingly difficult.
Perhaps a new "Decimal Dewey Classification(*)" system, or PageRank(+) algorithm is needed, designed to facilitate searches of documents based on tags, time-of-writing, perspective-of-author and perspective-0f-reader. The last two properties here are obviously multi-dimensional, and may benefit from an agreed clasification system themselves.
* (from Wikipedia) The Dewey Decimal Classification (DDC, also called the Dewey Decimal System) is a proprietary system of library classification developed by Melvil Dewey in 1876, and has since then been greatly modified and expanded through twenty-two major revisions, the most recent in 2004. The system is a method for placing books on library shelves in a specific and repeatable order that makes it easier to find any specific book or to return it to its proper place.
+ PageRank is a link analysis algorithm used by the Google Internet search engine that assigns a numerical weighting to each element of a hyperlinked set of documents, such as the World Wide Web, with the purpose of "measuring" its relative importance within the set. The algorithm may be applied to any collection of entities with reciprocal quotations and references.