Last week I attended a student conference on digital preservation, hosted by the Digital Preservation Coalition. The event was called ‘What I Wish I Knew Before I Started’, and several digital preservationists gave some very interesting insights into the skills and challenges of digital preservation. The basic message was that digital preservation is a big task which requires urgent action, but that archivists already have many of the skills needed to carry it out. I’ve posted about it at the futureArch blog, if anyone is interested in finding out more.
Digital Preservation: What I Wish I Knew Before I Started
January 30, 2012 by Rebecca Nielsen
Posted in Uncategorized | 4 Comments
4 Responses
Leave a Reply Cancel reply
Search this site:
Welcome to the Oxford Trainees blog!
This blog is independent of Oxford Libraries and it exists for the trainees to speak about what they are doing and learning about. We hope you enjoy following their journey.
This blog provides hypertext links to external sites. However, it is not responsible for the contents of any linked site, or any changes or updates to such sites.
Tag Cloud
adayinthelife alexander library archives blogging bodleian bodleian law library British Library careers christmas CILIP clsig current awareness digital archives Graduate trainee project How To information literacy interesting stories ITLP law law library librarians libraries library Library marketing libraryschool Library spaces new professionals conference OUCS Oxford Podcasts Projects Project showcase radcliffe science library role in society sherardian library slaeurope social media social science libraries technology Twitter wednesdays Welcome09 Welcome10 Welcome11 Welcome12Blogroll
Links
Archives
- May 2013
- April 2013
- March 2013
- February 2013
- January 2013
- December 2012
- November 2012
- October 2012
- September 2012
- August 2012
- July 2012
- May 2012
- April 2012
- February 2012
- January 2012
- November 2011
- October 2011
- September 2011
- August 2011
- July 2011
- June 2011
- May 2011
- March 2011
- February 2011
- January 2011
- December 2010
- November 2010
- October 2010
- September 2010
- August 2010
- May 2010
- March 2010
- February 2010
- January 2010
- December 2009
- November 2009
- October 2009
- September 2009
- August 2009
Authors
Hi Rebecca, thanks for your interesting post. I was wondering about the process of archiving a website: do you archive all the files onto your own server? And if so, how connected are you to the actual website – as in, are you only archiving oxford uni ones, or do you ask other people nicely if you can have their files. Basically, how does it work? Ta, Laurence.
Hi Laurence, thanks for your question! We make use of a service provided by the Internet Archive called Archive-It ( http://www.archive-it.org/ ) to crawl the websites, making a copy which is stored on their servers.
We do archive Oxford university websites, but we also archive websites that are related to existing collections within the Bodleian, providing we can get permission off the website owners. If you look at our Archive-It page ( http://www.archive-it.org/organizations/467 ) you can see which sites we’ve archived.
I hope that makes sense! Let me know if you have any more questions.
So you’re not so much copying the files themselves as how they are rendered by the web browser? Can you do that and still keep all the links working? Fancy. And what about a website that has a database behind it?… Maybe we should continue this in more detail at the pub…
Yes, all the links work. Basically we’re gathering the html code and all the other bits of information that make up a website (images, videos, flash, audio content, etc.) so that Archive-It can display it as an actual website. So all of the links within the website work as normal (or should do, any way). What’s really good is that if a site links to another website we archive, Archive-It can make the connection, so it works a bit like a mini-internet of archived sites almost (not a technical description).
As for databases, I’m not entirely sure, but I should think that if it can be crawled it ought to work. I’ll try to find out! I can report back at the pub…!