Web Archiving Service Evaluation
Chris Prom has a very helpful blog entry about his progress on evaluating web archiving service providers. His first three installments were reviews of open source software such as HTTrack, GNU Wget...
View ArticleWeb Archiving at the Library of Congress
Since 2000 the Library of Congress has been working to collect and preserve web sites. Their project had been called MINERVA, now it is known as The Library of Congress Web Archives (LCWA). They have...
View ArticleHeritrix: The Internet Archive’s web crawler project
Heritrix is the Internet Archive’s open-source, extensible, web-scale, archival-quality web crawler project. It’s being used and supported by such institutions as the Library of Congress, the National...
View Article
More Pages to Explore .....