Thursday, 8. January 2004

Heritrix


Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Heritrix (sometimes spelled heretrix , or misspelled or missaid as heratrix / heritix / heretix / heratix ) is an archaic word for inheritess . Since our crawler seeks to collect the digital artifacts of our culture for the benefit of future researchers and generations, this name seemed apt.

¬> Heritrix

... Comment