Qualitas Corpus Catalogue: Release 20130901

heritrix (Active, 2 versions)
Extensible, web-scale, archival-quality web crawler project
sysver: heritrix-1.8.0
fullname: Heritrix
domain: tool
jreversion: 1.4.2_19
license: GNU LESSER GENERAL PUBLIC LICENSE Version 2.1, February 1999;src/heritrix-1.8.0/LICENSE.txt
distribution: f
releasedate: 2006-05-05
sourcepackages: org.archive.crawler. org.archive.extractor org.archive.httpclient org.archive.io org.archive.io.arc org.archive.net org.archive.net.rsync org.archive.queue org.archive.util org.archive.util.fingerprint org.archive.util.iterator st.ata.util
n_bin: 531
n_both: 531
n_files: 422
n_top(bin): 422
loc(both): 92319
ncloc(both): 47272
url: http://crawler.archive.org/

sysver: heritrix-1.14.4
fullname: Heritrix
domain: tool
jreversion: 1.5.0_22
license: GNU LESSER GENERAL PUBLIC LICENSE Version 2.1, February 1999;src/heritrix-1.14.4/LICENSE.txt
distribution: r
releasedate: 2010-05-10
sourcepackages: org.archive. st.ata.util
n_bin: 706
n_both: 703
n_files: 551
n_top(bin): 553
loc(both): 118415
ncloc(both): 61681
url: http://crawler.archive.org/


Created 2013-09-05 02:41Z