3.16. Crawl Filter#


This page is under construction.

Plugin Key

mediatype_crawl_filter_factory, where mediatype is a media type like text/html

Plugin Value Type


Plugin Value Format

The value is the fully qualified name of a Java class implementing the org.lockss.plugin.FilterFactory interface.


If files of a given media type need to be pre-processed (filtered) before URLs are extracted by the crawler using a Link Extractor, this plugin feature can be used to point at custom filtering code.

Crawl filters are somewhat related to hash filters.