📄 processor.options
字号:
# Available processors# Each processor class should be listed with full package info# followed by a '|' and a descriptive name (containing only [a-z,A-z])# Lines beginning with # and empty lines are ignoredorg.archive.crawler.prefetch.Preselector|Preselectororg.archive.crawler.prefetch.PreconditionEnforcer|Preprocessororg.archive.crawler.fetcher.FetchDNS|DNSorg.archive.crawler.fetcher.FetchHTTP|HTTPorg.archive.crawler.fetcher.FetchFTP|FTPorg.archive.crawler.extractor.ExtractorHTTP|ExtractorHTTPorg.archive.crawler.extractor.ExtractorHTML|ExtractorHTMLorg.archive.crawler.extractor.AggressiveExtractorHTML|AggressiveExtractorHTMLorg.archive.crawler.extractor.ExtractorCSS|ExtractorCSSorg.archive.crawler.extractor.ExtractorSWF|ExtractorSWForg.archive.crawler.extractor.ExtractorJS|ExtractorJSorg.archive.crawler.extractor.ExtractorPDF|ExtractorPDForg.archive.crawler.extractor.ExtractorDOC|ExtractorDOCorg.archive.crawler.extractor.ExtractorXML|ExtractorXMLorg.archive.crawler.extractor.ExtractorUniversal|ExtractorUniversalorg.archive.crawler.extractor.ExtractorURI|ExtractorURIorg.archive.crawler.extractor.ExtractorImpliedURI|ExtractorImpliedURIorg.archive.crawler.extractor.ChangeEvaluator|ChangeEvaluatororg.archive.crawler.extractor.HTTPContentDigest|HTTPContentDigestorg.archive.crawler.writer.ARCWriterProcessor|Archiverorg.archive.crawler.writer.WARCWriterProcessor|WARCArchiverorg.archive.crawler.writer.Kw3WriterProcessor|Kw3Archiverorg.archive.crawler.writer.MirrorWriterProcessor|MirrorWriterorg.archive.crawler.postprocessor.CrawlStateUpdater|Updaterorg.archive.crawler.postprocessor.LinksScoper|LinksScoperorg.archive.crawler.postprocessor.SupplementaryLinksScoper|SupplementaryLinksScoperorg.archive.crawler.postprocessor.FrontierScheduler|FrontierSchedulerorg.archive.crawler.postprocessor.LowDiskPauseProcessor|LowDiskPauseorg.archive.crawler.postprocessor.WaitEvaluator|WaitEvaluatororg.archive.crawler.postprocessor.ContentBasedWaitEvaluator|ContentBasedWaitEvaluatororg.archive.crawler.postprocessor.TextWaitEvaluator|TextWaitEvaluatororg.archive.crawler.postprocessor.ImageWaitEvaluator|ImageWaitEvaluatororg.archive.crawler.postprocessor.AcceptRevisitProcessor|AcceptRevisitProcessororg.archive.crawler.postprocessor.RejectRevisitProcessor|RejectRevisitProcessororg.archive.crawler.processor.LexicalCrawlMapper|LexicalCrawlMapperorg.archive.crawler.processor.HashCrawlMapper|HashCrawlMapperorg.archive.crawler.processor.BeanShellProcessor|BeanShellProcessororg.archive.crawler.prefetch.QuotaEnforcer|QuotaEnforcerorg.archive.crawler.prefetch.RuntimeLimitEnforcer|RuntimeLimitEnforcerorg.archive.crawler.extractor.JerichoExtractorHTML|JerichoExtractorHTMLorg.archive.crawler.processor.recrawl.PersistStoreProcessor|PersistStoreProcessororg.archive.crawler.processor.recrawl.PersistLogProcessor|PersistLogProcessororg.archive.crawler.processor.recrawl.PersistLoadProcessor|PersistLoadProcessororg.archive.crawler.processor.recrawl.FetchHistoryProcessor|FetchHistoryProcessororg.archive.crawler.extractor.TrapSuppressExtractor|TrapSuppressExtractor
⌨️ 快捷键说明
复制代码
Ctrl + C
搜索代码
Ctrl + F
全屏模式
F11
切换主题
Ctrl + Shift + D
显示快捷键
?
增大字号
Ctrl + =
减小字号
Ctrl + -