-
@Component() public final class ParseComponent

Parser checker, useful for testing parsers. It also accurately reports possible fetching and parsing failures and presents protocol status signals to aid debugging. The tool enables us to retrieve the following data from any
-
-
Nested Class Summary
Nested Classes
Modifier and Type    Class                       Description
public class         ParseComponent.Companion
-
Field Summary
Fields
Modifier and Type                   Field                 Description
private final CrawlFilters          crawlFilters
private final PageParser            pageParser
private final GlobalCacheFactory    globalCacheFactory
private final ImmutableConfig       conf
-
Constructor Summary
Constructors
Constructor                                                                                                                          Description
ParseComponent(GlobalCacheFactory globalCacheFactory, ImmutableConfig conf)
ParseComponent(CrawlFilters crawlFilters, PageParser pageParser, GlobalCacheFactory globalCacheFactory, ImmutableConfig conf)
-
Method Summary
Modifier and Type            Method                                                            Description
final CrawlFilters           getCrawlFilters()
final PageParser             getPageParser()
final GlobalCacheFactory     getGlobalCacheFactory()
final ImmutableConfig        getConf()
final ParseResult            parse(WebPage page, Boolean reparseLinks, Boolean noLinkFilter)
final Map<String, Object>    getTraceInfo()
-
Constructor Detail
-
ParseComponent
ParseComponent(GlobalCacheFactory globalCacheFactory, ImmutableConfig conf)
-
ParseComponent
ParseComponent(CrawlFilters crawlFilters, PageParser pageParser, GlobalCacheFactory globalCacheFactory, ImmutableConfig conf)
-
-
Method Detail
-
getCrawlFilters
final CrawlFilters getCrawlFilters()
-
getPageParser
final PageParser getPageParser()
-
getGlobalCacheFactory
final GlobalCacheFactory getGlobalCacheFactory()
-
getConf
final ImmutableConfig getConf()
-
parse
final ParseResult parse(WebPage page, Boolean reparseLinks, Boolean noLinkFilter)
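The call shape of parse can be illustrated with a minimal, self-contained sketch. Everything in it is a stand-in: the WebPage, ParseResult and StubParseComponent classes below are hypothetical and are not the library's classes. The sketch assumes that noLinkFilter set to true means extracted links bypass the link filter; this reference does not document that behavior, and reparseLinks is accepted only to mirror the signature.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

public class ParseCallSketch {
    // Hypothetical stand-in for WebPage: a URL and the raw links found on it.
    static final class WebPage {
        final String url;
        final List<String> rawLinks;

        WebPage(String url, List<String> rawLinks) {
            this.url = url;
            this.rawLinks = rawLinks;
        }
    }

    // Hypothetical stand-in for ParseResult.
    static final class ParseResult {
        final boolean success;
        final List<String> links;

        ParseResult(boolean success, List<String> links) {
            this.success = success;
            this.links = links;
        }
    }

    // Stub with the same parse(...) parameter list as the documented method.
    static final class StubParseComponent {
        private final Predicate<String> crawlFilter;

        StubParseComponent(Predicate<String> crawlFilter) {
            this.crawlFilter = crawlFilter;
        }

        // ASSUMPTION: noLinkFilter == true bypasses the link filter.
        // reparseLinks mirrors the signature but is not modeled here.
        ParseResult parse(WebPage page, Boolean reparseLinks, Boolean noLinkFilter) {
            boolean skipFilter = Boolean.TRUE.equals(noLinkFilter);
            List<String> kept = new ArrayList<>();
            for (String link : page.rawLinks) {
                if (skipFilter || crawlFilter.test(link)) {
                    kept.add(link);
                }
            }
            return new ParseResult(true, kept);
        }
    }

    public static void main(String[] args) {
        WebPage page = new WebPage("https://example.com/",
                List.of("https://example.com/a", "mailto:someone@example.com"));
        StubParseComponent component =
                new StubParseComponent(link -> link.startsWith("https://"));

        // With the filter active, only the https link is kept.
        System.out.println(component.parse(page, false, false).links.size()); // 1
        // With noLinkFilter = true, both links are kept.
        System.out.println(component.parse(page, false, true).links.size());  // 2
    }
}
```

The parameters are declared as boxed Boolean to match the documented signature, so the stub treats a null flag the same as false instead of unboxing it directly.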
-
getTraceInfo
final Map<String, Object> getTraceInfo()
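Because getTraceInfo returns a plain Map<String, Object>, a caller might render it for debugging roughly as follows. This is only a sketch: the keys parseCount and lastUrl are invented for illustration, since this reference does not list the keys the map actually contains, and TraceInfoSketch is a hypothetical helper class.

```java
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

public class TraceInfoSketch {
    // Renders a trace-info map as "key=value" lines, sorted by key
    // so that successive dumps are easy to compare.
    static String format(Map<String, Object> traceInfo) {
        return new TreeMap<String, Object>(traceInfo).entrySet().stream()
                .map(entry -> entry.getKey() + "=" + entry.getValue())
                .collect(Collectors.joining("\n"));
    }

    public static void main(String[] args) {
        // Hypothetical keys; the real map contents are not documented here.
        Map<String, Object> sample = Map.of(
                "parseCount", 3,
                "lastUrl", "https://example.com/");
        System.out.println(format(sample));
    }
}
```

Sorting through a TreeMap is a design choice for stable output; it assumes the map has no null keys.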
-