A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
_
G
- gauge(String,MetricRegistry.MetricSupplier) - function in com.codahale.metrics.AppMetricRegistry
- GenerateComponent - class in ai.platon.pulsar.crawl.component
- Parser checker, useful for testing parser.
- GenerateComponent.Companion - class in ai.platon.pulsar.crawl.component.GenerateComponent
- GenerateComponent.Companion.Counter - class in ai.platon.pulsar.crawl.component.GenerateComponent.Companion
- generateNextId() - function in ai.platon.pulsar.session.AbstractPulsarSession.Companion
- GenerateOptions - class in ai.platon.pulsar.common.options
- generatorSortValue(WebPage,ScoreVector) - function in ai.platon.pulsar.crawl.scoring.ScoringFilter
- This method prepares a sort value for the purpose of sorting and selecting top N scoring pages during fetchlist generation.
- generatorSortValue(WebPage,ScoreVector) - function in ai.platon.pulsar.crawl.scoring.ScoringFilters
- Calculate a sort value for Generate.
- GENERIC - enum entry in ai.platon.pulsar.common.domain.TopLevelDomain.Type
- get(String) - function in ai.platon.pulsar.common.domain.DomainSuffixes
- Return the DomainSuffix object for the extension, if extension is atop level domain returned object will be an instance of TopLevelDomain
- get(Integer) - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- get(Enum) - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- get(String) - function in ai.platon.pulsar.context.PulsarContext
- get(String) - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- Get a webpage from the storage
- get(String) - function in ai.platon.pulsar.context.support.BasicPulsarContext
- Get a webpage from the storage
- get(String) - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- Get a webpage from the storage
- get(String) - function in ai.platon.pulsar.context.support.StaticPulsarContext
- Get a webpage from the storage
- get(Integer) - function in ai.platon.pulsar.crawl.common.WeakPageIndexer
- get() - function in java.util.concurrent.CompletableHyperlink
- get(Long,TimeUnit) - function in java.util.concurrent.CompletableHyperlink
- get() - function in java.util.concurrent.CompletableListenableHyperlink
- get(Long,TimeUnit) - function in java.util.concurrent.CompletableListenableHyperlink
- get(String) - function in ai.platon.pulsar.crawl.component.WebDbComponent
- get(Integer) - function in ai.platon.pulsar.crawl.fetch.batch.TaskSchedulers
- get() - function in ai.platon.pulsar.crawl.index.io.IndexDocumentWritable
- get(String) - function in ai.platon.pulsar.crawl.parse.html.OpenMapFields
- get(Name) - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector
- get(Integer) - function in ai.platon.pulsar.common.NamedScoreVector
- get() - function in ai.platon.pulsar.crawl.scoring.io.ScoreVectorWritable
- get(String) - function in ai.platon.pulsar.session.AbstractPulsarSession
- Get a page from database if exists
- get(String) - function in ai.platon.pulsar.session.BasicPulsarSession
- Get a page from database if exists
- get(String) - function in ai.platon.pulsar.session.PulsarSession
- Get a page from database if exists
- getAbstract() - function in ai.platon.pulsar.crawl.CrawlLoop
- getAbstract() - function in ai.platon.pulsar.crawl.AbstractCrawlLoop
- getAbstract() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- getAcceptUnknownOptions() - function in ai.platon.pulsar.common.options.CommonOptions
- getAcceptUnknownOptions() - function in ai.platon.pulsar.common.options.FetchOptions
- getAcceptUnknownOptions() - function in ai.platon.pulsar.common.options.GenerateOptions
- getAcceptUnknownOptions() - function in ai.platon.pulsar.common.options.InjectOptions
- getAcceptUnknownOptions() - function in ai.platon.pulsar.common.options.LinkOptions
- getAcceptUnknownOptions() - function in ai.platon.pulsar.common.options.LoadOptions
- getAcceptUnknownOptions() - function in ai.platon.pulsar.common.options.PulsarOptions
- getAcceptUnknownOptions() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getAcceptUnknownOptions() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getAcceptUnknownOptions() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getActivateSelector() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getActivateTimeout() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getActiveContext() - function in ai.platon.pulsar.context.PulsarContexts
- getActiveContexts() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyManager
- NOTE: we can use a priority queue and every time we need a context, take the top one
- getActiveTime() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractBrowserInstance
- getActiveTime() - function in ai.platon.pulsar.crawl.fetch.driver.NavigateEntry
- getAdddays() - function in ai.platon.pulsar.common.options.GenerateOptions
- getAfterComputeExpressions() - function in ai.platon.pulsar.crawl.ExpressionSimulateEventHandler
- getAlias() - function in ai.platon.pulsar.common.sites.amazon.AmazonSuggestion
- getAliases() - function in ai.platon.pulsar.crawl.parse.ParserConfig
- getAll(Integer) - function in ai.platon.pulsar.crawl.common.WeakPageIndexer
- getAllow() - function in ai.platon.pulsar.crawl.filter.BlockFilter
- getAllowLinks() - function in ai.platon.pulsar.common.collect.FatLinkExtractor.Companion.Counters
- getAllowParameterOverwriting() - function in ai.platon.pulsar.common.options.CommonOptions
- getAllowParameterOverwriting() - function in ai.platon.pulsar.common.options.FetchOptions
- getAllowParameterOverwriting() - function in ai.platon.pulsar.common.options.GenerateOptions
- getAllowParameterOverwriting() - function in ai.platon.pulsar.common.options.InjectOptions
- getAllowParameterOverwriting() - function in ai.platon.pulsar.common.options.LinkOptions
- getAllowParameterOverwriting() - function in ai.platon.pulsar.common.options.LoadOptions
- getAllowParameterOverwriting() - function in ai.platon.pulsar.common.options.PulsarOptions
- getAllowParameterOverwriting() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getAllowParameterOverwriting() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getAllowParameterOverwriting() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getANCHOR_ORDER() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getAnchorRegex() - function in ai.platon.pulsar.common.options.LinkOptions
- getApiPublicOptionNames() - function in ai.platon.pulsar.common.options.LoadOptions.Companion
- getApplicationContext() - function in ai.platon.pulsar.context.PulsarContext
- getApplicationContext() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getApplicationContext() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getApplicationContext() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getApplicationContext() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- getArgOrDefault(String,String) - function in ai.platon.pulsar.persist.FilterResult
- getArgOrDefault(String,String) - function in ai.platon.pulsar.persist.ParseResult
- getArgs() - function in ai.platon.pulsar.common.options.CommonOptions
- getArgs() - function in ai.platon.pulsar.common.options.FetchOptions
- getArgs() - function in ai.platon.pulsar.common.options.GenerateOptions
- getArgs() - function in ai.platon.pulsar.common.options.InjectOptions
- getArgs() - function in ai.platon.pulsar.common.options.LinkOptions
- getArgs() - function in ai.platon.pulsar.common.options.LoadOptions
- getArgs() - function in ai.platon.pulsar.common.options.PulsarOptions
- getArgs() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getArgs() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getArgs() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getArgs() - function in ai.platon.pulsar.common.urls.NormUrl
- getArgs() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getArgs() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getArgs() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getArgs() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- The url arguments
- getArgs() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getArgs() - function in ai.platon.pulsar.persist.FilterResult
- getArgs() - function in ai.platon.pulsar.persist.ParseResult
- getArgv() - function in ai.platon.pulsar.common.options.CommonOptions
- The argument vector
- getArgv() - function in ai.platon.pulsar.common.options.FetchOptions
- The argument vector
- getArgv() - function in ai.platon.pulsar.common.options.GenerateOptions
- The argument vector
- getArgv() - function in ai.platon.pulsar.common.options.InjectOptions
- The argument vector
- getArgv() - function in ai.platon.pulsar.common.options.LinkOptions
- The argument vector
- getArgv() - function in ai.platon.pulsar.common.options.LoadOptions
- The argument vector
- getArgv() - function in ai.platon.pulsar.common.options.PulsarOptions
- The argument vector
- getArgv() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- The argument vector
- getArgv() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- The argument vector
- getArgv() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- The argument vector
- getArity0BooleanParams() - function in ai.platon.pulsar.common.options.LoadOptions.Companion
- getArity1BooleanParams() - function in ai.platon.pulsar.common.options.LoadOptions.Companion
- getAuthToken() - function in ai.platon.pulsar.common.options.LoadOptions
- getAuthToken() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getAuthToken() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- getAuthToken() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getAutoClose() - function in ai.platon.pulsar.common.metrics.EnumCounterReporter
- getAutoClose() - function in ai.platon.pulsar.crawl.AbstractCrawler
- getAutoClose() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getAutoClose() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMonitor
- getAvailableMemory() - function in ai.platon.pulsar.common.metrics.AppMetrics.Companion
- getAverageRecentTimeCost() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getAverageRecentTps() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getAverageTime() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getAverageTps() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getBackground() - function in ai.platon.pulsar.common.options.LoadOptions
- getBadSeeds() - function in ai.platon.pulsar.common.collect.FatLinkExtractor.Companion.Counters
- getBaseHref() - function in ai.platon.pulsar.crawl.parse.html.HTMLMetaTags
- Sets the
baseHref. - getBaseURLFromTag(Node) - function in ai.platon.pulsar.crawl.parse.html.PrimerParser
- If Node contains a BASE tag then it's HREF is returned.
- getBatchId() - function in ai.platon.pulsar.common.options.FetchOptions
- getBatchId() - function in ai.platon.pulsar.common.options.GenerateOptions
- getBatchId() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getBatchSize() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getBatchSize() - function in ai.platon.pulsar.common.metrics.AppMetrics
- getBatchSize() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getBatchStat() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getBatchTaskId() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getBbsUrls() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getBean(KClass) - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getBean() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getBean(KClass) - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getBean() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getBean(KClass) - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getBean() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getBean(KClass) - function in ai.platon.pulsar.context.support.StaticPulsarContext
- getBean() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- getBeanOrNull(KClass) - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getBeanOrNull() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getBeanOrNull(KClass) - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getBeanOrNull() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getBeanOrNull(KClass) - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getBeanOrNull() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getBeanOrNull(KClass) - function in ai.platon.pulsar.context.support.StaticPulsarContext
- getBeanOrNull() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- getBeforeComputeExpressions() - function in ai.platon.pulsar.crawl.ExpressionSimulateEventHandler
- getBlockFilter() - function in ai.platon.pulsar.crawl.filter.CrawlFilter
- getBlogUrls() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getBoost() - function in ai.platon.pulsar.common.domain.DomainSuffix
- getBoost() - function in ai.platon.pulsar.common.domain.TopLevelDomain
- getBrowser() - function in ai.platon.pulsar.common.options.LoadOptionDefaults
- The default browser
- getBrowser() - function in ai.platon.pulsar.common.options.LoadOptions
- TODO: session scope browser choice is not support by now
- getBrowserInstance() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getBrowserInstance() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getBrowserInstanceId() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getBrowserInstanceId() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getBrowserType() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getBrowserType() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getBrowserType() - function in ai.platon.pulsar.crawl.fetch.privacy.BrowserInstanceId
- getBytesPerPage() - function in ai.platon.pulsar.crawl.fetch.BatchStat
- getBytesPerSecond() - function in ai.platon.pulsar.crawl.fetch.BatchStat
- getBytesThoRate() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler.Status
- getCACHE() - function in ai.platon.pulsar.crawl.protocol.RobotRulesParser.Companion
- getCapacity() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- The cache capacity, we assume that all items in the file are loaded into the cache
- getCapacity() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- The cache capacity, we assume that all items in the file are loaded into the cache
- getCapacity() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- The cache capacity, we assume that all items in the file are loaded into the cache
- getCapacity() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getCapacity() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getCapacity() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getCause() - function in ai.platon.pulsar.common.IllegalBusinessPreconditionException
- getCause() - function in ai.platon.pulsar.common.IllegalApplicationContextStateException
- getCause() - function in ai.platon.pulsar.crawl.protocol.http.BlockedException
- getCause() - function in ai.platon.pulsar.crawl.protocol.http.HttpException
- getCause() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextException
- getCause() - function in ai.platon.pulsar.crawl.fetch.privacy.FatalPrivacyContextException
- getCause() - function in ai.platon.pulsar.crawl.filter.UrlFilterException
- getCause() - function in ai.platon.pulsar.crawl.index.IndexingException
- getCause() - function in ai.platon.pulsar.crawl.parse.ParseException
- getCause() - function in ai.platon.pulsar.crawl.parse.ParserNotFound
- getCause() - function in ai.platon.pulsar.crawl.protocol.ProtocolException
- getCause() - function in ai.platon.pulsar.crawl.protocol.ProtocolNotFound
- getCause() - function in ai.platon.pulsar.crawl.scoring.ScoringFilterException
- getChildren() - function in ai.platon.pulsar.crawl.parse.AbstractParseFilter
- getChildren() - function in ai.platon.pulsar.crawl.parse.EmptyParseFilter
- getChildren() - function in ai.platon.pulsar.crawl.parse.ParseFilter
- getClassName(String) - function in ai.platon.pulsar.crawl.parse.ParserConfig
- getClickTarget() - function in ai.platon.pulsar.common.options.LoadOptions
- getClosed() - function in ai.platon.pulsar.crawl.AbstractCrawler
- getClosed() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getClosed() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getCloseSelector() - function in ai.platon.pulsar.crawl.event.CloseMaskLayerHandler
- getClues() - function in ai.platon.pulsar.common.EncodingDetector
- getCluesAsString() - function in ai.platon.pulsar.common.EncodingDetector
- getCMD_SPLIT_PATTERN() - function in ai.platon.pulsar.common.options.PulsarOptions.Companion
- getCodes() - function in ai.platon.pulsar.crawl.common.FetchState
- getCollectCount() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getCollectCount() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getCollectCount() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getCollectCount() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getCollectCount() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getCollectCount() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getCollected() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector.Companion.Counters
- getCollectedCount() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getCollectedCount() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getCollectedCount() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getCollectedCount() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getCollectedCount() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getCollectedCount() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getCollectionOptions() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getCollector() - function in ai.platon.pulsar.common.collect.PriorityDataCollectorFormatter
- getCollectors() - function in ai.platon.pulsar.common.collect.PriorityDataCollectorsTableFormatter
- getCollectors() - function in ai.platon.pulsar.crawl.CrawlLoop
- getCollectors() - function in ai.platon.pulsar.crawl.AbstractCrawlLoop
- getCollectors() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- getCollects() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector.Companion.Counters
- getCollectTime() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getCollectTime() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getCollectTime() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getCollectTime() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getCollectTime() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getCollectTime() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getConf() - function in ai.platon.pulsar.crawl.protocol.http.AbstractNativeHttpProtocol
- getConf() - function in ai.platon.pulsar.common.config.AbstractNativeHttpProtocol
- getConf() - function in ai.platon.pulsar.crawl.protocol.http.AbstractHttpProtocol
- getConf() - function in ai.platon.pulsar.crawl.protocol.HttpRobotRulesParser
- Get the Configuration object
- getConf() - function in ai.platon.pulsar.common.message.MiscMessageWriter
- getConf() - function in ai.platon.pulsar.common.options.GenerateOptions
- getConf() - function in ai.platon.pulsar.common.options.LoadOptions
- getConf() - function in ai.platon.pulsar.crawl.common.GlobalCache
- getConf() - function in ai.platon.pulsar.crawl.component.GenerateComponent
- getConf() - function in ai.platon.pulsar.crawl.component.InjectComponent
- getConf() - function in ai.platon.pulsar.crawl.component.ParseComponent
- getConf() - function in ai.platon.pulsar.crawl.component.UpdateComponent
- getConf() - function in ai.platon.pulsar.crawl.fetch.indexer.JITIndexer
- getConf() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextIdGeneratorFactory
- getConf() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getConf() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyManager
- getConf() - function in ai.platon.pulsar.crawl.filter.CrawlFilter
- getConf() - function in ai.platon.pulsar.crawl.filter.CrawlFilters
- getConf() - function in ai.platon.pulsar.crawl.filter.CrawlUrlFilters
- getConf() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers
- getConf() - function in ai.platon.pulsar.crawl.index.IndexWriters
- getConf() - function in ai.platon.pulsar.crawl.index.IndexingFilter
- getConf() - function in ai.platon.pulsar.crawl.index.IndexingFilters
- getConf() - function in ai.platon.pulsar.crawl.parse.LinkFilter
- getConf() - function in ai.platon.pulsar.crawl.parse.PageParser
- getConf() - function in ai.platon.pulsar.crawl.parse.ParseFilters
- getConf() - function in ai.platon.pulsar.crawl.parse.html.PrimerParser
- getConf() - function in ai.platon.pulsar.common.config.Protocol
- getConf() - function in ai.platon.pulsar.crawl.protocol.RobotRulesParser
- Get the Configuration object
- getConf() - function in ai.platon.pulsar.crawl.schedule.AbstractFetchSchedule
- getConf() - function in ai.platon.pulsar.crawl.schedule.AdaptiveFetchSchedule
- getConf() - function in ai.platon.pulsar.crawl.schedule.DefaultFetchSchedule
- getConf() - function in ai.platon.pulsar.crawl.schedule.NewsFetchSchedule
- getConf() - function in ai.platon.pulsar.crawl.scoring.ScoringFilters
- getConfig() - function in ai.platon.pulsar.crawl.CrawlLoop
- getConfig() - function in ai.platon.pulsar.crawl.AbstractCrawlLoop
- getConfig() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- getConfiguration() - function in ai.platon.pulsar.common.ReducerContext
- getConfiguredUrl() - function in ai.platon.pulsar.common.urls.NormUrl
- getConfiguredUrl() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getConfiguredUrl() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getConfiguredUrl() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getConfiguredUrl() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- getConfiguredUrl() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getContains() - function in ai.platon.pulsar.crawl.filter.TextFilter
- getContainsAny() - function in ai.platon.pulsar.crawl.filter.TextFilter
- getContainsNone() - function in ai.platon.pulsar.crawl.filter.TextFilter
- getCONTENT_SCORE() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getContentPersists() - function in ai.platon.pulsar.crawl.CoreMetrics
- getContentType() - function in ai.platon.pulsar.crawl.parse.ParserNotFound
- getContext() - function in ai.platon.pulsar.context.ContextAware
- getContext() - function in ai.platon.pulsar.session.AbstractPulsarSession
- The pulsar context
- getContext() - function in ai.platon.pulsar.session.BasicPulsarSession
- The pulsar context
- getContext() - function in ai.platon.pulsar.session.PulsarSession
- The pulsar context
- getContextClassLoader() - function in java.lang.IndexThread
- getContextDir() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextId
- getContextDir() - function in ai.platon.pulsar.crawl.fetch.privacy.BrowserInstanceId
- getContextLeaks() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMetrics
- getContexts() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMetrics
- getCookies() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getCookies() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getCoreMetrics() - function in ai.platon.pulsar.common.AppStatusTracker
- getCoreMetrics() - function in ai.platon.pulsar.crawl.component.BatchFetchComponent
- getCoreMetrics() - function in ai.platon.pulsar.crawl.component.FetchComponent
- getCoreMetrics() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getCorrectedOutLinkSelector() - function in ai.platon.pulsar.common.options.LoadOptions
- getCounter() - function in ai.platon.pulsar.common.metrics.MultiMetric
- getCounterReportInterval() - function in ai.platon.pulsar.common.metrics.AppMetrics
- getCounters() - function in ai.platon.pulsar.common.collect.FatLinkExtractor
- getCounters() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getCounters() - function in com.codahale.metrics.AppMetricRegistry
- getCounters(MetricFilter) - function in com.codahale.metrics.AppMetricRegistry
- getCountry() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getCountry() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getCountry() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getCountry() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getCountry() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getCountry() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getCountry() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getCountry() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getCountry() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getCountry() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- Required website country
- getCountry() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getCountryName() - function in ai.platon.pulsar.common.domain.TopLevelDomain
- Returns the country name if TLD is Country Code TLD
- getCRAWL_FILTER_RULES() - function in ai.platon.pulsar.crawl.filter.CrawlFilters.Companion
- getCrawlDelay() - function in ai.platon.pulsar.crawl.protocol.RobotRules
- Get Crawl-Delay, in milliseconds.
- getCrawler() - function in ai.platon.pulsar.crawl.CrawlLoop
- getCrawler() - function in ai.platon.pulsar.crawl.AbstractCrawlLoop
- getCrawler() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- getCrawlEventHandler() - function in ai.platon.pulsar.crawl.PulsarEventHandler
- getCrawlEventHandler() - function in ai.platon.pulsar.crawl.AbstractPulsarEventHandler
- getCrawlEventHandler() - function in ai.platon.pulsar.crawl.DefaultPulsarEventHandler
- getCrawlEventHandler() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- getCrawlEventHandler() - function in ai.platon.pulsar.crawl.StreamingCrawler
- The crawl event handler
- getCrawlFilters() - function in ai.platon.pulsar.crawl.component.GenerateComponent
- getCrawlFilters() - function in ai.platon.pulsar.crawl.component.ParseComponent
- getCrawlFilters() - function in ai.platon.pulsar.crawl.filter.CrawlFilters
- getCrawlFilters() - function in ai.platon.pulsar.crawl.parse.PageParser
- getCrawlId() - function in ai.platon.pulsar.common.options.FetchOptions
- getCrawlId() - function in ai.platon.pulsar.common.options.GenerateOptions
- getCrawlId() - function in ai.platon.pulsar.common.options.InjectOptions
- getCrawlLoops() - function in ai.platon.pulsar.context.PulsarContext
- getCrawlLoops() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getCrawlLoops() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getCrawlLoops() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getCrawlLoops() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- The main loop
- getCrawlPool() - function in ai.platon.pulsar.context.PulsarContext
- getCrawlPool() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getCrawlPool() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getCrawlPool() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getCrawlPool() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- getCREATE_TIME() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getCreatedAt() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getCreatedAt() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- getCreatedAt() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getCreateTime() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getCreateTime() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getCreateTime() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getCreateTime() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getCreateTime() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getCreateTime() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getCreateTime() - function in ai.platon.pulsar.crawl.fetch.driver.NavigateEntry
- getCrid() - function in ai.platon.pulsar.common.sites.amazon.AmazonSuggestion
- getCssRules() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getCssRules() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getCssSelector() - function in ai.platon.pulsar.common.collect.HyperlinkExtractor
- getCsvReportInterval() - function in ai.platon.pulsar.common.metrics.AppMetrics
- getCurrentKey() - function in ai.platon.pulsar.common.ReducerContext
- getCurrentValue() - function in ai.platon.pulsar.common.ReducerContext
- getDailyCounter() - function in ai.platon.pulsar.common.metrics.MultiMetric
- getDailyCounters() - function in ai.platon.pulsar.common.metrics.AppMetricRegistry
- getDbGetCount() - function in ai.platon.pulsar.crawl.component.LoadComponent.Companion
- getDEAD_URLS_PAGE() - function in ai.platon.pulsar.crawl.fetch.LazyFetchTaskManager.Companion
- getDeadTime() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getDeadTime() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getDeadTime() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getDeadTime() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getDeadTime() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getDeadTime() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getDeadTime() - function in ai.platon.pulsar.common.options.LoadOptions
- getDeadTime() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getDeadTime() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getDeadTime() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getDeadTime() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- getDeadTime() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getDeadUrls() - function in ai.platon.pulsar.crawl.CoreMetrics
- getDEFAULT() - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry.Companion
- getDEFAULT() - function in ai.platon.pulsar.common.options.LinkOptions.Companion
- getDefault() - function in ai.platon.pulsar.common.options.LoadOptions.Companion
- getDEFAULT() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions.Companion
- getDEFAULT() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextId.Companion
- getDEFAULT() - function in ai.platon.pulsar.crawl.fetch.privacy.BrowserInstanceId.Companion
- getDEFAULT_DELIMETER() - function in ai.platon.pulsar.common.options.PulsarOptions.Companion
- getDEFAULT_DIR() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext.Companion
- getDEFAULT_FINGERPRINT() - function in ai.platon.pulsar.crawl.fetch.FetchTask.Companion
- getDEFAULT_MINE_TYPE() - function in ai.platon.pulsar.crawl.parse.ParserFactory.Companion
- getDEFAULT_SCORE_ENTRIES() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getDEFAULT_SEED_ARGS() - function in ai.platon.pulsar.common.options.LinkOptions.Companion
- getDEFAULT_SEED_OPTIONS() - function in ai.platon.pulsar.common.options.LinkOptions.Companion
- getDefaultArgs() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getDefaultArgsMap() - function in ai.platon.pulsar.common.options.LoadOptions.Companion
- getDefaultBatchId() - function in ai.platon.pulsar.common.options.FetchOptions.Companion
- getDefaultCharEncoding() - function in ai.platon.pulsar.common.EncodingDetector
- getDefaultMetricRegistry() - function in ai.platon.pulsar.common.metrics.AppMetrics.Companion
- getDefaultOptions() - function in ai.platon.pulsar.crawl.CrawlLoop
- getDefaultOptions() - function in ai.platon.pulsar.crawl.AbstractCrawlLoop
- Data collector lower capacity
- getDefaultOptions() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- Data collector lower capacity
- getDefaultOptions() - function in ai.platon.pulsar.crawl.StreamingCrawler
- The default load options
- getDefaultParams() - function in ai.platon.pulsar.common.options.LoadOptions.Companion
- getDelayCache() - function in ai.platon.pulsar.common.collect.LoadingDelayQueue
- getDelayPolicy() - function in ai.platon.pulsar.common.sites.amazon.AmazonSearcherJsEventHandler
- getDelayPolicy() - function in ai.platon.pulsar.crawl.AbstractWebDriverHandler
- getDelayPolicy() - function in ai.platon.pulsar.crawl.AbstractWebPageWebDriverHandler
- getDelayPolicy() - function in ai.platon.pulsar.crawl.EmptyWebDriverHandler
- getDelayPolicy() - function in ai.platon.pulsar.crawl.AbstractSimulateEventHandler
- getDelayPolicy() - function in ai.platon.pulsar.crawl.WebPageWebDriverHandlerPipeline
- getDelayPolicy() - function in ai.platon.pulsar.crawl.ExpressionSimulateEventHandler
- getDelayPolicy() - function in ai.platon.pulsar.crawl.DefaultSimulateEventHandler
- getDelayPolicy() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getDelayPolicy() - function in ai.platon.pulsar.crawl.event.CloseMaskLayerHandler
- getDelayPolicy() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getDelayPolicy() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getDELIMITER() - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getDepth() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getDepth() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getDepth() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getDetail() - function in ai.platon.pulsar.common.urls.NormUrl
- getDETAIL_PAGE_URL_PATTERNS() - function in ai.platon.pulsar.crawl.filter.CrawlFilter.Companion
- getDetailUrls() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getDigits() - function in ai.platon.pulsar.common.NamedScoreVector
- getDimension() - function in ai.platon.pulsar.common.NamedScoreVector
- getDisabledMetricAttributes() - function in com.codahale.metrics.CodahaleSlf4jReporter
- getDisallow() - function in ai.platon.pulsar.crawl.filter.BlockFilter
- getDisplay() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextId
- getDisplay() - function in ai.platon.pulsar.crawl.fetch.privacy.BrowserInstanceId
- getDisplay() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getDisplay() - function in ai.platon.pulsar.session.AbstractPulsarSession
- getDisplay() - function in ai.platon.pulsar.session.BasicPulsarSession
- getDisplay() - function in ai.platon.pulsar.session.PulsarSession
- getDISTANCE() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getDistrict() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getDistrict() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getDistrict() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getDistrict() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getDistrict() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getDistrict() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getDistrict() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getDistrict() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getDistrict() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getDistrict() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- Required website district
- getDistrict() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getDO_NOT_FETCH() - function in ai.platon.pulsar.crawl.common.FetchState
- getDoc() - function in ai.platon.pulsar.crawl.index.io.IndexDocumentWritable
- getDocument() - function in ai.platon.pulsar.common.collect.HyperlinkExtractor
- getDocument() - function in ai.platon.pulsar.common.collect.RegexHyperlinkExtractor
- getDocument() - function in ai.platon.pulsar.crawl.parse.ParseResult
- getDocument() - function in ai.platon.pulsar.crawl.parse.html.JsoupExtractor
- getDocument() - function in ai.platon.pulsar.crawl.parse.html.JsoupParser
- getDocument() - function in ai.platon.pulsar.crawl.parse.html.ParseContext
- getDocumentCache() - function in ai.platon.pulsar.crawl.common.GlobalCache
- The global document cache, a document will be removed if it's expired or the cache is full
- getDocumentCache() - function in ai.platon.pulsar.crawl.component.LoadComponent
- getDocumentCache() - function in ai.platon.pulsar.session.AbstractPulsarSession
- getDocumentCache() - function in ai.platon.pulsar.session.BasicPulsarSession
- getDocumentCache() - function in ai.platon.pulsar.session.PulsarSession
- getDocumentCacheHits() - function in ai.platon.pulsar.session.AbstractPulsarSession.Companion
- getDomain() - function in ai.platon.pulsar.common.domain.DomainSuffix
- getDomain() - function in ai.platon.pulsar.common.domain.TopLevelDomain
- getDomain() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getDomainName(URL) - function in ai.platon.pulsar.crawl.common.URLUtil
- Returns the domain name of the url.
- getDomainName(String) - function in ai.platon.pulsar.crawl.common.URLUtil
- Returns the domain name of the url.
- getDomainName(String,String) - function in ai.platon.pulsar.crawl.common.URLUtil
- getDomainSuffix(URL) - function in ai.platon.pulsar.crawl.common.URLUtil
- Returns the DomainSuffix corresponding to the last public part of the hostname
- getDomainSuffix(DomainSuffixes,URL) - function in ai.platon.pulsar.crawl.common.URLUtil
- getDomainSuffix(DomainSuffixes,String) - function in ai.platon.pulsar.crawl.common.URLUtil
- Returns the DomainSuffix corresponding to the last public part of the hostname
- getDomStatistics() - function in ai.platon.pulsar.crawl.parse.ParseResult
- getDrops() - function in ai.platon.pulsar.crawl.StreamingCrawlerMetrics
- getDurationUnit() - function in com.codahale.metrics.CodahaleSlf4jReporter
- getElapsedTime() - function in ai.platon.pulsar.common.metrics.AppMetrics.Companion
- getElapsedTime() - function in ai.platon.pulsar.crawl.CoreMetrics
- getElapsedTime() - function in ai.platon.pulsar.crawl.fetch.BatchStat
- getElapsedTime() - function in ai.platon.pulsar.crawl.fetch.LazyFetchTaskManager
- getElapsedTime() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getEMPTY_RULES() - function in ai.platon.pulsar.crawl.protocol.RobotRulesParser.Companion
- A BaseRobotRules object appropriate for use when the
robots.txtfile is empty or missing; all requests are allowed. - getEndKey() - function in ai.platon.pulsar.crawl.filter.CrawlFilter
- getEntries() - function in ai.platon.pulsar.common.NamedScoreVector
- getEnumCounterRegistry() - function in ai.platon.pulsar.common.metrics.AppMetricRegistry
- getEnumCounters() - function in ai.platon.pulsar.common.metrics.AppMetricRegistry
- getEstimatedExternalSize() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getEstimatedExternalSize() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getEstimatedExternalSize() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getEstimatedExternalSize() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getEstimatedExternalSize() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getEstimatedExternalSize() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getEstimatedSize() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getEstimatedSize() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getEstimatedSize() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getEstimatedSize() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getEstimatedSize() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getEstimatedSize() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getEventHandler() - function in ai.platon.pulsar.common.options.LoadOptions
- getEventHandler() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getEventHandler() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getEventHandler() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getEventHandler() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getException() - function in ai.platon.pulsar.crawl.fetch.FetchResult
- getExecutor() - function in ai.platon.pulsar.common.metrics.EnumCounterReporter
- getExecutor() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMonitor
- getExpandAtSign() - function in ai.platon.pulsar.common.options.CommonOptions
- getExpandAtSign() - function in ai.platon.pulsar.common.options.FetchOptions
- getExpandAtSign() - function in ai.platon.pulsar.common.options.GenerateOptions
- getExpandAtSign() - function in ai.platon.pulsar.common.options.InjectOptions
- getExpandAtSign() - function in ai.platon.pulsar.common.options.LinkOptions
- getExpandAtSign() - function in ai.platon.pulsar.common.options.LoadOptions
- getExpandAtSign() - function in ai.platon.pulsar.common.options.PulsarOptions
- getExpandAtSign() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getExpandAtSign() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getExpandAtSign() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getExpireAt() - function in ai.platon.pulsar.common.options.LoadOptionDefaults
- The default time to expire
- getExpireAt() - function in ai.platon.pulsar.common.options.LoadOptions
- The page is expired if the current time expireAt
- getEXPIRED() - function in ai.platon.pulsar.crawl.common.FetchState
- getExpiredLinks() - function in ai.platon.pulsar.common.collect.FatLinkExtractor.Companion.Counters
- getExpires() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getExpires() - function in ai.platon.pulsar.common.options.LoadOptionDefaults
- The default expire time, some time we may need expire all pages by default, for example, in test mode
- getExpires() - function in ai.platon.pulsar.common.options.LoadOptions
- Web page expiry time The term "expires" usually be used for a expiry time, for example, http-equiv, or in cookie specification, guess it means "expires at"The expires field supports both ISO-8601 standard and hadoop time duration format ISO-8601 standard : PnDTnHnMn.nS Hadoop time duration format : Valid units are : ns, us, ms, s, m, h, d.
- getExpireTime() - function in ai.platon.pulsar.crawl.protocol.RobotRules
- Get expire time
- getExternalSize() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getExternalSize() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getExternalSize() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getExternalSize() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getExternalSize() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getExternalSize() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getFAILED_URLS_PAGE() - function in ai.platon.pulsar.crawl.fetch.LazyFetchTaskManager.Companion
- getFailedHosts() - function in ai.platon.pulsar.crawl.CoreMetrics
- getFailedUrls() - function in ai.platon.pulsar.crawl.CoreMetrics
- getFatLink() - function in ai.platon.pulsar.common.collect.PageFatLink
- getFatLinks() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- Track the status of this batch, we need a notice when the batch is finished
- getFatLinks() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- Track the status of this batch, we need a notice when the batch is finished
- getFatLinks() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- Track the status of this batch, we need a notice when the batch is finished
- getFetchComponent() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getFetchComponent() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getFetchComponent() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getFetchComponent() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- The fetch component
- getFetchComponent() - function in ai.platon.pulsar.crawl.component.LoadComponent
- getFetchConcurrency() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getFetchingCache() - function in ai.platon.pulsar.crawl.common.GlobalCache
- The fetching cache, an url is added to the cache before fetching and removed from it after fetching
- getFetchInterval() - function in ai.platon.pulsar.common.options.LoadOptions
- The page is expired if the current time expireAt
- getFetchInterval() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getFetchLinks() - function in ai.platon.pulsar.common.collect.FatLinkExtractor.Companion.Counters
- getFetchMode() - function in ai.platon.pulsar.common.options.FetchOptions
- getFetchMode() - function in ai.platon.pulsar.common.options.LoadOptions
- getFetchPriority() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getFetchSchedule() - function in ai.platon.pulsar.crawl.component.GenerateComponent
- getFetchSchedule() - function in ai.platon.pulsar.crawl.component.UpdateComponent
- getFetchSuccesses() - function in ai.platon.pulsar.crawl.StreamingCrawlerMetrics
- getFetchSuccesses() - function in ai.platon.pulsar.crawl.AmazonMetrics.Companion
- getFetchTaskTimeout() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getFetchTime() - function in ai.platon.pulsar.crawl.schedule.ModifyInfo
- The actual latest fetch time, WebPage.
- getField(CharSequence) - function in ai.platon.pulsar.crawl.index.IndexDocument
- getFieldNames() - function in ai.platon.pulsar.crawl.index.IndexDocument
- getFields() - function in ai.platon.pulsar.crawl.index.IndexDocument
- getFieldValue(CharSequence) - function in ai.platon.pulsar.crawl.index.IndexDocument
- getFieldValueAsString(CharSequence) - function in ai.platon.pulsar.crawl.index.IndexDocument
- getFieldValues(CharSequence) - function in ai.platon.pulsar.crawl.index.IndexDocument
- getFileName() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getFileName() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getFileName() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getFilterReport() - function in ai.platon.pulsar.crawl.parse.LinkFilter
- getFilters() - function in ai.platon.pulsar.crawl.CrawlLoops.Companion
- getFilters() - function in ai.platon.pulsar.crawl.parse.FilterResult
- getFingerprint() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getFingerprint() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextId
- getFingerprint() - function in ai.platon.pulsar.crawl.fetch.privacy.BrowserInstanceId
- getFinishedTasks() - function in ai.platon.pulsar.crawl.CoreMetrics
- getFinishedTasksPerSecond() - function in ai.platon.pulsar.crawl.CoreMetrics
- getFinishes() - function in ai.platon.pulsar.crawl.StreamingCrawlerMetrics
- getFinishes() - function in ai.platon.pulsar.crawl.AmazonMetrics.Companion
- getFinishes() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMetrics
- getFinishTime() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getFinishTimes() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getFirst() - function in ai.platon.pulsar.crawl.fetch.batch.TaskSchedulers
- getFirstCollectTime() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getFirstCollectTime() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getFirstCollectTime() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getFirstCollectTime() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getFirstCollectTime() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getFirstCollectTime() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getFlowStatus() - function in ai.platon.pulsar.crawl.parse.FilterResult
- getFlowStatus() - function in ai.platon.pulsar.crawl.parse.ParseResult
- getFORBID_ALL_RULES() - function in ai.platon.pulsar.crawl.protocol.RobotRulesParser.Companion
- A BaseRobotRules object appropriate for use when the
robots.txtfile is not fetched due to a403/Forbiddenresponse; all requests are disallowed. - getFreeMemory() - function in ai.platon.pulsar.common.metrics.AppMetrics.Companion
- getFreeMemoryGiB() - function in ai.platon.pulsar.common.metrics.AppMetrics.Companion
- getFreeSpace() - function in ai.platon.pulsar.common.metrics.AppMetrics.Companion
- getFreshLinks() - function in ai.platon.pulsar.common.collect.FatLinkExtractor.Companion.Counters
- getGauges() - function in com.codahale.metrics.AppMetricRegistry
- getGauges(MetricFilter) - function in com.codahale.metrics.AppMetricRegistry
- getGeneralTags() - function in ai.platon.pulsar.crawl.parse.html.HTMLMetaTags
- Returns all collected values of the general meta tags.
- getGenerator() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextIdGeneratorFactory
- getGlobalCache() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- getGlobalCache() - function in ai.platon.pulsar.crawl.common.GlobalCacheFactory
- getGlobalCache() - function in ai.platon.pulsar.crawl.component.BatchFetchComponent
- getGlobalCache() - function in ai.platon.pulsar.crawl.component.LoadComponent
- getGlobalCacheFactory() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getGlobalCacheFactory() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getGlobalCacheFactory() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getGlobalCacheFactory() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- The global cache
- getGlobalCacheFactory() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- The global cache
- getGlobalCacheFactory() - function in ai.platon.pulsar.crawl.StreamingCrawler
- A optional global cache which will hold the retry tasks
- getGlobalCacheFactory() - function in ai.platon.pulsar.crawl.component.BatchFetchComponent
- getGlobalCacheFactory() - function in ai.platon.pulsar.crawl.component.LoadComponent
- getGlobalCacheFactory() - function in ai.platon.pulsar.crawl.component.ParseComponent
- getGlobalCacheFactory() - function in ai.platon.pulsar.session.AbstractPulsarSession
- getGlobalCacheFactory() - function in ai.platon.pulsar.session.BasicPulsarSession
- getGlobalCacheFactory() - function in ai.platon.pulsar.session.PulsarSession
- getGlobalCounters() - function in ai.platon.pulsar.common.collect.FatLinkExtractor.Companion
- getGlobalCounters() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector.Companion
- getGlobalMetrics() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext.Companion
- getGone() - function in ai.platon.pulsar.crawl.StreamingCrawlerMetrics
- getGOOD_CONTENT_TEXT_LENGTH() - function in ai.platon.pulsar.crawl.signature.TextProfileSignature.Companion
- getGraphiteReportInterval() - function in ai.platon.pulsar.common.metrics.AppMetrics
- getGraphiteServer() - function in ai.platon.pulsar.common.metrics.AppMetrics
- getGroup(T) - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getGroup(Class) - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getGroup(Class) - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry.Companion
- getGroup(T) - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry.Companion
- getGroupMode() - function in ai.platon.pulsar.crawl.CoreMetrics
- getGroupMode() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- Host group mode : can be by ip, by host or by domain
- getHardRedirect() - function in ai.platon.pulsar.common.options.LoadOptions
- getHeader(String) - function in ai.platon.pulsar.crawl.protocol.ForwardingResponse
- The value of a named header.
- getHeader(String) - function in ai.platon.pulsar.crawl.protocol.Response
- The value of a named header.
- getHeaders() - function in ai.platon.pulsar.crawl.protocol.ForwardingResponse
- getHeaders() - function in ai.platon.pulsar.crawl.protocol.ProtocolOutput
- getHeaders() - function in ai.platon.pulsar.crawl.protocol.Response
- getHelpList() - function in ai.platon.pulsar.common.options.LoadOptions.Companion
- getHistogramContentBytes() - function in ai.platon.pulsar.crawl.CoreMetrics
- getHistograms() - function in com.codahale.metrics.AppMetricRegistry
- getHistograms(MetricFilter) - function in com.codahale.metrics.AppMetricRegistry
- getHistoryUrl() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getHost(String) - function in ai.platon.pulsar.crawl.common.URLUtil
- getHost(String,URLUtil.GroupMode) - function in ai.platon.pulsar.crawl.common.URLUtil
- getHost(String,String,URLUtil.GroupMode) - function in ai.platon.pulsar.crawl.common.URLUtil
- getHost(URL,String,URLUtil.GroupMode) - function in ai.platon.pulsar.crawl.common.URLUtil
- getHost(URL,URLUtil.GroupMode) - function in ai.platon.pulsar.crawl.common.URLUtil
- getHost() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask
- getHost() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask.Key
- getHost() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getHost() - function in ai.platon.pulsar.crawl.fetch.batch.data.PoolId
- getHostBatches(URL) - function in ai.platon.pulsar.crawl.common.URLUtil
- Partitions of the hostname of the url by ".
- getHostBatches(String) - function in ai.platon.pulsar.crawl.common.URLUtil
- Partitions of the hostname of the url by ".
- getHostName(String) - function in ai.platon.pulsar.crawl.common.URLUtil
- Returns the lowercased hostname for the url or null if the url is not well formed.
- getHostName(String,String) - function in ai.platon.pulsar.crawl.common.URLUtil
- getHostName() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getHourlyCounter() - function in ai.platon.pulsar.common.metrics.MultiMetric
- getHourlyCounters() - function in ai.platon.pulsar.common.metrics.AppMetricRegistry
- getHref() - function in ai.platon.pulsar.common.urls.NormUrl
- getHref() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getHref() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getHref() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getHref() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- The hypertext reference, It defines the address of the document, which this time is linked from
- getHref() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getHref() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getHrefSpec() - function in ai.platon.pulsar.common.urls.NormUrl
- getHttpCode() - function in ai.platon.pulsar.crawl.protocol.ForwardingResponse
- getHttpCode() - function in ai.platon.pulsar.crawl.protocol.Response
- getHttpEquivTags() - function in ai.platon.pulsar.crawl.parse.html.HTMLMetaTags
- Returns all collected values of the "http-equiv" meta tags.
- getHypeLinks() - function in ai.platon.pulsar.crawl.parse.ParseResult
- getHyperlinks() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getHyperlinks() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getHyperlinks() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getId() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getId() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getId() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getId() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getId() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getId() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getId() - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getId() - function in ai.platon.pulsar.context.PulsarContext
- getId() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getId() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getId() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getId() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- getId() - function in ai.platon.pulsar.crawl.CrawlLoop
- getId() - function in ai.platon.pulsar.crawl.AbstractCrawlLoop
- getId() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- getId() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getId() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getId() - function in ai.platon.pulsar.crawl.fetch.batch.FetchLoop
- getId() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getId() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getId() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getId() - function in ai.platon.pulsar.crawl.fetch.driver.BrowserInstance
- getId() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractBrowserInstance
- getId() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getId() - function in java.lang.IndexThread
- getId() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- The data directory for this context, very context has it's own data directory
- getId() - function in ai.platon.pulsar.crawl.parse.AbstractParseFilter
- getId() - function in ai.platon.pulsar.crawl.parse.EmptyParseFilter
- getId() - function in ai.platon.pulsar.crawl.parse.ParseFilter
- getId() - function in ai.platon.pulsar.session.AbstractPulsarSession
- The session id.
- getId() - function in ai.platon.pulsar.session.BasicPulsarSession
- The session id.
- getId() - function in ai.platon.pulsar.session.PulsarSession
- The session id.
- getID_CAPACITY() - function in ai.platon.pulsar.session.AbstractPulsarSession.Companion
- getID_END() - function in ai.platon.pulsar.session.AbstractPulsarSession.Companion
- getID_START() - function in ai.platon.pulsar.session.AbstractPulsarSession.Companion
- getIdent() - function in ai.platon.pulsar.crawl.AmazonMetrics
- getIdent() - function in ai.platon.pulsar.crawl.AmazonDiagnosis
- getIdent() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextId
- getIdent() - function in ai.platon.pulsar.crawl.fetch.privacy.BrowserInstanceId
- getIDENT_PREFIX() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext.Companion
- getIdGen() - function in ai.platon.pulsar.crawl.AbstractCrawlLoop.Companion
- getIdleTime() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getIdleTime() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getIdleTimeout() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getIdleTimeout() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractBrowserInstance
- getIdleTimeout() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getIdleTimeout() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getIframe() - function in ai.platon.pulsar.common.options.LoadOptions
- getIgnoredPageCount() - function in ai.platon.pulsar.crawl.fetch.indexer.JITIndexer
- getIgnoreFailure() - function in ai.platon.pulsar.common.options.LoadOptionDefaults
- Retry or not if a page is gone
- getIgnoreFailure() - function in ai.platon.pulsar.common.options.LoadOptions
- Force retry fetching the page if it's failed last time, or it's marked as gone
- getIgnoreQuery() - function in ai.platon.pulsar.common.options.LoadOptions
- getIgnoreUrlQuery() - function in ai.platon.pulsar.common.options.LoadOptions
- getIllegalState() - function in ai.platon.pulsar.crawl.fetch.batch.FetchLoop.Companion
- getImmutableConfig() - function in ai.platon.pulsar.crawl.common.GlobalCacheFactory
- getImmutableConfig() - function in ai.platon.pulsar.crawl.component.BatchFetchComponent
- getImmutableConfig() - function in ai.platon.pulsar.crawl.component.FetchComponent
- getImmutableConfig() - function in ai.platon.pulsar.crawl.component.LoadComponent
- getImmutableConfig() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getIncognito() - function in ai.platon.pulsar.common.options.LoadOptions
- getIndex(Enum) - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- Get counter indexSearch over small vector is very fast, even faster than small tree.
- getIndex() - function in ai.platon.pulsar.common.options.FetchOptions
- getINDEX_PAGE_URL_PATTERNS() - function in ai.platon.pulsar.crawl.filter.CrawlFilter.Companion
- The follow patterns are simple rule to indicate a url's category, this is a very simple solution, and the result is not accurate
- getIndexed() - function in ai.platon.pulsar.crawl.index.IndexerMapping.MappingField
- getIndexedPageCount() - function in ai.platon.pulsar.crawl.fetch.indexer.JITIndexer
- getIndexerCollection() - function in ai.platon.pulsar.common.options.FetchOptions
- getIndexerUrl() - function in ai.platon.pulsar.common.options.FetchOptions
- getIndexerUrl() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getIndexingFilters() - function in ai.platon.pulsar.crawl.component.IndexComponent
- getIndexingFilters() - function in ai.platon.pulsar.crawl.index.IndexingFilters
- getIndexJIT() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getIndexServerHost() - function in ai.platon.pulsar.crawl.fetch.indexer.JITIndexer
- Index server
- getIndexServerPort() - function in ai.platon.pulsar.crawl.fetch.indexer.JITIndexer
- getIndexThreadCount() - function in ai.platon.pulsar.crawl.fetch.indexer.JITIndexer
- getIndexUnchecked(Integer,Enum) - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getIndexUrls() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getIndexWriters() - function in ai.platon.pulsar.crawl.component.IndexComponent
- getIndexWriters() - function in ai.platon.pulsar.crawl.index.IndexWriters
- getInitialDelay() - function in ai.platon.pulsar.common.metrics.AppMetrics
- getInitialDelay() - function in ai.platon.pulsar.common.metrics.EnumCounterReporter
- getInitialDelay() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMonitor
- getInitNetworkIFsRecvBytes() - function in ai.platon.pulsar.crawl.CoreMetrics
- The total all bytes received by the hardware at the application startup
- getInjectComponent() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getInjectComponent() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getInjectComponent() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getInjectComponent() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- The injection component
- getInstance() - function in ai.platon.pulsar.common.domain.DomainSuffixes
- Singleton instance, lazy instantination
- getInstanceSequencer() - function in ai.platon.pulsar.context.support.AbstractPulsarContext.Companion
- getInstanceSequencer() - function in ai.platon.pulsar.crawl.fetch.FetchTask.Companion
- getInstanceSequencer() - function in ai.platon.pulsar.crawl.fetch.batch.FetchLoop.Companion
- getInstanceSequencer() - function in ai.platon.pulsar.crawl.parse.AbstractParseFilter.Companion
- getIsActive() - function in ai.platon.pulsar.crawl.protocol.http.AbstractHttpProtocol
- getIsActive() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getIsActive() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getIsActive() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getIsActive() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- getIsActive() - function in ai.platon.pulsar.crawl.AbstractCrawler
- getIsActive() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getIsActive() - function in ai.platon.pulsar.crawl.component.BatchFetchComponent
- getIsActive() - function in ai.platon.pulsar.crawl.component.FetchComponent
- getIsActive() - function in ai.platon.pulsar.crawl.fetch.batch.FeedLoop
- getIsActive() - function in ai.platon.pulsar.crawl.fetch.batch.FetchMonitor
- getIsActive() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getIsActive() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getIsActive() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyManager
- getIsActive() - function in ai.platon.pulsar.crawl.index.IndexWriter
- getIsActive() - function in ai.platon.pulsar.session.AbstractPulsarSession
- getIsActive() - function in ai.platon.pulsar.session.BasicPulsarSession
- getIsCanceled() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getIsCanceled() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getIsCanceled() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver.Status
- getIsCanceled() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getIsCrashed() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getIsCrashed() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver.Status
- getIsCrawlRetry() - function in ai.platon.pulsar.crawl.fetch.FetchResult
- getIsDead() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getIsDead() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getIsDead() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getIsDead() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getIsDead() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getIsDead() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getIsDefault() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextId
- getIsEmpty() - function in ai.platon.pulsar.common.urls.NormUrl
- getIsEmpty() - function in ai.platon.pulsar.crawl.EventHandlerPipeline
- getIsEmpty() - function in ai.platon.pulsar.crawl.VoidEventHandlerPipeline
- getIsEmpty() - function in ai.platon.pulsar.crawl.UrlAwareHandlerPipeline
- getIsEmpty() - function in ai.platon.pulsar.crawl.UrlAwareFilterPipeline
- getIsEmpty() - function in ai.platon.pulsar.crawl.UrlFilterPipeline
- getIsEmpty() - function in ai.platon.pulsar.crawl.UrlHandlerPipeline
- getIsEmpty() - function in ai.platon.pulsar.crawl.WebPageHandlerPipeline
- getIsEmpty() - function in ai.platon.pulsar.crawl.UrlAwareWebPageHandlerPipeline
- getIsEmpty() - function in ai.platon.pulsar.crawl.HtmlDocumentHandlerPipeline
- getIsEmpty() - function in ai.platon.pulsar.crawl.WebDriverHandlerPipeline
- getIsEnabled() - function in ai.platon.pulsar.crawl.fetch.indexer.JITIndexer
- getIsExpired() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getIsfb() - function in ai.platon.pulsar.common.sites.amazon.AmazonSuggestion
- getIsFinished() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getIsFree() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getIsFree() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver.Status
- getIsGood() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getIsGUI() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractBrowserInstance
- getIsHalted() - function in ai.platon.pulsar.crawl.fetch.indexer.IndexThread
- getIsHelp() - function in ai.platon.pulsar.common.options.CommonOptions
- getIsHelp() - function in ai.platon.pulsar.common.options.FetchOptions
- getIsHelp() - function in ai.platon.pulsar.common.options.GenerateOptions
- getIsHelp() - function in ai.platon.pulsar.common.options.InjectOptions
- getIsHelp() - function in ai.platon.pulsar.common.options.LinkOptions
- getIsHelp() - function in ai.platon.pulsar.common.options.LoadOptions
- getIsHelp() - function in ai.platon.pulsar.common.options.PulsarOptions
- getIsHelp() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getIsHelp() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getIsHelp() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getIsIdle() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getIsIdle() - function in ai.platon.pulsar.crawl.fetch.driver.BrowserInstance
- getIsIdle() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractBrowserInstance
- getIsIdle() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getIsIdle() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getIsInactive() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getIsLeaf() - function in ai.platon.pulsar.crawl.parse.AbstractParseFilter
- getIsLeaf() - function in ai.platon.pulsar.crawl.parse.EmptyParseFilter
- getIsLeaf() - function in ai.platon.pulsar.crawl.parse.ParseFilter
- getIsLeaked() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getIsMissionComplete() - function in ai.platon.pulsar.crawl.fetch.batch.FetchMonitor
- getIsMockedPageSource() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- Whether the web page source is mocked
- getIsMockedPageSource() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getIsNil() - function in ai.platon.pulsar.common.urls.NormUrl
- getIsNil() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getIsNil() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getIsNil() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getIsNil() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- getIsNil() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getIsNotEmpty() - function in ai.platon.pulsar.common.urls.NormUrl
- getIsNotEmpty() - function in ai.platon.pulsar.crawl.EventHandlerPipeline
- getIsNotEmpty() - function in ai.platon.pulsar.crawl.VoidEventHandlerPipeline
- getIsNotEmpty() - function in ai.platon.pulsar.crawl.UrlAwareHandlerPipeline
- getIsNotEmpty() - function in ai.platon.pulsar.crawl.UrlAwareFilterPipeline
- getIsNotEmpty() - function in ai.platon.pulsar.crawl.UrlFilterPipeline
- getIsNotEmpty() - function in ai.platon.pulsar.crawl.UrlHandlerPipeline
- getIsNotEmpty() - function in ai.platon.pulsar.crawl.WebPageHandlerPipeline
- getIsNotEmpty() - function in ai.platon.pulsar.crawl.UrlAwareWebPageHandlerPipeline
- getIsNotEmpty() - function in ai.platon.pulsar.crawl.HtmlDocumentHandlerPipeline
- getIsNotEmpty() - function in ai.platon.pulsar.crawl.WebDriverHandlerPipeline
- getIsNotNil() - function in ai.platon.pulsar.common.urls.NormUrl
- getIsNotWorking() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getIsOutOfWork() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getIsPersistable() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getIsPersistable() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getIsPersistable() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getIsPersistable() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- If this link is persistable
- getIsPersistable() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getIsPrivacyRetry() - function in ai.platon.pulsar.crawl.fetch.FetchResult
- getIsPrototype() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextId
- getIsQuit() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getIsQuit() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver.Status
- getIsQuit() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getIsResource() - function in ai.platon.pulsar.common.options.LoadOptions
- getIsRetired() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getIsRetired() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getIsRetired() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver.Status
- getIsRetired() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getIsRoot() - function in ai.platon.pulsar.crawl.parse.AbstractParseFilter
- getIsRoot() - function in ai.platon.pulsar.crawl.parse.EmptyParseFilter
- getIsRoot() - function in ai.platon.pulsar.crawl.parse.ParseFilter
- getIsRunning() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- getIsSlow() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getIsSmall() - function in ai.platon.pulsar.crawl.fetch.FetchResult
- getIsStarted() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getIsStarted() - function in ai.platon.pulsar.crawl.CrawlLoops
- getIsSuccess() - function in ai.platon.pulsar.crawl.fetch.FetchResult
- getIsWorking() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getIsWorking() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getIsWorking() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver.Status
- getItem() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getItemBrowser() - function in ai.platon.pulsar.common.options.LoadOptions
- TODO: session scope browser choice is not support by now
- getItemExpireAt() - function in ai.platon.pulsar.common.options.LoadOptions
- Web page expire time
- getItemExpires() - function in ai.platon.pulsar.common.options.LoadOptions
- getItemId() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask
- getItemId() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask.Key
- getItemPageLoadTimeout() - function in ai.platon.pulsar.common.options.LoadOptions
- getItemRequireAnchors() - function in ai.platon.pulsar.common.options.LoadOptions
- getItemRequireImages() - function in ai.platon.pulsar.common.options.LoadOptions
- getItemRequireNotBlank() - function in ai.platon.pulsar.common.options.LoadOptions
- getItemRequireSize() - function in ai.platon.pulsar.common.options.LoadOptions
- getItemScriptTimeout() - function in ai.platon.pulsar.common.options.LoadOptions
- getItemScrollCount() - function in ai.platon.pulsar.common.options.LoadOptions
- Note: if scroll too many times, the page may fail to calculate the vision information
- getItemScrollInterval() - function in ai.platon.pulsar.common.options.LoadOptions
- getJitIndexer() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getJobId() - function in ai.platon.pulsar.common.ReducerContext
- getJobID() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask.Key
- getJobName() - function in ai.platon.pulsar.common.ReducerContext
- getJobName() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getJobName() - function in ai.platon.pulsar.crawl.fetch.batch.FetchMonitor
- Initialize in setup using job conf
- getKey() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask
- getKey() - function in ai.platon.pulsar.crawl.index.IndexDocument
- getKeyMap() - function in ai.platon.pulsar.crawl.index.IndexerMapping
- getKeyword() - function in ai.platon.pulsar.common.sites.amazon.AmazonSuggestion
- getKeyword() - function in ai.platon.pulsar.common.sites.amazon.AmazonSearcherJsEventHandler
- getKeywords() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getLabel() - function in ai.platon.pulsar.common.options.LoadOptions
- getLabel() - function in ai.platon.pulsar.common.persist.ext.WebPageExKt
- getLabel() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getLabel() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getLabel() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getLabel() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- getLabel() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getLabeledHypeLinks() - function in ai.platon.pulsar.crawl.parse.ParseResult.Companion
- getLabelOfPortal(String) - function in ai.platon.pulsar.crawl.AmazonDiagnosis.Companion
- getLabelOfPortalOrNull(String) - function in ai.platon.pulsar.crawl.AmazonDiagnosis.Companion
- getLabels() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getLabels() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getLabels() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getLabels() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getLabels() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getLabels() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getLang() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getLang() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getLang() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getLang() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getLang() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getLang() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getLang() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getLang() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getLang() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getLang() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- Required website language
- getLang() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getLastActiveTime() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getLastActiveTime() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getLastActiveTime() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getLastCollectedTime() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getLastCollectedTime() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getLastCollectedTime() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getLastCollectedTime() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getLastCollectedTime() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getLastCollectedTime() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getLastFailedLinks() - function in ai.platon.pulsar.common.collect.FatLinkExtractor.Companion.Counters
- getLastTaskFinishTime() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getLastTaskStartTime() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getLauncherOptions() - function in ai.platon.pulsar.crawl.fetch.driver.BrowserInstance
- getLauncherOptions() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractBrowserInstance
- getLaunchOptions() - function in ai.platon.pulsar.crawl.fetch.driver.BrowserInstance
- getLaunchOptions() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractBrowserInstance
- getLAZY_FETCH_URLS_PAGE_BASE() - function in ai.platon.pulsar.crawl.fetch.LazyFetchTaskManager.Companion
- getLazyFlush() - function in ai.platon.pulsar.common.options.LoadOptionDefaults
- getLazyFlush() - function in ai.platon.pulsar.common.options.LoadOptions
- getLazyTasks(FetchMode) - function in ai.platon.pulsar.crawl.fetch.LazyFetchTaskManager
- getLeakWarnings() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMetrics
- getLength() - function in ai.platon.pulsar.crawl.protocol.ForwardingResponse
- getLength() - function in ai.platon.pulsar.crawl.protocol.ProtocolOutput
- getLength() - function in ai.platon.pulsar.crawl.protocol.Response
- getLimit() - function in ai.platon.pulsar.common.options.FetchOptions
- getLimit() - function in ai.platon.pulsar.common.options.GenerateOptions
- getLimit() - function in ai.platon.pulsar.common.options.InjectOptions
- getLinkFilter() - function in ai.platon.pulsar.crawl.parse.PageParser
- getLinkOptions() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getLoadArgs() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getLoadArgs() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getLoadArgs() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getLoadComponent() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getLoadComponent() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getLoadComponent() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getLoadComponent() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- The load component
- getLoadedSeeds() - function in ai.platon.pulsar.common.collect.FatLinkExtractor.Companion.Counters
- getLoadEventHandler() - function in ai.platon.pulsar.common.persist.ext.WebPageExKt
- getLoadEventHandler() - function in ai.platon.pulsar.crawl.PulsarEventHandler
- getLoadEventHandler() - function in ai.platon.pulsar.crawl.AbstractPulsarEventHandler
- getLoadEventHandler() - function in ai.platon.pulsar.crawl.DefaultPulsarEventHandler
- getLocalizedMessage() - function in kotlin.IllegalBusinessPreconditionException
- getLocalizedMessage() - function in kotlin.IllegalApplicationContextStateException
- getLocalizedMessage() - function in kotlin.BlockedException
- getLocalizedMessage() - function in kotlin.HttpException
- getLocalizedMessage() - function in kotlin.PrivacyContextException
- getLocalizedMessage() - function in kotlin.FatalPrivacyContextException
- getLocalizedMessage() - function in kotlin.UrlFilterException
- getLocalizedMessage() - function in kotlin.IndexingException
- getLocalizedMessage() - function in kotlin.ParseException
- getLocalizedMessage() - function in kotlin.ParserNotFound
- getLocalizedMessage() - function in kotlin.ProtocolException
- getLocalizedMessage() - function in kotlin.ProtocolNotFound
- getLocalizedMessage() - function in kotlin.ScoringFilterException
- getLOG() - function in ai.platon.pulsar.crawl.protocol.http.HttpRobotRulesParser.Companion
- getLOG() - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getLOG() - function in ai.platon.pulsar.crawl.component.GenerateComponent
- getLOG() - function in ai.platon.pulsar.crawl.component.IndexComponent.Companion
- getLOG() - function in ai.platon.pulsar.crawl.component.InjectComponent.Companion
- getLOG() - function in ai.platon.pulsar.crawl.component.UpdateComponent
- getLOG() - function in ai.platon.pulsar.crawl.component.WebDbComponent.Companion
- getLOG() - function in ai.platon.pulsar.crawl.fetch.batch.TaskSchedulers.Companion
- getLOG() - function in ai.platon.pulsar.crawl.fetch.batch.data.PoolQueue.Companion
- getLOG() - function in ai.platon.pulsar.crawl.fetch.indexer.JITIndexer.Companion
- getLOG() - function in ai.platon.pulsar.crawl.filter.CrawlFilter.Companion
- getLOG() - function in ai.platon.pulsar.crawl.filter.CrawlFilters.Companion
- getLOG() - function in ai.platon.pulsar.crawl.filter.CrawlUrlFilters
- getLOG() - function in ai.platon.pulsar.crawl.index.IndexWriter.Companion
- getLOG() - function in ai.platon.pulsar.crawl.index.IndexWriters
- getLOG() - function in ai.platon.pulsar.crawl.index.IndexerMapping.Companion
- getLOG() - function in ai.platon.pulsar.crawl.index.IndexingFilter.Companion
- getLOG() - function in ai.platon.pulsar.crawl.index.IndexingFilters.Companion
- getLOG() - function in ai.platon.pulsar.crawl.inject.SeedBuilder.Companion
- getLOG() - function in ai.platon.pulsar.crawl.parse.LinkFilter.Companion
- getLOG() - function in ai.platon.pulsar.crawl.parse.PageParser.Companion
- getLOG() - function in ai.platon.pulsar.crawl.parse.Parser.Companion
- getLOG() - function in ai.platon.pulsar.crawl.parse.ParserConfigReader.Companion
- getLOG() - function in ai.platon.pulsar.crawl.parse.ParserFactory.Companion
- getLOG() - function in ai.platon.pulsar.crawl.parse.html.JsoupExtractor.Companion
- getLOG() - function in ai.platon.pulsar.crawl.parse.html.JsoupParser
- getLog() - function in ai.platon.pulsar.crawl.protocol.Protocol.Companion
- getLOG() - function in ai.platon.pulsar.crawl.protocol.RobotRulesParser.Companion
- getLOG() - function in ai.platon.pulsar.crawl.scoring.ScoringFilter.Companion
- getLoginUrl() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getLogLevel() - function in ai.platon.pulsar.common.options.LinkOptions
- getLoops() - function in ai.platon.pulsar.crawl.CrawlLoops
- getLoss() - function in ai.platon.pulsar.crawl.parse.html.OpenMapFields
- getMajorCode() - function in ai.platon.pulsar.persist.FilterResult
- getMajorCode() - function in ai.platon.pulsar.persist.ParseResult
- getMajorName() - function in ai.platon.pulsar.persist.FilterResult
- getMajorName() - function in ai.platon.pulsar.persist.ParseResult
- getMap() - function in ai.platon.pulsar.crawl.parse.html.OpenMapFields
- getMappedName() - function in ai.platon.pulsar.crawl.index.IndexerMapping.MappingField
- getMas() - function in ai.platon.pulsar.crawl.AmazonMetrics
- getMAX_COUNTERS() - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getMAX_COUNTERS_IN_GROUP() - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getMAX_GROUPS() - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getMaxAnchorLength() - function in ai.platon.pulsar.common.options.LinkOptions
- getMaxFetchInterval() - function in ai.platon.pulsar.crawl.schedule.AbstractFetchSchedule
- getMaxFetchInterval() - function in ai.platon.pulsar.crawl.schedule.AdaptiveFetchSchedule
- getMaxFetchInterval() - function in ai.platon.pulsar.crawl.schedule.DefaultFetchSchedule
- getMaxFetchInterval() - function in ai.platon.pulsar.crawl.schedule.FetchSchedule
- getMaxFetchInterval() - function in ai.platon.pulsar.crawl.schedule.NewsFetchSchedule
- getMaxHostFailureEvents() - function in ai.platon.pulsar.crawl.CoreMetrics
- getMaximumWarnings() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getMaxReversedKeyRange() - function in ai.platon.pulsar.crawl.filter.CrawlFilters
- getMaxUrlLength() - function in ai.platon.pulsar.common.options.LinkOptions
- getMEDIA_PAGE_URL_PATTERN() - function in ai.platon.pulsar.crawl.filter.CrawlFilter.Companion
- getMEDIA_URL_SUFFIXES() - function in ai.platon.pulsar.crawl.filter.CrawlFilter.Companion
- TODO : use suffix-urlfilter instead
- getMediaUrls() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getMemoryInfo() - function in ai.platon.pulsar.common.metrics.AppMetrics.Companion
- getMessage() - function in ai.platon.pulsar.common.IllegalBusinessPreconditionException
- getMessage() - function in ai.platon.pulsar.common.IllegalApplicationContextStateException
- getMessage() - function in ai.platon.pulsar.crawl.protocol.http.BlockedException
- getMessage() - function in ai.platon.pulsar.crawl.protocol.http.HttpException
- getMessage() - function in ai.platon.pulsar.crawl.CriticalWarning
- getMessage() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextException
- getMessage() - function in ai.platon.pulsar.crawl.fetch.privacy.FatalPrivacyContextException
- getMessage() - function in ai.platon.pulsar.crawl.filter.UrlFilterException
- getMessage() - function in ai.platon.pulsar.crawl.index.IndexingException
- getMessage() - function in ai.platon.pulsar.crawl.parse.ParseException
- getMessage() - function in ai.platon.pulsar.crawl.parse.ParserNotFound
- getMessage() - function in ai.platon.pulsar.crawl.protocol.ProtocolException
- getMessage() - function in ai.platon.pulsar.crawl.protocol.ProtocolNotFound
- getMessage() - function in ai.platon.pulsar.crawl.scoring.ScoringFilterException
- getMessageWriter() - function in ai.platon.pulsar.common.AppStatusTracker
- getMessageWriter() - function in ai.platon.pulsar.crawl.component.GenerateComponent
- getMessageWriter() - function in ai.platon.pulsar.crawl.component.UpdateComponent
- getMessageWriter() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getMessageWriter() - function in ai.platon.pulsar.crawl.parse.PageParser
- getMessageWriter() - function in ai.platon.pulsar.crawl.schedule.AbstractFetchSchedule
- getMessageWriter() - function in ai.platon.pulsar.crawl.schedule.AdaptiveFetchSchedule
- getMessageWriter() - function in ai.platon.pulsar.crawl.schedule.DefaultFetchSchedule
- getMessageWriter() - function in ai.platon.pulsar.crawl.schedule.NewsFetchSchedule
- getMetadata(Node) - function in ai.platon.pulsar.crawl.parse.html.PrimerParser
- getMeter() - function in ai.platon.pulsar.common.metrics.MultiMetric
- getMeterContentBytes() - function in ai.platon.pulsar.crawl.CoreMetrics
- getMeterContentMBytes() - function in ai.platon.pulsar.crawl.CoreMetrics
- getMeterFinishes() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getMeters() - function in com.codahale.metrics.AppMetricRegistry
- getMeters(MetricFilter) - function in com.codahale.metrics.AppMetricRegistry
- getMeterSmallPages() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getMeterSuccesses() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getMeterTasks() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getMeterTotalNetworkIFsRecvMBytes() - function in ai.platon.pulsar.crawl.CoreMetrics
- getMetrics() - function in ai.platon.pulsar.common.AppStatusTracker
- getMetrics() - function in com.codahale.metrics.AppMetricRegistry
- getMimeType(String) - function in ai.platon.pulsar.common.MimeTypeResolver
- Facade interface to Tika's underlying getMimeType method.
- getMimeType(File) - function in ai.platon.pulsar.common.MimeTypeResolver
- Facade interface to Tika's underlying getMimeType method.
- getMinAnchorLength() - function in ai.platon.pulsar.common.options.LinkOptions
- getMinConfidence() - function in ai.platon.pulsar.common.EncodingDetector
- getMinimumThroughput() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getMinorCode() - function in ai.platon.pulsar.persist.FilterResult
- getMinorCode() - function in ai.platon.pulsar.persist.ParseResult
- getMinorLeakWarnings() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMetrics
- getMinorName() - function in ai.platon.pulsar.persist.FilterResult
- getMinorName() - function in ai.platon.pulsar.persist.ParseResult
- getMinorWarningFactor() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getMinUrlLength() - function in ai.platon.pulsar.common.options.LinkOptions
- getMISS_FIELD() - function in ai.platon.pulsar.crawl.common.FetchState
- getModified() - function in ai.platon.pulsar.crawl.schedule.ModifyInfo
- getModifiedAt() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getModifiedAt() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- getModifiedAt() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getModifiedOptions() - function in ai.platon.pulsar.common.options.LoadOptions
- getModifiedParams() - function in ai.platon.pulsar.common.options.LoadOptions
- getModifiedTime() - function in ai.platon.pulsar.crawl.schedule.ModifyInfo
- getMODIFY_TIME() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getMovedUrls() - function in ai.platon.pulsar.crawl.CoreMetrics
- getMultiValued() - function in ai.platon.pulsar.crawl.index.IndexerMapping.MappingField
- getMWishedF() - function in ai.platon.pulsar.crawl.AmazonMetrics
- getName() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- The collector name
- getName() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- The collector name
- getName() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- The collector name
- getName() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getName() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getName() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getName() - function in ai.platon.pulsar.common.metrics.AppMetrics
- getName() - function in ai.platon.pulsar.common.metrics.CodahaleSlf4jReporter.LoggingLevel
- getName() - function in ai.platon.pulsar.common.metrics.CommonCounter
- getName(Enum) - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getName() - function in ai.platon.pulsar.common.options.ItemExtractor
- getName() - function in ai.platon.pulsar.common.options.Condition
- getName() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getName() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getName() - function in ai.platon.pulsar.common.sites.amazon.AmazonSearcherJsEventHandler
- getName() - function in ai.platon.pulsar.crawl.CrawlLoop
- getName() - function in ai.platon.pulsar.crawl.AbstractCrawlLoop
- getName() - function in ai.platon.pulsar.crawl.EventHandler
- getName() - function in ai.platon.pulsar.crawl.AbstractEventHandler
- getName() - function in ai.platon.pulsar.crawl.VoidEventHandler
- getName() - function in ai.platon.pulsar.crawl.UrlAwareHandler
- getName() - function in ai.platon.pulsar.crawl.UrlAwareFilter
- getName() - function in ai.platon.pulsar.crawl.UrlHandler
- getName() - function in ai.platon.pulsar.crawl.UrlFilter
- getName() - function in ai.platon.pulsar.crawl.WebPageHandler
- getName() - function in ai.platon.pulsar.crawl.UrlAwareWebPageHandler
- getName() - function in ai.platon.pulsar.crawl.HtmlDocumentHandler
- getName() - function in ai.platon.pulsar.crawl.FetchResultHandler
- getName() - function in ai.platon.pulsar.crawl.WebPageBatchHandler
- getName() - function in ai.platon.pulsar.crawl.FetchResultBatchHandler
- getName() - function in ai.platon.pulsar.crawl.PrivacyContextHandler
- getName() - function in ai.platon.pulsar.crawl.WebDriverHandler
- getName() - function in ai.platon.pulsar.crawl.WebPageWebDriverHandler
- getName() - function in ai.platon.pulsar.crawl.VoidEventHandlerPipeline
- getName() - function in ai.platon.pulsar.crawl.UrlAwareHandlerPipeline
- getName() - function in ai.platon.pulsar.crawl.UrlAwareFilterPipeline
- getName() - function in ai.platon.pulsar.crawl.UrlFilterPipeline
- getName() - function in ai.platon.pulsar.crawl.UrlHandlerPipeline
- getName() - function in ai.platon.pulsar.crawl.WebPageHandlerPipeline
- getName() - function in ai.platon.pulsar.crawl.UrlAwareWebPageHandlerPipeline
- getName() - function in ai.platon.pulsar.crawl.HtmlDocumentHandlerPipeline
- getName() - function in ai.platon.pulsar.crawl.WebDriverHandlerPipeline
- getName() - function in ai.platon.pulsar.crawl.AbstractWebDriverHandler
- getName() - function in ai.platon.pulsar.crawl.AbstractWebPageWebDriverHandler
- getName() - function in ai.platon.pulsar.crawl.EmptyWebDriverHandler
- getName() - function in ai.platon.pulsar.crawl.WebPageWebDriverHandlerPipeline
- getName() - function in ai.platon.pulsar.crawl.AddRefererAfterFetchHandler
- getName() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- getName() - function in ai.platon.pulsar.crawl.CriticalWarning
- getName() - function in ai.platon.pulsar.crawl.StreamingCrawler
- The crawl name
- getName() - function in ai.platon.pulsar.crawl.common.URLUtil.GroupMode
- getName() - function in ai.platon.pulsar.crawl.component.GenerateComponent.Companion.Counter
- getName() - function in ai.platon.pulsar.crawl.component.UpdateComponent.Companion.Counter
- getName() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getName() - function in ai.platon.pulsar.crawl.event.CloseMaskLayerHandler
- getName() - function in ai.platon.pulsar.crawl.fetch.FetchTask.State
- getName() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool.Status
- getName() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getName() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler.Companion.Counter
- getName() - function in ai.platon.pulsar.crawl.fetch.batch.TaskSchedulers
- getName() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getName() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver.Status
- getName() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getName() - function in java.lang.IndexThread
- getName() - function in ai.platon.pulsar.crawl.index.IndexWriter
- getName() - function in ai.platon.pulsar.crawl.index.IndexerMapping.MappingField
- getName() - function in ai.platon.pulsar.persist.FilterResult
- getName() - function in ai.platon.pulsar.crawl.parse.PageParser.Counter
- getName() - function in ai.platon.pulsar.persist.ParseResult
- getName() - function in ai.platon.pulsar.crawl.parse.html.OpenMapFields
- getName() - function in ai.platon.pulsar.crawl.scoring.Name
- getNames() - function in com.codahale.metrics.AppMetricRegistry
- getNavigateHistory() - function in ai.platon.pulsar.crawl.fetch.driver.BrowserInstance
- getNavigateHistory() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractBrowserInstance
- getNetCondition() - function in ai.platon.pulsar.common.options.LoadOptions
- getNetworkIFsRecvBytes() - function in ai.platon.pulsar.crawl.CoreMetrics
- getNetworkIFsRecvBytesPerPage() - function in ai.platon.pulsar.crawl.CoreMetrics
- getNetworkIFsRecvBytesPerSecond() - function in ai.platon.pulsar.crawl.CoreMetrics
- getNEW_PAGE() - function in ai.platon.pulsar.crawl.common.FetchState
- getNextPageSelector() - function in ai.platon.pulsar.common.options.LoadOptions
- getNIL() - function in ai.platon.pulsar.common.urls.NormUrl.Companion
- getNIL() - function in ai.platon.pulsar.crawl.fetch.FetchTask.Companion
- getNJitRetry() - function in ai.platon.pulsar.common.options.LoadOptionDefaults
- The are several cases to enable jit retry For example, in test environment
- getNJitRetry() - function in ai.platon.pulsar.common.options.LoadOptions
- getNMaxRetry() - function in ai.platon.pulsar.common.options.LoadOptions
- getNMaxRetry() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getNMaxRetry() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getNMaxRetry() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getNMaxRetry() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- The maximum retry times
- getNMaxRetry() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getNO_CONTENT() - function in ai.platon.pulsar.crawl.common.FetchState
- getNoCache() - function in ai.platon.pulsar.crawl.parse.html.HTMLMetaTags
- A convenience method.
- getNoFilter() - function in ai.platon.pulsar.common.options.GenerateOptions
- getNoFilter() - function in ai.platon.pulsar.common.options.LoadOptions
- getNoFollow() - function in ai.platon.pulsar.crawl.parse.html.HTMLMetaTags
- A convenience method.
- getNoIndex() - function in ai.platon.pulsar.crawl.parse.html.HTMLMetaTags
- A convenience method.
- getNoNorm() - function in ai.platon.pulsar.common.options.LoadOptions
- getNoNormalizer() - function in ai.platon.pulsar.common.options.GenerateOptions
- getNoProxy() - function in ai.platon.pulsar.crawl.StreamingCrawler
- Do not use proxy
- getNoRedirect() - function in ai.platon.pulsar.common.options.LoadOptions
- getNormalizer() - function in ai.platon.pulsar.common.collect.FatLinkExtractor
- getNormalizer() - function in ai.platon.pulsar.common.collect.HyperlinkExtractor
- getNotContains() - function in ai.platon.pulsar.crawl.filter.TextFilter
- getNow(T) - function in java.util.concurrent.CompletableHyperlink
- getNow(T) - function in java.util.concurrent.CompletableListenableHyperlink
- getNRelease() - function in ai.platon.pulsar.crawl.AmazonMetrics
- getNRetries() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getNumberOfDependents() - function in java.util.concurrent.CompletableHyperlink
- getNumberOfDependents() - function in java.util.concurrent.CompletableListenableHyperlink
- getNumFinishedTasks() - function in ai.platon.pulsar.crawl.fetch.batch.TaskMonitor
- getNumHtmlParsed() - function in ai.platon.pulsar.crawl.parse.html.PrimerHtmlParser.Companion
- getNumHtmlParses() - function in ai.platon.pulsar.crawl.parse.html.PrimerHtmlParser.Companion
- getNumJsoupParsed() - function in ai.platon.pulsar.crawl.parse.html.JsoupExtractor.Companion
- getNumJsoupParsed() - function in ai.platon.pulsar.crawl.parse.html.JsoupParser.Companion
- getNumJsoupParses() - function in ai.platon.pulsar.crawl.parse.html.JsoupExtractor.Companion
- getNumJsoupParses() - function in ai.platon.pulsar.crawl.parse.html.JsoupParser.Companion
- getNumMaxActiveTabs() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getNumParsed() - function in ai.platon.pulsar.crawl.component.ParseComponent.Companion
- getNumParses() - function in ai.platon.pulsar.crawl.component.ParseComponent.Companion
- getNumPendingTasks() - function in ai.platon.pulsar.crawl.fetch.batch.TaskMonitor
- getNumPendingTasks() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getNumPoolThreads() - function in ai.platon.pulsar.common.options.FetchOptions
- getNumPrivacyContexts() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getNumReadyTasks() - function in ai.platon.pulsar.crawl.fetch.batch.TaskMonitor
- Task counters
- getNumReadyTasks() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getNumReduceTasks() - function in ai.platon.pulsar.common.options.FetchOptions
- Note: in browser mode, reducer tasks/fetch threads/pool threads are not used use browser instances/active browser tabs instead
- getNumRunningTasks() - function in ai.platon.pulsar.crawl.fetch.batch.FetchLoop.Companion
- getNumRunningTasks() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getNumSlowTasks() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getNumTasks() - function in ai.platon.pulsar.crawl.fetch.batch.TaskMonitor
- getNumTasksSuccess() - function in ai.platon.pulsar.crawl.fetch.BatchStat
- getNumTotalFinishedTasks() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getOnAfterBrowserLaunch() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnAfterBrowserLaunch() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnAfterBrowserLaunch() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnAfterCheckDOMState() - function in ai.platon.pulsar.crawl.SimulateEventHandler
- getOnAfterCheckDOMState() - function in ai.platon.pulsar.crawl.AbstractSimulateEventHandler
- getOnAfterCheckDOMState() - function in ai.platon.pulsar.crawl.ExpressionSimulateEventHandler
- getOnAfterCheckDOMState() - function in ai.platon.pulsar.crawl.DefaultSimulateEventHandler
- getOnAfterComputeFeature() - function in ai.platon.pulsar.crawl.SimulateEventHandler
- getOnAfterComputeFeature() - function in ai.platon.pulsar.crawl.AbstractSimulateEventHandler
- getOnAfterComputeFeature() - function in ai.platon.pulsar.crawl.ExpressionSimulateEventHandler
- getOnAfterComputeFeature() - function in ai.platon.pulsar.crawl.DefaultSimulateEventHandler
- getOnAfterExtract() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnAfterExtract() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnAfterExtract() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnAfterFetch() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnAfterFetch() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnAfterFetch() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnAfterHtmlParse() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnAfterHtmlParse() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnAfterHtmlParse() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnAfterLoad() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnAfterLoad() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnAfterLoad() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnAfterLoad() - function in ai.platon.pulsar.crawl.CrawlEventHandler
- getOnAfterLoad() - function in ai.platon.pulsar.crawl.AbstractCrawlEventHandler
- getOnAfterLoad() - function in ai.platon.pulsar.crawl.DefaultCrawlEventHandler
- getOnAfterParse() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnAfterParse() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnAfterParse() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnBeforeBrowserLaunch() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnBeforeBrowserLaunch() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnBeforeBrowserLaunch() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnBeforeCheckDOMState() - function in ai.platon.pulsar.crawl.SimulateEventHandler
- getOnBeforeCheckDOMState() - function in ai.platon.pulsar.crawl.AbstractSimulateEventHandler
- getOnBeforeCheckDOMState() - function in ai.platon.pulsar.crawl.ExpressionSimulateEventHandler
- getOnBeforeCheckDOMState() - function in ai.platon.pulsar.crawl.DefaultSimulateEventHandler
- getOnBeforeComputeFeature() - function in ai.platon.pulsar.crawl.SimulateEventHandler
- getOnBeforeComputeFeature() - function in ai.platon.pulsar.crawl.AbstractSimulateEventHandler
- getOnBeforeComputeFeature() - function in ai.platon.pulsar.crawl.ExpressionSimulateEventHandler
- getOnBeforeComputeFeature() - function in ai.platon.pulsar.crawl.DefaultSimulateEventHandler
- getOnBeforeExtract() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnBeforeExtract() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnBeforeExtract() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnBeforeFetch() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnBeforeFetch() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnBeforeFetch() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnBeforeHtmlParse() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnBeforeHtmlParse() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnBeforeHtmlParse() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnBeforeLoad() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnBeforeLoad() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnBeforeLoad() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnBeforeLoad() - function in ai.platon.pulsar.crawl.CrawlEventHandler
- getOnBeforeLoad() - function in ai.platon.pulsar.crawl.AbstractCrawlEventHandler
- getOnBeforeLoad() - function in ai.platon.pulsar.crawl.DefaultCrawlEventHandler
- getOnBeforeParse() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnBeforeParse() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnBeforeParse() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getONE() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getOnFilter() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnFilter() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnFilter() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnFilter() - function in ai.platon.pulsar.crawl.CrawlEventHandler
- getOnFilter() - function in ai.platon.pulsar.crawl.AbstractCrawlEventHandler
- getOnFilter() - function in ai.platon.pulsar.crawl.DefaultCrawlEventHandler
- getOnLoad() - function in ai.platon.pulsar.crawl.CrawlEventHandler
- getOnLoad() - function in ai.platon.pulsar.crawl.AbstractCrawlEventHandler
- getOnLoad() - function in ai.platon.pulsar.crawl.DefaultCrawlEventHandler
- getOnNormalize() - function in ai.platon.pulsar.crawl.LoadEventHandler
- getOnNormalize() - function in ai.platon.pulsar.crawl.AbstractLoadEventHandler
- getOnNormalize() - function in ai.platon.pulsar.crawl.DefaultLoadEventHandler
- getOnNormalize() - function in ai.platon.pulsar.crawl.CrawlEventHandler
- getOnNormalize() - function in ai.platon.pulsar.crawl.AbstractCrawlEventHandler
- getOnNormalize() - function in ai.platon.pulsar.crawl.DefaultCrawlEventHandler
- getOnParse() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getOptionFields() - function in ai.platon.pulsar.common.options.LoadOptions.Companion
- getOptionFieldsMap() - function in ai.platon.pulsar.common.options.LoadOptions.Companion
- getOptionNames(String) - function in ai.platon.pulsar.common.options.LoadOptions.Companion
- getOptionNames() - function in ai.platon.pulsar.common.options.LoadOptions.Companion
- getOptions() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getOptions() - function in ai.platon.pulsar.common.persist.ext.WebPageExKt
- getOptions() - function in ai.platon.pulsar.common.urls.NormUrl
- getOptions() - function in ai.platon.pulsar.crawl.common.FetchEntry
- getOptions() - function in ai.platon.pulsar.crawl.fetch.batch.FetchMonitor
- Initialize in setup using job conf
- getOrder() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getOrder() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getOrder() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- The order of this hyperlink in it's referer page
- getOrder() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- The order of this hyperlink in it's referer page
- getOrdinal() - function in ai.platon.pulsar.common.metrics.CodahaleSlf4jReporter.LoggingLevel
- getOrdinal() - function in ai.platon.pulsar.common.metrics.CommonCounter
- getOrdinal() - function in ai.platon.pulsar.common.options.ItemExtractor
- getOrdinal() - function in ai.platon.pulsar.common.options.Condition
- getOrdinal() - function in ai.platon.pulsar.crawl.CriticalWarning
- getOrdinal() - function in ai.platon.pulsar.crawl.common.URLUtil.GroupMode
- getOrdinal() - function in ai.platon.pulsar.crawl.component.GenerateComponent.Companion.Counter
- getOrdinal() - function in ai.platon.pulsar.crawl.component.UpdateComponent.Companion.Counter
- getOrdinal() - function in ai.platon.pulsar.crawl.fetch.FetchTask.State
- getOrdinal() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool.Status
- getOrdinal() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler.Companion.Counter
- getOrdinal() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver.Status
- getOrdinal() - function in ai.platon.pulsar.crawl.parse.PageParser.Counter
- getOrdinal() - function in ai.platon.pulsar.crawl.scoring.Name
- getOrNull(String) - function in ai.platon.pulsar.context.PulsarContext
- getOrNull(String) - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- Get a webpage from the storage
- getOrNull(String) - function in ai.platon.pulsar.context.support.BasicPulsarContext
- Get a webpage from the storage
- getOrNull(String) - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- Get a webpage from the storage
- getOrNull(String) - function in ai.platon.pulsar.context.support.StaticPulsarContext
- Get a webpage from the storage
- getOrNull(String) - function in ai.platon.pulsar.session.AbstractPulsarSession
- Get a page from database if exists
- getOrNull(String) - function in ai.platon.pulsar.session.BasicPulsarSession
- Get a page from database if exists
- getOrNull(String) - function in ai.platon.pulsar.session.PulsarSession
- Get a page from database if exists
- getOutLinkPattern() - function in ai.platon.pulsar.common.options.LoadOptions
- getOutLinkSelector() - function in ai.platon.pulsar.common.options.LoadOptions
- Arrange links
- getOutLinkSelectorOrNull() - function in ai.platon.pulsar.common.options.LoadOptions
- getOutOfWorkTimeout() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getPage() - function in ai.platon.pulsar.common.collect.PageFatLink
- getPage() - function in ai.platon.pulsar.common.collect.HyperlinkExtractor
- getPage() - function in ai.platon.pulsar.common.collect.RegexHyperlinkExtractor
- getPage() - function in ai.platon.pulsar.common.message.PageFormatter
- getPage() - function in ai.platon.pulsar.crawl.common.FetchEntry
- getPage() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getPage() - function in ai.platon.pulsar.crawl.fetch.batch.IFetchEntry
- getPage() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask
- getPage() - function in ai.platon.pulsar.crawl.parse.html.ParseContext
- getPage() - function in ai.platon.pulsar.crawl.protocol.ForwardingResponse
- getPage() - function in ai.platon.pulsar.crawl.protocol.Response
- getPageAnchors() - function in ai.platon.pulsar.crawl.CoreMetrics
- getPageCache() - function in ai.platon.pulsar.crawl.common.GlobalCache
- The global page cache, a page will be removed if it's expired or the cache is full
- getPageCache() - function in ai.platon.pulsar.crawl.component.LoadComponent
- getPageCache() - function in ai.platon.pulsar.session.AbstractPulsarSession
- getPageCache() - function in ai.platon.pulsar.session.BasicPulsarSession
- getPageCache() - function in ai.platon.pulsar.session.PulsarSession
- getPageCacheHits() - function in ai.platon.pulsar.crawl.component.LoadComponent.Companion
- getPageCacheHits() - function in ai.platon.pulsar.session.AbstractPulsarSession.Companion
- getPageCategory(String) - function in ai.platon.pulsar.crawl.filter.CrawlFilter.Companion
- getPageDatum() - function in ai.platon.pulsar.crawl.protocol.ForwardingResponse
- getPageDatum() - function in ai.platon.pulsar.crawl.protocol.ProtocolOutput
- getPageDatum() - function in ai.platon.pulsar.crawl.protocol.Response
- getPageHeights() - function in ai.platon.pulsar.crawl.CoreMetrics
- getPageImages() - function in ai.platon.pulsar.crawl.CoreMetrics
- getPageLoadTimeout() - function in ai.platon.pulsar.common.options.LoadOptions
- getPageNumbers() - function in ai.platon.pulsar.crawl.CoreMetrics
- getPageParser() - function in ai.platon.pulsar.crawl.component.ParseComponent
- getPageParser() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getPages() - function in ai.platon.pulsar.common.message.LoadedPagesFormatter
- getPageSmallTexts() - function in ai.platon.pulsar.crawl.CoreMetrics
- getPagesPerSecond() - function in ai.platon.pulsar.crawl.fetch.BatchStat
- getPagesThroughputRate() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler.Status
- getPageText(TypeAliased(typeAlias=GenericTypeConstructor(dri=kotlin.text/StringBuilder///PointingToDeclaration/, projections=[], presentableName=null, extra=PropertyContainer(map={})), inner=GenericTypeConstructor(dri=java.lang/StringBuilder///PointingToDeclaration/, projections=[], presentableName=null, extra=PropertyContainer(map={}))),Node,Boolean) - function in ai.platon.pulsar.crawl.parse.html.PrimerParser
- This method takes a StringBuilder and a DOM Node, and will append all the content text found beneath the DOM node to the
StringBuilder. - getPageText(TypeAliased(typeAlias=GenericTypeConstructor(dri=kotlin.text/StringBuilder///PointingToDeclaration/, projections=[], presentableName=null, extra=PropertyContainer(map={})), inner=GenericTypeConstructor(dri=java.lang/StringBuilder///PointingToDeclaration/, projections=[], presentableName=null, extra=PropertyContainer(map={}))),Node) - function in ai.platon.pulsar.crawl.parse.html.PrimerParser
- This is a convinience method, equivalent to .
- getPageText(Node) - function in ai.platon.pulsar.crawl.parse.html.PrimerParser
- getPageTitle(Node) - function in ai.platon.pulsar.crawl.parse.html.PrimerParser
- getPageType() - function in ai.platon.pulsar.crawl.filter.CrawlFilter
- getPageViews() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getPARAM_INDEXER_MAPPING_FILE() - function in ai.platon.pulsar.crawl.index.IndexerMapping.Companion
- getParams() - function in ai.platon.pulsar.common.config.CommonOptions
- getParams() - function in ai.platon.pulsar.common.config.FetchOptions
- getParams() - function in ai.platon.pulsar.common.config.GenerateOptions
- getParams() - function in ai.platon.pulsar.common.options.InjectOptions
- getParams() - function in ai.platon.pulsar.common.options.LinkOptions
- getParams() - function in ai.platon.pulsar.common.options.LoadOptions
- getParams() - function in ai.platon.pulsar.common.config.PulsarOptions
- getParams() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getParams() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getParams() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getParams() - function in ai.platon.pulsar.crawl.CoreMetrics
- getParams() - function in ai.platon.pulsar.crawl.component.GenerateComponent
- getParams() - function in ai.platon.pulsar.crawl.component.InjectComponent
- getParams() - function in ai.platon.pulsar.crawl.component.UpdateComponent
- getParams() - function in ai.platon.pulsar.common.config.LazyFetchTaskManager
- getParams() - function in ai.platon.pulsar.crawl.fetch.batch.FeedLoop
- getParams() - function in ai.platon.pulsar.crawl.fetch.batch.FetchMonitor
- getParams() - function in ai.platon.pulsar.crawl.fetch.batch.TaskMonitor
- getParams() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getParams() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getParams() - function in ai.platon.pulsar.crawl.fetch.indexer.JITIndexer
- getParams() - function in ai.platon.pulsar.common.config.IndexWriter
- getParams() - function in ai.platon.pulsar.common.config.IndexingFilter
- getParams() - function in ai.platon.pulsar.crawl.inject.SeedBuilder
- getParams() - function in ai.platon.pulsar.common.config.AbstractParseFilter
- getParams() - function in ai.platon.pulsar.common.config.EmptyParseFilter
- getParams() - function in ai.platon.pulsar.crawl.parse.LinkFilter
- getParams() - function in ai.platon.pulsar.crawl.parse.PageParser
- getParams() - function in ai.platon.pulsar.common.config.ParseFilter
- getParams() - function in ai.platon.pulsar.common.config.Parser
- getParams() - function in ai.platon.pulsar.crawl.parse.html.PrimerHtmlParser
- getParams() - function in ai.platon.pulsar.crawl.schedule.AbstractFetchSchedule
- getParams() - function in ai.platon.pulsar.crawl.schedule.AdaptiveFetchSchedule
- getParams() - function in ai.platon.pulsar.crawl.schedule.DefaultFetchSchedule
- getParams() - function in ai.platon.pulsar.common.config.FetchSchedule
- getParams() - function in ai.platon.pulsar.crawl.schedule.NewsFetchSchedule
- getParams() - function in ai.platon.pulsar.common.config.ScoringFilter
- getParams() - function in ai.platon.pulsar.crawl.scoring.ScoringFilters
- getParent() - function in ai.platon.pulsar.crawl.parse.AbstractParseFilter
- getParent() - function in ai.platon.pulsar.crawl.parse.EmptyParseFilter
- getParent() - function in ai.platon.pulsar.crawl.parse.ParseFilter
- getParentId() - function in ai.platon.pulsar.crawl.parse.AbstractParseFilter
- getParentId() - function in ai.platon.pulsar.crawl.parse.EmptyParseFilter
- getParentId() - function in ai.platon.pulsar.crawl.parse.ParseFilter
- getParse() - function in ai.platon.pulsar.common.options.FetchOptions
- getParse() - function in ai.platon.pulsar.common.options.LoadOptionDefaults
- getParse() - function in ai.platon.pulsar.common.options.LoadOptions
- getParse() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getPARSE_PLUGINS_FILE() - function in ai.platon.pulsar.crawl.parse.ParserConfigReader.Companion
- The property name of the parse-plugins location
- getParseComponent() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getParseComponent() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getParseComponent() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getParseComponent() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- The parse component
- getParseComponent() - function in ai.platon.pulsar.crawl.component.LoadComponent
- getParseFilters() - function in ai.platon.pulsar.crawl.parse.ParseFilters
- getParseResult() - function in ai.platon.pulsar.crawl.parse.html.ParseContext
- getParserFactory() - function in ai.platon.pulsar.crawl.parse.PageParser
- getParsers() - function in ai.platon.pulsar.crawl.parse.ParseResult
- getParsers() - function in ai.platon.pulsar.crawl.parse.ParserConfig
- getParsers(String) - function in ai.platon.pulsar.crawl.parse.ParserConfig
- getParsers(String,String) - function in ai.platon.pulsar.crawl.parse.ParserFactory
- Function returns an array of Parsers for a given content type.
- getPassword() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getPasswordSelector() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getPath() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- The path of the file source
- getPath() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- The path of the file source
- getPath() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- The path of the file source
- getPath(String) - function in ai.platon.pulsar.common.MiscMessageWriter
- getPendingFetchItems() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler.Status
- getPendingStart() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask
- getPendingTask(Integer) - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getPendingTasks() - function in ai.platon.pulsar.crawl.fetch.batch.FetchLoop.Companion
- getPersist() - function in ai.platon.pulsar.common.options.LoadOptions
- getPersistContentMBytes() - function in ai.platon.pulsar.crawl.CoreMetrics
- getPersists() - function in ai.platon.pulsar.crawl.CoreMetrics
- getPoolId() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask
- getPoolPendingTimeout() - function in ai.platon.pulsar.crawl.fetch.batch.TaskMonitor
- Once timeout, the pending items should be put to the ready pool again.
- getPortalLabels() - function in ai.platon.pulsar.crawl.AmazonDiagnosis.Companion
- The labeled portals: https://www.amazon.com/Best-Sellers/zgbs https://www.amazon.com/gp/new-releases https://www.amazon.com/gp/movers-and-shakers https://www.amazon.
- getPreferParallel() - function in ai.platon.pulsar.common.options.LoadOptions
- getPrevFetchTime() - function in ai.platon.pulsar.crawl.schedule.ModifyInfo
- The previous actual latest fetch time
- getPrevModifiedTime() - function in ai.platon.pulsar.crawl.schedule.ModifyInfo
- getPriority() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getPriority() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getPriority() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getPriority() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getPriority() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getPriority() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getPriority() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getPriority() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getPriority() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getPriority() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- The priority
- getPriority() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getPriority() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getPriority() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask
- getPriority() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask.Key
- getPriority() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getPriority() - function in ai.platon.pulsar.crawl.fetch.batch.data.PoolId
- getPriority() - function in java.lang.IndexThread
- getPRIORITY() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getPrivacyContextIdGenerator() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyManager
- getPrivacyLeakMinorWarnings() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getPrivacyLeakWarnings() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getProcessing() - function in ai.platon.pulsar.crawl.StreamingCrawlerMetrics
- getProtocol() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask
- getProtocol() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask.Key
- getProtocol() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getProtocol() - function in ai.platon.pulsar.crawl.fetch.batch.data.PoolId
- getProtocol(WebPage) - function in ai.platon.pulsar.crawl.protocol.ProtocolFactory
- TODO: configurable, using major protocol/sub protocol is a good idea Using major protocol/sub protocol is a good idea, for example: selenium:http://www.baidu.
- getProtocol(String) - function in ai.platon.pulsar.crawl.protocol.ProtocolFactory
- Returns the appropriate Protocol implementation for a url.
- getProtocol(FetchMode) - function in ai.platon.pulsar.crawl.protocol.ProtocolFactory
- getProtocolFactory() - function in ai.platon.pulsar.crawl.component.BatchFetchComponent
- getProtocolFactory() - function in ai.platon.pulsar.crawl.component.FetchComponent
- getProtocolOutput(WebPage) - function in ai.platon.pulsar.crawl.protocol.http.AbstractNativeHttpProtocol
- getProtocolOutput(WebPage) - function in ai.platon.pulsar.crawl.protocol.AbstractNativeHttpProtocol
- getProtocolOutput(WebPage) - function in ai.platon.pulsar.crawl.protocol.http.AbstractHttpProtocol
- Returns the ProtocolOutput for a fetch list entry.
- getProtocolOutput(WebPage) - function in ai.platon.pulsar.crawl.protocol.Protocol
- Returns the ProtocolOutput for a fetch list entry.
- getProtocolOutputDeferred(WebPage,Continuation) - function in ai.platon.pulsar.crawl.protocol.http.AbstractNativeHttpProtocol
- getProtocolOutputDeferred(WebPage) - function in ai.platon.pulsar.crawl.protocol.http.AbstractHttpProtocol
- Returns the ProtocolOutput for a fetch list entry.
- getProtocolOutputDeferred(WebPage) - function in ai.platon.pulsar.crawl.protocol.Protocol
- Returns the ProtocolOutput for a fetch list entry.
- getProtocolStatus() - function in ai.platon.pulsar.crawl.protocol.ProtocolOutput
- getPROTOTYPE() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextId.Companion
- getPROTOTYPE_DIR() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext.Companion
- getProxyEntry() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getProxyServer() - function in ai.platon.pulsar.crawl.fetch.privacy.BrowserInstanceId
- getPulsarEnvironment() - function in ai.platon.pulsar.context.PulsarContext
- getPulsarEnvironment() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getPulsarEnvironment() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getPulsarEnvironment() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getPulsarEnvironment() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- getQuery(String) - function in ai.platon.pulsar.crawl.common.URLUtil
- Returns the path for the url.
- getReadonly() - function in ai.platon.pulsar.common.options.LoadOptions
- getReadyFetchItems() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler.Status
- getRealTimeNetworkIFsRecvBytes() - function in ai.platon.pulsar.crawl.CoreMetrics
- getRECENT_TASKS_COUNT_LIMIT() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool.Companion
- getREF_EXTRACT_ERROR_DENSITY() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getREF_FETCH_ERROR_DENSITY() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getREF_INDEX_ERROR_DENSITY() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getREF_PARSE_ERROR_DENSITY() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getReferer() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getReferer() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getReferer() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getReferer() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- The url of the referer page
- getReferer() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getReferrer() - function in ai.platon.pulsar.common.options.LoadOptions
- getReflectedGlobalCache() - function in ai.platon.pulsar.crawl.common.GlobalCacheFactory
- getRefresh() - function in ai.platon.pulsar.common.options.LoadOptions
- getREFRESH() - function in ai.platon.pulsar.crawl.common.FetchState
- getRefresh() - function in ai.platon.pulsar.crawl.parse.html.HTMLMetaTags
- Sets
refreshto the supplied value. - getRefreshCodes() - function in ai.platon.pulsar.crawl.common.FetchState
- getRefreshHref() - function in ai.platon.pulsar.crawl.parse.html.HTMLMetaTags
- Sets the
refreshHref. - getRefreshTime() - function in ai.platon.pulsar.crawl.parse.html.HTMLMetaTags
- Sets the
refreshTime. - getReg() - function in ai.platon.pulsar.common.metrics.AppMetrics.Companion
- getReGenerate() - function in ai.platon.pulsar.common.options.GenerateOptions
- getReGenerateSeeds() - function in ai.platon.pulsar.common.options.GenerateOptions
- getRegexMatchedLinks() - function in ai.platon.pulsar.common.collect.FatLinkExtractor.Companion.Counters
- getRegexRules() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getRegexRules() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getRegisteredCounters() - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getRemoteAddr() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getRemoteAddr() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- getRemoteAddr() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getReparseLinks() - function in ai.platon.pulsar.common.options.LoadOptions
- getReport() - function in ai.platon.pulsar.common.options.LinkOptions
- getReport() - function in ai.platon.pulsar.crawl.CrawlLoop
- getReport() - function in ai.platon.pulsar.crawl.AbstractCrawlLoop
- getReport() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- getRequireAnchors() - function in ai.platon.pulsar.common.options.LoadOptions
- getRequired() - function in ai.platon.pulsar.crawl.index.IndexerMapping.MappingField
- getRequired() - function in ai.platon.pulsar.crawl.parse.html.OpenMapFields
- getRequireImages() - function in ai.platon.pulsar.common.options.LoadOptions
- getRequireNotBlank() - function in ai.platon.pulsar.common.options.LoadOptions
- getRequireSize() - function in ai.platon.pulsar.common.options.LoadOptions
- getReservedUrl() - function in ai.platon.pulsar.crawl.fetch.batch.IFetchEntry
- getResponse(String,WebPage,boolean) - function in ai.platon.pulsar.crawl.protocol.http.AbstractNativeHttpProtocol
- getResponse(WebPage,boolean) - function in ai.platon.pulsar.crawl.protocol.http.AbstractNativeHttpProtocol
- getResponse(WebPage,Boolean) - function in ai.platon.pulsar.crawl.protocol.http.AbstractHttpProtocol
- getResponse() - function in ai.platon.pulsar.crawl.fetch.FetchResult
- getResponseDeferred(WebPage,boolean,Continuation) - function in ai.platon.pulsar.crawl.protocol.http.AbstractNativeHttpProtocol
- getResponseDeferred(WebPage,Boolean) - function in ai.platon.pulsar.crawl.protocol.http.AbstractHttpProtocol
- getResponses(Collection,VolatileConfig) - function in ai.platon.pulsar.crawl.protocol.http.AbstractNativeHttpProtocol
- getResponses(Collection,VolatileConfig) - function in ai.platon.pulsar.crawl.protocol.AbstractNativeHttpProtocol
- getResponses(Collection,VolatileConfig) - function in ai.platon.pulsar.crawl.protocol.http.AbstractHttpProtocol
- getResponses(Collection,VolatileConfig) - function in ai.platon.pulsar.crawl.protocol.Protocol
- getRestrictCss() - function in ai.platon.pulsar.common.collect.RegexHyperlinkExtractor
- getRestrictCss() - function in ai.platon.pulsar.common.options.LinkOptions
- getResume() - function in ai.platon.pulsar.common.options.FetchOptions
- getRetries() - function in ai.platon.pulsar.crawl.StreamingCrawlerMetrics
- getRETRY() - function in ai.platon.pulsar.crawl.common.FetchState
- getRetryFailed() - function in ai.platon.pulsar.common.options.LoadOptions
- Force retry fetching the page if it's failed last time, or it's marked as gone This option is deprecated and be replaced by ignoreFailure which is more descriptive
- getReversedEndKey() - function in ai.platon.pulsar.crawl.filter.CrawlFilter
- getReversedKeyRanges() - function in ai.platon.pulsar.crawl.filter.CrawlFilters
- getReversedStartKey() - function in ai.platon.pulsar.crawl.filter.CrawlFilter
- getRobotRules(WebPage) - function in ai.platon.pulsar.crawl.protocol.http.AbstractNativeHttpProtocol
- getRobotRules(WebPage) - function in ai.platon.pulsar.crawl.protocol.AbstractNativeHttpProtocol
- getRobotRules(WebPage) - function in ai.platon.pulsar.crawl.protocol.http.AbstractHttpProtocol
- Retrieve robot rules applicable for this url.
- getRobotRules(WebPage) - function in ai.platon.pulsar.crawl.protocol.Protocol
- Retrieve robot rules applicable for this url.
- getRobotRulesSet(Protocol,URL) - function in ai.platon.pulsar.crawl.protocol.http.HttpRobotRulesParser
- Get the rules from robots.
- getRobotRulesSet(Protocol,String) - function in ai.platon.pulsar.crawl.protocol.HttpRobotRulesParser
- getRobotRulesSet(Protocol,String) - function in ai.platon.pulsar.crawl.protocol.RobotRulesParser
- getRobotRulesSet(Protocol,URL) - function in ai.platon.pulsar.crawl.protocol.RobotRulesParser
- getRoot() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getRoot() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getRound() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getRound() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector.Companion.Counters
- getRound() - function in ai.platon.pulsar.common.options.FetchOptions
- getRound() - function in ai.platon.pulsar.common.options.GenerateOptions
- getRoundCollected() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getRunningChromeProcesses() - function in ai.platon.pulsar.crawl.CoreMetrics.Companion
- getSCHEDULED() - function in ai.platon.pulsar.crawl.common.FetchState
- getScope() - function in ai.platon.pulsar.crawl.filter.CrawlFilters
- getScope() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers
- getSCOPE_CRAWLDB() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers.Companion
- getSCOPE_DEFAULT() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers.Companion
- Default scope.
- getSCOPE_FETCHER() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers.Companion
- getSCOPE_GENERATE_HOST_COUNT() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers.Companion
- getSCOPE_INDEXER() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers.Companion
- getSCOPE_INJECT() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers.Companion
- getSCOPE_LINKDB() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers.Companion
- getSCOPE_OUTLINK() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers.Companion
- getSCOPE_PARTITION() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers.Companion
- getScore() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getScoreVector() - function in ai.platon.pulsar.crawl.scoring.io.ScoreVectorWritable
- getScoringFilters() - function in ai.platon.pulsar.crawl.component.UpdateComponent
- getScriptTimeout() - function in ai.platon.pulsar.common.options.LoadOptions
- getScrollCount() - function in ai.platon.pulsar.common.options.LoadOptions
- getScrollInterval() - function in ai.platon.pulsar.common.options.LoadOptions
- getSEARCH_PAGE_URL_PATTERN() - function in ai.platon.pulsar.crawl.filter.CrawlFilter.Companion
- getSearchUrls() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getSeed() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getSeeds() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- The urls of portal pages from where hyper links are extracted from
- getSeeds() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- The urls of portal pages from where hyper links are extracted from
- getSeeds() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- The urls of portal pages from where hyper links are extracted from
- getSeeds() - function in ai.platon.pulsar.common.options.InjectOptions
- getSeeds(FetchMode,Integer) - function in ai.platon.pulsar.crawl.fetch.LazyFetchTaskManager
- getSEPERATORS() - function in ai.platon.pulsar.crawl.filter.TextFilter.Companion
- getSequence() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getSession() - function in ai.platon.pulsar.common.collect.FatLinkExtractor
- getSession() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- The pulsar session to use
- getSession() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- The pulsar session to use
- getSession() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- The pulsar session to use
- getSession() - function in ai.platon.pulsar.crawl.AbstractCrawler
- getSession() - function in ai.platon.pulsar.crawl.StreamingCrawler
- getSession() - function in ai.platon.pulsar.session.SessionAware
- getSessionBean() - function in ai.platon.pulsar.session.AbstractPulsarSession
- getSessionBean() - function in ai.platon.pulsar.session.BasicPulsarSession
- getSessionBeanFactory() - function in ai.platon.pulsar.session.AbstractPulsarSession
- The session scoped bean factory
- getSessionBeanFactory() - function in ai.platon.pulsar.session.BasicPulsarSession
- The session scoped bean factory
- getSessionBeanFactory() - function in ai.platon.pulsar.session.PulsarSession
- The scoped bean factory: for each volatileConfig object, there is a bean factory TODO: session scoped?
- getSessionConfig() - function in ai.platon.pulsar.session.AbstractPulsarSession
- The session scope volatile config, every setting is supposed to be changed at any time and any place
- getSessionConfig() - function in ai.platon.pulsar.session.BasicPulsarSession
- The session scope volatile config, every setting is supposed to be changed at any time and any place
- getSessionConfig() - function in ai.platon.pulsar.session.PulsarSession
- The session scope volatile config, every setting is supposed to be changed at any time and any place
- getSessionId() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getSessionId() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getSessions() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- All open sessions
- getSessions() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- All open sessions
- getSessions() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- All open sessions
- getSessions() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- All open sessions
- getSHADOW_METRIC_SYMBOL() - function in ai.platon.pulsar.common.metrics.AppMetrics.Companion
- getShortenKey() - function in ai.platon.pulsar.common.options.LoadOptions
- getShouldBreak() - function in ai.platon.pulsar.crawl.parse.FilterResult
- getShouldBreak() - function in ai.platon.pulsar.crawl.parse.ParseResult
- getShouldContinue() - function in ai.platon.pulsar.crawl.parse.FilterResult
- getShouldContinue() - function in ai.platon.pulsar.crawl.parse.ParseResult
- getSignature() - function in ai.platon.pulsar.crawl.parse.PageParser
- getSimulateEventHandler() - function in ai.platon.pulsar.crawl.PulsarEventHandler
- getSimulateEventHandler() - function in ai.platon.pulsar.crawl.AbstractPulsarEventHandler
- getSimulateEventHandler() - function in ai.platon.pulsar.crawl.DefaultPulsarEventHandler
- getSize() - function in ai.platon.pulsar.common.collect.LoadingDelayQueue
- getSize() - function in ai.platon.pulsar.common.collect.LocalFileHyperlinkCollector
- getSize() - function in ai.platon.pulsar.common.collect.CircularLocalFileHyperlinkCollector
- getSize() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getSize() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getSize() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getSize() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getSize() - function in ai.platon.pulsar.crawl.EventHandlerPipeline
- getSize() - function in ai.platon.pulsar.crawl.VoidEventHandlerPipeline
- getSize() - function in ai.platon.pulsar.crawl.UrlAwareHandlerPipeline
- getSize() - function in ai.platon.pulsar.crawl.UrlAwareFilterPipeline
- getSize() - function in ai.platon.pulsar.crawl.UrlFilterPipeline
- getSize() - function in ai.platon.pulsar.crawl.UrlHandlerPipeline
- getSize() - function in ai.platon.pulsar.crawl.WebPageHandlerPipeline
- getSize() - function in ai.platon.pulsar.crawl.UrlAwareWebPageHandlerPipeline
- getSize() - function in ai.platon.pulsar.crawl.HtmlDocumentHandlerPipeline
- getSize() - function in ai.platon.pulsar.crawl.WebDriverHandlerPipeline
- getSize() - function in ai.platon.pulsar.crawl.fetch.batch.data.PoolQueue
- getSkipTruncated() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getSlashed() - function in ai.platon.pulsar.common.options.InjectOptions
- getSlf4jReportInterval() - function in ai.platon.pulsar.common.metrics.AppMetrics
- getSMALL_CONTENT() - function in ai.platon.pulsar.crawl.common.FetchState
- getSmallPageRate() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getSmallPages() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMetrics
- getSmas() - function in ai.platon.pulsar.crawl.AmazonMetrics
- getSmWishedF() - function in ai.platon.pulsar.crawl.AmazonMetrics
- getSnRelease() - function in ai.platon.pulsar.crawl.AmazonMetrics
- getSource() - function in ai.platon.pulsar.common.EncodingDetector.EncodingClue
- getSpec() - function in ai.platon.pulsar.common.urls.NormUrl
- getSpecifiedGlobalCache() - function in ai.platon.pulsar.crawl.common.GlobalCacheFactory
- getStackTrace() - function in kotlin.IllegalBusinessPreconditionException
- getStackTrace() - function in kotlin.IllegalApplicationContextStateException
- getStackTrace() - function in kotlin.BlockedException
- getStackTrace() - function in kotlin.HttpException
- getStackTrace() - function in java.lang.IndexThread
- getStackTrace() - function in kotlin.PrivacyContextException
- getStackTrace() - function in kotlin.FatalPrivacyContextException
- getStackTrace() - function in kotlin.UrlFilterException
- getStackTrace() - function in kotlin.IndexingException
- getStackTrace() - function in kotlin.ParseException
- getStackTrace() - function in kotlin.ParserNotFound
- getStackTrace() - function in kotlin.ProtocolException
- getStackTrace() - function in kotlin.ProtocolNotFound
- getStackTrace() - function in kotlin.ScoringFilterException
- getStartKey() - function in ai.platon.pulsar.crawl.filter.CrawlFilter
- getStartTime() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getStartTime() - function in ai.platon.pulsar.common.message.LoadedPagesFormatter
- getStartTime() - function in ai.platon.pulsar.common.metrics.AppMetrics.Companion
- getStartTime() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- The start time
- getStartTime() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- The start time
- getStartTime() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- The start time
- getStartTime() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- The start time
- getStartTime() - function in ai.platon.pulsar.crawl.CoreMetrics
- getStartTime() - function in ai.platon.pulsar.crawl.fetch.BatchStat
- getStartTime() - function in ai.platon.pulsar.crawl.fetch.LazyFetchTaskManager
- getStartTime() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContext
- getStartTimes() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getState() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getState() - function in java.lang.IndexThread
- getStatus() - function in ai.platon.pulsar.common.domain.DomainSuffix
- getStatus() - function in ai.platon.pulsar.common.domain.TopLevelDomain
- getStatus() - function in ai.platon.pulsar.common.ReducerContext
- getStatus(Set,Boolean) - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getStatus(Boolean) - function in ai.platon.pulsar.common.metrics.EnumCounterRegistry
- getStatus() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getStatus() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- getStatus() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getStatus() - function in ai.platon.pulsar.crawl.fetch.FetchResult
- getStatus() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- If a fetch queue is inactive, the queue does not accept any tasks, nor serve any requests, but still hold pending tasks, waiting to finish
- getStatus() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- Driver status
- getStatus() - function in ai.platon.pulsar.crawl.protocol.ForwardingResponse
- getStatus() - function in ai.platon.pulsar.crawl.protocol.Response
- getSTATUS_MODIFIED() - function in ai.platon.pulsar.crawl.schedule.FetchSchedule.Companion
- Page is known to have been modified since our last visit.
- getSTATUS_NOTMODIFIED() - function in ai.platon.pulsar.crawl.schedule.FetchSchedule.Companion
- Page is known to remain unmodified since our last visit.
- getSTATUS_UNKNOWN() - function in ai.platon.pulsar.crawl.schedule.FetchSchedule.Companion
- It is unknown whether page was changed since our last visit.
- getStatusTracker() - function in ai.platon.pulsar.crawl.component.LoadComponent
- getStopped() - function in ai.platon.pulsar.crawl.fetch.driver.NavigateEntry
- getStoreContent() - function in ai.platon.pulsar.common.options.LoadOptionDefaults
- getStoreContent() - function in ai.platon.pulsar.common.options.LoadOptions
- getStoreContent() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getStored() - function in ai.platon.pulsar.crawl.index.IndexerMapping.MappingField
- getStrictDf() - function in ai.platon.pulsar.common.options.FetchOptions
- getStringValues() - function in ai.platon.pulsar.crawl.index.IndexField
- getSubmitSelector() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getSuccesses() - function in ai.platon.pulsar.crawl.StreamingCrawlerMetrics
- getSuccesses() - function in ai.platon.pulsar.crawl.AmazonMetrics.Companion
- getSuccesses() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMetrics
- getSuccessTasks() - function in ai.platon.pulsar.crawl.CoreMetrics
- getSuccessTasksPerSecond() - function in ai.platon.pulsar.crawl.CoreMetrics
- getSuggestions() - function in ai.platon.pulsar.common.sites.amazon.AmazonSearcherJsEventHandler
- getSupportedMimeTypes() - function in ai.platon.pulsar.crawl.parse.ParserConfig
- getSupportJavascript() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- Whether the web driver has javascript support
- getSupportJavascript() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getSupportParallel() - function in ai.platon.pulsar.crawl.protocol.http.AbstractNativeHttpProtocol
- getSupportParallel() - function in ai.platon.pulsar.crawl.protocol.AbstractNativeHttpProtocol
- getSupportParallel() - function in ai.platon.pulsar.crawl.protocol.http.AbstractHttpProtocol
- getSupportParallel() - function in ai.platon.pulsar.crawl.protocol.Protocol
- getSuppressed() - function in kotlin.IllegalBusinessPreconditionException
- getSuppressed() - function in kotlin.IllegalApplicationContextStateException
- getSuppressed() - function in kotlin.BlockedException
- getSuppressed() - function in kotlin.HttpException
- getSuppressed() - function in kotlin.PrivacyContextException
- getSuppressed() - function in kotlin.FatalPrivacyContextException
- getSuppressed() - function in kotlin.UrlFilterException
- getSuppressed() - function in kotlin.IndexingException
- getSuppressed() - function in kotlin.ParseException
- getSuppressed() - function in kotlin.ParserNotFound
- getSuppressed() - function in kotlin.ProtocolException
- getSuppressed() - function in kotlin.ProtocolNotFound
- getSuppressed() - function in kotlin.ScoringFilterException
- getSymbols() - function in ai.platon.pulsar.crawl.common.FetchState
- getSystemInfo() - function in ai.platon.pulsar.common.metrics.AppMetrics.Companion
- getSzgbs() - function in ai.platon.pulsar.crawl.AmazonMetrics
- getTabCount() - function in ai.platon.pulsar.crawl.fetch.driver.BrowserInstance
- getTabCount() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractBrowserInstance
- getTask() - function in ai.platon.pulsar.crawl.fetch.FetchResult
- getTaskId() - function in ai.platon.pulsar.common.options.LoadOptions
- getTasks() - function in ai.platon.pulsar.crawl.CoreMetrics
- The total bytes of page content of all success web pages
- getTasks() - function in ai.platon.pulsar.crawl.StreamingCrawlerMetrics
- getTasks() - function in ai.platon.pulsar.crawl.AmazonMetrics.Companion
- getTasks() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMetrics
- getTasksMonitor() - function in ai.platon.pulsar.crawl.fetch.batch.TaskScheduler
- getTaskTime() - function in ai.platon.pulsar.common.options.LoadOptionDefaults
- getTaskTime() - function in ai.platon.pulsar.common.options.LoadOptions
- The task time accepts date time format as the following:
ISO_INSTANT: yyyy-MM-ddThh:MM:ssZ
yyyy-MM-dd[ hh[:MM:ss]]
- getTEMP_MOVED() - function in ai.platon.pulsar.crawl.common.FetchState
- getTest() - function in ai.platon.pulsar.common.options.LoadOptionDefaults
- Set to be true if we are doing unit test or other test We will talk more, log more and trace more in test mode
- getTest() - function in ai.platon.pulsar.common.options.LoadOptions
- getText() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getText() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getText() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- The anchor text of this hyperlink
- getText() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- The anchor text of this hyperlink
- getText(String,Element) - function in ai.platon.pulsar.crawl.parse.html.JsoupExtractor.Companion
- getTextFilter() - function in ai.platon.pulsar.crawl.filter.CrawlFilter
- getThreadGroup() - function in java.lang.IndexThread
- getTiebaUrls() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getTIMEOUT_URLS_PAGE() - function in ai.platon.pulsar.crawl.fetch.LazyFetchTaskManager.Companion
- getTimeouts() - function in ai.platon.pulsar.crawl.StreamingCrawlerMetrics
- getTimeoutUrls() - function in ai.platon.pulsar.crawl.CoreMetrics
- Tracking hosts who is failed to fetch tasks.
- getTimePerPage() - function in ai.platon.pulsar.crawl.fetch.BatchStat
- getTimeReport() - function in ai.platon.pulsar.crawl.fetch.batch.TaskPool
- getTimeReport() - function in ai.platon.pulsar.crawl.fetch.batch.data.PoolQueue
- getTimers() - function in com.codahale.metrics.AppMetricRegistry
- getTimers(MetricFilter) - function in com.codahale.metrics.AppMetricRegistry
- getTopLinks() - function in ai.platon.pulsar.common.options.LoadOptions
- getTopN() - function in ai.platon.pulsar.common.options.GenerateOptions
- getTopNAnchorGroups() - function in ai.platon.pulsar.common.options.LoadOptions
- getTotalNetworkIFsRecvBytes() - function in ai.platon.pulsar.crawl.CoreMetrics
- The total all bytes received by the hardware last read from system
- getTotalSuccessBytes() - function in ai.platon.pulsar.crawl.fetch.BatchStat
- getTraceInfo() - function in ai.platon.pulsar.crawl.component.ParseComponent
- getType() - function in ai.platon.pulsar.common.domain.TopLevelDomain
- getType() - function in ai.platon.pulsar.crawl.index.IndexerMapping.MappingField
- getUncaughtExceptionHandler() - function in java.lang.IndexThread
- getUnfilteredLinks() - function in ai.platon.pulsar.common.collect.FatLinkExtractor.Companion.Counters
- getUNKNOWN() - function in ai.platon.pulsar.crawl.common.FetchState
- getUnknownUrls() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getUnmodifiedConfig() - function in ai.platon.pulsar.context.PulsarContext
- getUnmodifiedConfig() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getUnmodifiedConfig() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getUnmodifiedConfig() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getUnmodifiedConfig() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- The unmodified config
- getUnmodifiedConfig() - function in ai.platon.pulsar.session.AbstractPulsarSession
- getUnmodifiedConfig() - function in ai.platon.pulsar.session.BasicPulsarSession
- getUnmodifiedConfig() - function in ai.platon.pulsar.session.PulsarSession
- getUnparsableTypes() - function in ai.platon.pulsar.crawl.parse.PageParser
- getUnreachableHosts() - function in ai.platon.pulsar.crawl.CoreMetrics
- Tracking unreachable hosts
- getUpdateComponent() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getUpdateComponent() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getUpdateComponent() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getUpdateComponent() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- The update component
- getUpdateComponent() - function in ai.platon.pulsar.crawl.component.LoadComponent
- getUrl() - function in ai.platon.pulsar.common.urls.NormUrl
- getUrl() - function in ai.platon.pulsar.crawl.AddRefererAfterFetchHandler
- getUrl() - function in ai.platon.pulsar.crawl.common.url.ListenableHyperlink
- getUrl() - function in ai.platon.pulsar.crawl.common.url.StatefulListenableHyperlink
- getUrl() - function in ai.platon.pulsar.crawl.common.url.ParsableHyperlink
- getUrl() - function in ai.platon.pulsar.crawl.common.url.CompletableHyperlink
- The url of this hyperlink
- getUrl() - function in ai.platon.pulsar.crawl.common.url.CompletableListenableHyperlink
- getUrl() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getUrl() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask
- getUrl() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask.Key
- getUrl() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- The url to navigate The browser might redirect, so it might not be the same with currentUrl()
- getUrl() - function in ai.platon.pulsar.crawl.fetch.driver.NavigateEntry
- getUrl() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- getUrl() - function in ai.platon.pulsar.crawl.index.IndexDocument
- getUrl() - function in ai.platon.pulsar.crawl.parse.ParserNotFound
- getUrl() - function in ai.platon.pulsar.crawl.protocol.ForwardingResponse
- getUrl() - function in ai.platon.pulsar.crawl.protocol.ProtocolNotFound
- getUrl() - function in ai.platon.pulsar.crawl.protocol.Response
- getUrlContains() - function in ai.platon.pulsar.common.options.LinkOptions
- getUrlFeeder() - function in ai.platon.pulsar.crawl.CrawlLoop
- getUrlFeeder() - function in ai.platon.pulsar.crawl.AbstractCrawlLoop
- The fetch iterable from which all fetch tasks are taken
- getUrlFeeder() - function in ai.platon.pulsar.crawl.StreamingCrawlLoop
- The fetch iterable from which all fetch tasks are taken
- getUrlFilter() - function in ai.platon.pulsar.crawl.filter.CrawlFilter
- getUrlFilters() - function in ai.platon.pulsar.crawl.component.GenerateComponent
- getUrlFilters() - function in ai.platon.pulsar.crawl.filter.CrawlFilters
- getUrlFilters() - function in ai.platon.pulsar.crawl.filter.CrawlUrlFilters
- getUrlNormalizer() - function in ai.platon.pulsar.common.collect.HyperlinkCollector
- getUrlNormalizer() - function in ai.platon.pulsar.common.collect.CircularHyperlinkCollector
- getUrlNormalizer() - function in ai.platon.pulsar.common.collect.PeriodicalHyperlinkCollector
- getUrlNormalizers() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getUrlNormalizers() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getUrlNormalizers() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getUrlNormalizers() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- Url normalizers
- getUrlNormalizers() - function in ai.platon.pulsar.crawl.component.GenerateComponent
- getUrlNormalizers() - function in ai.platon.pulsar.crawl.filter.CrawlFilters
- getUrlNormalizers() - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers
- getURLNormalizers(String) - function in ai.platon.pulsar.crawl.filter.CrawlUrlNormalizers
- TODO : not implemented
- getUrlPattern() - function in ai.platon.pulsar.common.collect.RegexHyperlinkExtractor
- getUrlPool() - function in ai.platon.pulsar.crawl.common.GlobalCache
- The url pool, hold on queues of urls to fetch
- getUrlPostfix() - function in ai.platon.pulsar.common.options.LinkOptions
- getUrlPrefix() - function in ai.platon.pulsar.common.options.LinkOptions
- getUrlRegex() - function in ai.platon.pulsar.common.options.LinkOptions
- getUrls() - function in ai.platon.pulsar.crawl.StreamingCrawler
- The url sequence
- getUrls() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getUrlsFromSeed() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getUrlStatistics() - function in ai.platon.pulsar.crawl.CoreMetrics
- Tracking statistics for each host
- getUrlsTooLong() - function in ai.platon.pulsar.crawl.fetch.UrlStat
- getUrlString() - function in ai.platon.pulsar.crawl.fetch.batch.JobFetchTask
- getUsedMemory() - function in ai.platon.pulsar.crawl.CoreMetrics.Companion
- getUserDataDir() - function in ai.platon.pulsar.crawl.fetch.privacy.BrowserInstanceId
- getUsername() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getUsernameSelector() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getUuid() - function in ai.platon.pulsar.common.collect.PeriodicalLocalFileHyperlinkCollector
- getValue() - function in ai.platon.pulsar.common.EncodingDetector.EncodingClue
- getValues() - function in ai.platon.pulsar.common.ReducerContext
- getValues() - function in ai.platon.pulsar.crawl.index.IndexField
- getVariable(String) - function in ai.platon.pulsar.session.AbstractPulsarSession
- getVariable(String) - function in ai.platon.pulsar.session.BasicPulsarSession
- getVariable(String) - function in ai.platon.pulsar.session.PulsarSession
- getVerbose() - function in ai.platon.pulsar.common.options.FetchOptions
- getVerbose() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- getVerbose() - function in ai.platon.pulsar.common.sites.amazon.AmazonSearcherJsEventHandler
- getVerbose() - function in ai.platon.pulsar.crawl.AbstractWebDriverHandler
- getVerbose() - function in ai.platon.pulsar.crawl.AbstractWebPageWebDriverHandler
- getVerbose() - function in ai.platon.pulsar.crawl.EmptyWebDriverHandler
- getVerbose() - function in ai.platon.pulsar.crawl.AbstractSimulateEventHandler
- getVerbose() - function in ai.platon.pulsar.crawl.WebPageWebDriverHandlerPipeline
- getVerbose() - function in ai.platon.pulsar.crawl.ExpressionSimulateEventHandler
- getVerbose() - function in ai.platon.pulsar.crawl.DefaultSimulateEventHandler
- getVerbose() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getVerbose() - function in ai.platon.pulsar.crawl.event.CloseMaskLayerHandler
- getVersion() - function in ai.platon.pulsar.common.options.LoadOptions
- getVERSION() - function in ai.platon.pulsar.crawl.index.io.IndexDocumentWritable.Companion
- getVolatileConfig() - function in ai.platon.pulsar.crawl.fetch.FetchTask
- getWaitForTimeout() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- getWaitNonBlank() - function in ai.platon.pulsar.common.options.LoadOptions
- getWarnUpUrl() - function in ai.platon.pulsar.crawl.event.LoginHandler
- getWatchInterval() - function in ai.platon.pulsar.common.metrics.EnumCounterReporter
- getWatchInterval() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyContextMonitor
- getWEB_GRAPH_SCORE() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getWebDb() - function in ai.platon.pulsar.common.message.MiscMessageWriter
- TODO: no WebDb dependency
- getWebDb() - function in ai.platon.pulsar.context.support.AbstractPulsarContext
- getWebDb() - function in ai.platon.pulsar.context.support.BasicPulsarContext
- getWebDb() - function in ai.platon.pulsar.context.support.ClassPathXmlPulsarContext
- getWebDb() - function in ai.platon.pulsar.context.support.StaticPulsarContext
- The web db
- getWebDb() - function in ai.platon.pulsar.crawl.component.BatchFetchComponent
- getWebDb() - function in ai.platon.pulsar.crawl.component.GenerateComponent
- getWebDb() - function in ai.platon.pulsar.crawl.component.InjectComponent
- getWebDb() - function in ai.platon.pulsar.crawl.component.LoadComponent
- getWebDb() - function in ai.platon.pulsar.crawl.component.UpdateComponent
- getWeight() - function in ai.platon.pulsar.crawl.index.IndexDocument
- getWeight() - function in ai.platon.pulsar.crawl.index.IndexField
- getWithSymbolicLink() - function in ai.platon.pulsar.common.message.LoadedPagesFormatter
- getXpathRules() - function in ai.platon.pulsar.common.options.deprecated.CollectionOptions
- getXpathRules() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions
- getZERO() - function in ai.platon.pulsar.crawl.scoring.NamedScoreVector.Companion
- getZgbs() - function in ai.platon.pulsar.crawl.AmazonMetrics
- getZkHostString() - function in ai.platon.pulsar.common.options.FetchOptions
- getZombieContexts() - function in ai.platon.pulsar.crawl.fetch.privacy.PrivacyManager
- getZoneId() - function in ai.platon.pulsar.common.options.deprecated.CrawlOptions
- GlobalCache - class in ai.platon.pulsar.crawl.common
- The global cache
- GlobalCacheFactory - class in ai.platon.pulsar.crawl.common
- GOOD - enum entry in ai.platon.pulsar.common.options.Condition
- guessEncoding(WebPage,String) - function in ai.platon.pulsar.common.EncodingDetector
- Guess the encoding with the previously specified list of clues.
- guessPageCategory(String) - function in ai.platon.pulsar.crawl.filter.CrawlFilter.Companion
- A simple regex rule to sniff the possible category of a web page