A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
_
B
- BasicPulsarContext - class in ai.platon.pulsar.context.support
- The main entry point for pulsar functionality.
- BasicPulsarSession - class in ai.platon.pulsar.session
- Created by vincent on 18-1-17.
- BatchFetchComponent - class in ai.platon.pulsar.crawl.component
- BatchStat - class in ai.platon.pulsar.crawl.fetch
- bbs - enum entry in ai.platon.pulsar.crawl.component.GenerateComponent.Companion.Counter
- BEST - enum entry in ai.platon.pulsar.common.options.Condition
- BETTER - enum entry in ai.platon.pulsar.common.options.Condition
- biAccept(Object,Object,BiConsumer,CompletableFuture.BiAccept) - function in java.util.concurrent.CompletableHyperlink
- biAccept(Object,Object,BiConsumer,CompletableFuture.BiAccept) - function in java.util.concurrent.CompletableListenableHyperlink
- biApply(Object,Object,BiFunction,CompletableFuture.BiApply) - function in java.util.concurrent.CompletableHyperlink
- biApply(Object,Object,BiFunction,CompletableFuture.BiApply) - function in java.util.concurrent.CompletableListenableHyperlink
- bipush(CompletableFuture,CompletableFuture.BiCompletion) - function in java.util.concurrent.CompletableHyperlink
- bipush(CompletableFuture,CompletableFuture.BiCompletion) - function in java.util.concurrent.CompletableListenableHyperlink
- biRun(Object,Object,Runnable,CompletableFuture.BiRun) - function in java.util.concurrent.CompletableHyperlink
- biRun(Object,Object,Runnable,CompletableFuture.BiRun) - function in java.util.concurrent.CompletableListenableHyperlink
- BlockedException - class in ai.platon.pulsar.crawl.protocol.http
- BlockFilter - class in ai.platon.pulsar.crawl.filter
- blog - enum entry in ai.platon.pulsar.crawl.component.GenerateComponent.Companion.Counter
- BOILERPIPE - enum entry in ai.platon.pulsar.common.options.ItemExtractor
- bringToFront() - function in ai.platon.pulsar.crawl.fetch.driver.AbstractWebDriver
- bringToFront() - function in ai.platon.pulsar.crawl.fetch.driver.WebDriver
- BrowserInstance - class in ai.platon.pulsar.crawl.fetch.driver
- BrowserInstanceId - class in ai.platon.pulsar.crawl.fetch.privacy
- Every browser instance have a unique data dir, proxy is required to be unique too if it is enabled
- BrowserInstanceId.Companion - class in ai.platon.pulsar.crawl.fetch.privacy.BrowserInstanceId
- BrowserTypeConverter - class in ai.platon.pulsar.common.options
- build() - function in ai.platon.pulsar.common.metrics.CodahaleSlf4jReporter.Builder
- Builds a Slf4jReporter with the given properties.
- build() - function in ai.platon.pulsar.common.options.LinkOptions
- build() - function in ai.platon.pulsar.common.options.deprecated.EntityOptions.Builder
- build(String,WebPage) - function in ai.platon.pulsar.crawl.index.IndexDocument.Builder
- Index a WebPage, here we add the following fields:
<tt>id</tt>: default uniqueKey for the IndexDocument.
<tt>digest</tt>: Digest is used to identify pages (like unique ID) and is used to remove duplicates during the dedup procedure. It is calculated
<tt>batchId</tt>: The page belongs to a unique batchId, this is its identifier.
<tt>boost</tt>: Boost is used to calculate document (field) score which can be used within queries submitted to the underlying indexing library to find the best results. It's part of the scoring algorithms. See scoring.link, scoring.opic, scoring.tld, etc.
- build() - function in ai.platon.pulsar.common.options.deprecated.JsoupExtractor
- buildMap() - function in com.codahale.metrics.AppMetricRegistry
- BY_DOMAIN - enum entry in ai.platon.pulsar.crawl.common.URLUtil.GroupMode
- BY_HOST - enum entry in ai.platon.pulsar.crawl.common.URLUtil.GroupMode
- BY_IP - enum entry in ai.platon.pulsar.crawl.common.URLUtil.GroupMode