-
public final class IndexDocument.Builder
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description public classIndexDocument.Builder.Companion
-
Constructor Summary
Constructors Constructor Description IndexDocument.Builder(ImmutableConfig conf)
-
Method Summary
Modifier and Type Method Description final IndexDocument.Builderwith(IndexingFilters indexingFilters)final IndexDocument.Builderwith(ScoringFilters scoringFilters)final IndexDocumentbuild(String key, WebPage page)Index a WebPage, here we add the following fields: <tt>id</tt>: default uniqueKey for the IndexDocument.
<tt>digest</tt>: Digest is used to identify pages (like unique ID) and is used to remove duplicates during the dedup procedure. It is calculated
<tt>batchId</tt>: The page belongs to a unique batchId, this is its identifier.
<tt>boost</tt>: Boost is used to calculate document (field) score which can be used within queries submitted to the underlying indexing library to find the best results. It's part of the scoring algorithms. See scoring.link, scoring.opic, scoring.tld, etc.
-
-
Method Detail
-
with
final IndexDocument.Builder with(IndexingFilters indexingFilters)
-
with
final IndexDocument.Builder with(ScoringFilters scoringFilters)
-
build
final IndexDocument build(String key, WebPage page)
Index a WebPage, here we add the following fields:
<tt>id</tt>: default uniqueKey for the IndexDocument.
<tt>digest</tt>: Digest is used to identify pages (like unique ID) and is used to remove duplicates during the dedup procedure. It is calculated
<tt>batchId</tt>: The page belongs to a unique batchId, this is its identifier.
<tt>boost</tt>: Boost is used to calculate document (field) score which can be used within queries submitted to the underlying indexing library to find the best results. It's part of the scoring algorithms. See scoring.link, scoring.opic, scoring.tld, etc.
- Parameters:
key- The key of the page (reversed url).page- The WebPage.
-
-
-