Package 

Class HyperlinkCollector

  • All Implemented Interfaces:
    ai.platon.pulsar.common.collect.CrawlableFatLinkCollector, ai.platon.pulsar.common.collect.collector.DataCollector, ai.platon.pulsar.common.collect.collector.PriorityDataCollector, kotlin.Comparable

    
    public class HyperlinkCollector
    extends AbstractPriorityDataCollector<UrlAware> implements CrawlableFatLinkCollector
                        

    Collects hyperlinks from the given seeds. The collected URLs are restricted by loadArguments and urlNormalizer:

    • every URL must be selected by the CSS outLinkSelector

    • every URL must match urlPattern

    • every URL must either not have been fetched before, or have expired relative to its last fetched version
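
The three restrictions above can be read as a chain of filters. The following is a minimal sketch of that reading, not the actual HyperlinkCollector implementation. The names `LinkFilterSketch`, `Candidate`, `collect`, `lastFetchTimes`, and `expires` are hypothetical and introduced only for illustration. The CSS selection step is modelled as a precomputed boolean flag, and "expired" is assumed to mean that the last fetch time plus an expiry duration lies in the past.

```java
import java.time.Duration;
import java.time.Instant;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

public class LinkFilterSketch {

    /** Hypothetical candidate link: a URL plus whether the CSS out-link selector picked it. */
    static final class Candidate {
        final String url;
        final boolean selectedByOutLinkSelector;

        Candidate(String url, boolean selectedByOutLinkSelector) {
            this.url = url;
            this.selectedByOutLinkSelector = selectedByOutLinkSelector;
        }
    }

    /**
     * Keeps only the URLs that pass all three restrictions:
     * selected by the out-link selector, matching urlPattern,
     * and either never fetched or expired (assumed: lastFetch + expires < now).
     */
    static List<String> collect(List<Candidate> candidates,
                                Pattern urlPattern,
                                Map<String, Instant> lastFetchTimes,
                                Duration expires,
                                Instant now) {
        List<String> accepted = new ArrayList<>();
        for (Candidate c : candidates) {
            // Restriction 1: the link must come from the selected part of the page.
            if (!c.selectedByOutLinkSelector) continue;

            // Restriction 2: the whole URL must match the pattern (full match, not a substring search).
            if (!urlPattern.matcher(c.url).matches()) continue;

            // Restriction 3: never fetched before, or the previous fetch has expired.
            Instant lastFetch = lastFetchTimes.get(c.url);
            boolean neverFetched = lastFetch == null;
            boolean expired = !neverFetched && lastFetch.plus(expires).isBefore(now);
            if (neverFetched || expired) {
                accepted.add(c.url);
            }
        }
        return accepted;
    }
}
```

In this sketch a link that fails any one check is dropped, so the order of the checks affects only cost, not the result. The cheap flag and pattern checks run before the fetch-history lookup.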