Class Tokenizer

  • All Implemented Interfaces:
    io.annot8.api.components.Annot8ComponentDescriptor<Tokenizer.Processor,​io.annot8.api.settings.NoSettings>, io.annot8.api.components.ProcessorDescriptor<Tokenizer.Processor,​io.annot8.api.settings.NoSettings>

    @ComponentName("OpenNLP Tokenizer")
    @ComponentDescription("Tokenizes words and sentences using OpenNLP tokenization models")
    public class Tokenizer
    extends io.annot8.common.components.AbstractProcessorDescriptor<Tokenizer.Processor,​io.annot8.api.settings.NoSettings>
    Tokenizes words and sentences using OpenNLP tokenization models
    • Constructor Detail

      • Tokenizer

        public Tokenizer()
    • Method Detail

      • createComponent

        protected Tokenizer.Processor createComponent​(io.annot8.api.context.Context context,
                                                      io.annot8.api.settings.NoSettings settings)
        Specified by:
        createComponent in class io.annot8.common.components.AbstractComponentDescriptor<Tokenizer.Processor,​io.annot8.api.settings.NoSettings>
      • capabilities

        public io.annot8.api.capabilities.Capabilities capabilities()