Enum TextExtractorOptions.TextFormattingMode

Info

Defines the modes available when converting a PDF document to text. See the Documentize.TextExtractorOptions class.

public enum TextExtractorOptions.TextFormattingMode

Fields

  • Plain = 2: Represents PDF text content, taking the relative positions of text fragments into account but without adding extra spaces.
  • Pure = 0: Represents PDF text content with light formatting applied: relative positions are taken into account, and extra spaces are added to align the text to the width of the page.
  • Raw = 1: Represents PDF content as is, without any formatting.
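
The fields above are listed alphabetically, which is not the order of their numeric values. The following is a minimal sketch, not part of the library, that makes the value order explicit. It assumes a local stand-in enum that mirrors the documented declaration. The names TextExtractorOptionsSketch and Program are hypothetical; in real code the enum comes from Documentize.dll and no local copy is needed.

```csharp
using System;

// Hypothetical stand-in mirroring the documented declaration.
// In real code, reference Documentize.dll instead of redeclaring the enum.
public static class TextExtractorOptionsSketch
{
    public enum TextFormattingMode
    {
        Pure = 0,   // relative positions plus extra spaces for page-width alignment
        Raw = 1,    // content as is, without formatting
        Plain = 2   // relative positions, no extra spaces
    }
}

public static class Program
{
    public static void Main()
    {
        // Enum.GetValues returns the members sorted by their underlying value.
        foreach (TextExtractorOptionsSketch.TextFormattingMode mode in
                 Enum.GetValues(typeof(TextExtractorOptionsSketch.TextFormattingMode)))
        {
            Console.WriteLine($"{mode} = {(int)mode}");
        }
    }
}
```

Because Pure has the value 0, it is also what default(TextFormattingMode) evaluates to in C#. Whether the library treats it as the default mode is not stated here.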

Namespace: Documentize
Assembly: Documentize.dll
