Enum TextFormattingMode

Info

Defines different modes which can be used while converting a PDF document into text. See Documentize.TextExtractorOptions class.

public enum TextFormattingMode

Fields

  • Flatten = 2: Represent PDF content with positioning text fragments by their coordinates. It is basically similar to “Raw” mode. But while “Raw” focuses on preserving the structure of text fragments (operators) in a document, “Flatten” focuses on keeping text in the order it is read.
  • Pure = 0: Represents PDF text content with a few formatting routines. This includes taking into account relative positions and adding extra spaces to align text to the width of the page.
  • Raw = 1: Represents PDF content as is, i.e. without formatting.

Namespace: Documentize Assembly: Documentize.dll