SOLID FRAMEWORK 10.0.18708

Solid Framework SDK has been updated.

Improvements: 

  • [docx]     Improved column and row detection of hybrid split tables.
  • [office]     Implemented the recognition of non standard encoded vertical Japanese characters.
  • [office]     Improved the precision of non standard encoded Arabic character coordinates.
  • [docx]     Improved detection of single column non-table content.
  • [docx]     Improved table detection.
  • [office]     Improved the rendering of Type 3 font glyphs.
  • [docx]     Improved the optical character recognition of large images on 32 bit platforms.
  • [docx]     Implemented list recognition in pdfs using image bullets.
  • [json]     Implemented the option to ignore the detection of tiled pages in json export.
  • [json]     Improved json export of pages that exceed Microsoft size
  • [json]     Support nested tables being placed inside corresponding cell contents.
  • [office]     Applied custom language string options for Chinese text recovery.
  • [office]     Improved initialization mode for use of Thai trained data language file.
  • [docx]     Improved the detection of list items to prevent inclusion of undesirable footnote content.
  • [office]     Implemented automatic rotation detection of Japanese and Korean documents using optical character recognition.

Bugfixes: 

  • [docx]     Fixed a bug causing the misdetection of multiple glyph shapes representing a single letter “e” in a document.
  • [docx]     Fixed a bug preventing detection of a borderless table when the table contained extended spaces between rows.
  • [docx]     Fixed a bug causing the detection of unnecessary column breaks in a document with right to left aligned Arabic text.
  • [docx]     Fixed a relative height calculation issue preventing a very large document from opening in Microsoft Word.
  • [docx]     Fixed a bug preventing the detection of columns when ignoring the tagged table structure of a pdf.