SOLID FRAMEWORK 10.0.18708

Solid Framework SDK has been updated.

Improvements: 

  • [docx]     Improved column and row detection of hybrid split tables.
  • [office]     Implemented the recognition of non standard encoded vertical Japanese characters.
  • [office]     Improved the precision of non standard encoded Arabic character coordinates.
  • [docx]     Improved detection of single column non-table content.
  • [docx]     Improved table detection.
  • [office]     Improved the rendering of Type 3 font glyphs.
  • [docx]     Improved the optical character recognition of large images on 32 bit platforms.
  • [docx]     Implemented list recognition in pdfs using image bullets.
  • [json]     Implemented the option to ignore the detection of tiled pages in json export.
  • [json]     Improved json export of pages that exceed Microsoft size
  • [json]     Support nested tables being placed inside corresponding cell contents.
  • [office]     Applied custom language string options for Chinese text recovery.
  • [office]     Improved initialization mode for use of Thai trained data language file.
  • [docx]     Improved the detection of list items to prevent inclusion of undesirable footnote content.
  • [office]     Implemented automatic rotation detection of Japanese and Korean documents using optical character recognition.

Bugfixes: 

  • [docx]     Fixed a bug causing the misdetection of multiple glyph shapes representing a single letter “e” in a document.
  • [docx]     Fixed a bug preventing detection of a borderless table when the table contained extended spaces between rows.
  • [docx]     Fixed a bug causing the detection of unnecessary column breaks in a document with right to left aligned Arabic text.
  • [docx]     Fixed a relative height calculation issue preventing a very large document from opening in Microsoft Word.
  • [docx]     Fixed a bug preventing the detection of columns when ignoring the tagged table structure of a pdf.

SOLID FRAMEWORK 10.0.18610

Solid Framework SDK has been updated.

Improvements: 

  • [docx]     Improved optical character recognition of black text on a gray background.
  • [json]      Support link property in JSON conversion output.
  • [json]      Improved the detection of annotation identification.
  • [docx]     Improved the forced detection of non-standard encoding of Arabic, Chinese, Japanese and Korean characters of multiple page documents.
  • [docx]     Improved the optical character recognition preprocessing of vector text.
  • [docx]     Improved detection of paragraph styles.
  • [docx/xlsx]     Improved the detection of borderless tables.
  • [json]     Improved hyperlink detection.
  • [json]     Improved reliability of paragraph coordinates of rotated textboxes.
  • [json]     Improved detection of the page rotation angle when autorotate is manually disabled.

Bugfixes: 

  • [docx]     Fixed a bug causing an extra line of body text to be detected in the header.
  • [json]      Improved the detection of page orientation containing inconsistently orientated text.
  • [docx]     Fixed a bug resulting in the partial loss of specific bounding boxes of a document.

SOLID FRAMEWORK 10.0.18460

Solid Framework SDK has been updated.

Improvements: 

  • [docx]    Improved the detection of small images comprised of thousands of objects in the original pdf container.
  • [docx]    Improved the order detection of overlapping shapes and images.
  • [docx]    Improved the detection of Japanese characters.
  • [docx]    Improved the stability of font color detection in text boxes with varying fill colors.
  • [docx]    Improved the optical character recognition preprocessing of vector text.
  • [docx]    Improved the stability of page orientation when vertical Japanese text is detected.
  • [docx]    Improved the detection of bullet and list items in Korean language documents.
  • [docx]    Improved the detection of Latin characters in Korean language documents.
  • [docx]    Improved the detection of Japanese language.
  • [docx]    Implemented post processing of images to improve optical character recognition.
  • [docx]    Improved detection of full page black image overlaid with images of white text.

Bugfixes: 

  • [xlsx]     Fixed an issue resulting in various rows of a large table to become combined.
  • [docx]    Fixed an issue causing a textbox to convert with an incorrect fill color.
  • [docx]    Fixed an issue preventing the detection of a hyperlink.
  • [docx]    Fixed an issue resulting in text being converted as an image.
  • [docx]    Fixed an issue that caused the conversion time of a specific document to be extended.

Security 

A limited number of third-party libraries have been updated to include the latest security fixes.