SOLID FRAMEWORK 10.0.19392

Language detection, page orientation detection and OCR character recognition have been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.

Next release expected on 26 Nov 25

Improvements: 

  • [office]    Improved text encoding detection for complex glyphs like ligatures.
  • [office]    Improved encoding recovery for self-intersected glyphs.
  • [office]    Significant improvements to hyperlink detection with an emphasis on multiline hyperlinks.
  • [docx]      Improved text transparency conversion.

Bugfixes: 

  • [docx]      Fixed pdf annotation to MSWord comments conversion.
  • [pptx]      Fixed tabs placement. Prefer using space characters for small gaps.
  • [office]    Fixed a bug preventing the accurate detection of vector glyphs in a document.
  • [docx]      Fixed a bug causing borderless table headers to be detected as page headers in a document.

SOLID FRAMEWORK 10.0.19312

Language detection, page orientation detection and OCR character recognition have been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.

Next release expected on 08 Oct 25

Improvements: 

  • [office] Improved the support of transparent graphics. 
  • [docx] Improved the layout of type 1 font handling to improve paragraph detection.  
  • [docx] Improved detection of single borderless table structure. 
  • [docx] Improved the optical character recognition of Korean documents. 

Bugfixes: 

  • [pptx] Fixed a bug preventing the detection of alternate text on a slide. 

SOLID FRAMEWORK 10.0.19130

Solid Framework SDK has been updated.

Language detection, page orientation detection and OCR character recognition has been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.

Improvements: 

  • [office] Improved the shape detection of complex glyph structures in Chinese, Japanese and Korean language documents.
  • [office] Improved GNSE detection by expanding decision tree for lowercase letter ‘e’.
  • [docx] Support the detection of FreeText annotations.
  • [docx] Improved the order detection of overlapping shapes and text.
  • [docx] Improved borderless table detection.
  • [docx] Improved the detection of borderless tables within a two column layout.
  • [docx] Improved column detection.
  • [docx] Improved the detection of highlighted table content.
  • [docx] Improved the detection of a separate text layer when content is overlayed with a stamp image.
  • [docx] Improved the detection of multi-line cell content in hybrid tables.

Bugfixes: 

  • [docx] Fixed a bug causing a transparent hyperlink to disrupt the visible text.
  • [docx] Fixed a bug preventing the detection of a Japanese character in a specific document.
  • [docx] Fixed a bug causing an image to be detected as part of the header.
  • [docx] Fixed a bug that caused footer content to be detected as body text.
  • [docx] Fixed a bug preventing accurate text detection.
  • [docx] Fixed a bug causing footer detection to interfere with page numbering.