SOLID FRAMEWORK 10.0.19392

Language detection, page orientation detection, and OCR character recognition have been improved for non-Latin languages. To include these improvements, please update the required files using traineddata.zip from the Solid Framework downloads.

Next release expected on 26 Nov 2025

Improvements: 

  • [office]    Improved text encoding detection for complex glyphs like ligatures.
  • [office]    Improved encoding recovery for self-intersecting glyphs.
  • [office]    Significantly improved hyperlink detection, particularly for multiline hyperlinks.
  • [docx]      Improved text transparency conversion.

Bugfixes: 

  • [docx]      Fixed conversion of PDF annotations to MS Word comments.
  • [pptx]      Fixed tab placement. Space characters are now preferred for small gaps.
  • [office]    Fixed a bug that prevented accurate detection of vector glyphs in a document.
  • [docx]      Fixed a bug causing borderless table headers to be detected as page headers in a document.