SOLID FRAMEWORK 10.0.19506

Language detection, page orientation detection and OCR character recognition have been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.

Next release expected on 7 Jan 26

Improvements: 

  • [docx]  Expand support for the detection of Type3 fonts.
  • [docx/pptx] Improved the detection of hyperlinks containing special characters, multiple lines and E-mail addresses.
  • [docx] Support retention of special characters for URLs.
  • [pptx] Improved detection of multi-line hyperlinks.
  • [docx] Improved the detection of hyperlink text ending with ‘/*’.
  • [docx] Improved detection of text to display for E-mail address links.
  • [docx] Improved the conversion time of document containing a large amount of ortholine graphics.
  • [office]Improved the detection of Korean text.
  • [xlsx] Improved number detection to exclude IP addresses.

Bugfixes: 

  • [docx] Fixed an issue causing an incorrect style text style detection.
  • [docx] Fixed an issue preventing list detection in a certain document.
  • [docx] Fixed an issue causing a paragraph break to be detected in a multi-line list item.
  • [pptx] Fixed an issue preventing detection of an image in a document.
  • [pptx] Fixed an issue resulting in the trial watermark interfering with text detection.
  • [docx] Fixed an issue causing rows of a nested table to become merged in a document.
  • [docx] Fixed a reading issue of the Xref stream preventing successful conversion of a document.

SOLID FRAMEWORK 10.0.19392

Language detection, page orientation detection and OCR character recognition have been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.

Next release expected on 26 Nov 25

Improvements: 

  • [office]    Improved text encoding detection for complex glyphs like ligatures.
  • [office]    Improved encoding recovery for self-intersected glyphs.
  • [office]    Significant improvements to hyperlink detection with an emphasis on multiline hyperlinks.
  • [docx]      Improved text transparency conversion.

Bugfixes: 

  • [docx]      Fixed pdf annotation to MSWord comments conversion.
  • [pptx]      Fixed tabs placement. Prefer using space characters for small gaps.
  • [office]    Fixed a bug preventing the accurate detection of vector glyphs in a document.
  • [docx]      Fixed a bug causing borderless table headers to be detected as page headers in a document.

SOLID FRAMEWORK 10.0.19312

Language detection, page orientation detection and OCR character recognition have been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.

Next release expected on 08 Oct 25

Improvements: 

  • [office] Improved the support of transparent graphics. 
  • [docx] Improved the layout of type 1 font handling to improve paragraph detection.  
  • [docx] Improved detection of single borderless table structure. 
  • [docx] Improved the optical character recognition of Korean documents. 

Bugfixes: 

  • [pptx] Fixed a bug preventing the detection of alternate text on a slide.