Language detection, page orientation detection and OCR character recognition have been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.
Next release expected on 27 May 26
Improvements:
[docx]Improved detection of vector text.
[docx]Improved paragraph detection.
[docx]Improved soft hyphen detection.
[docx]Improved detection of certain non-standard encoded characters.
[docx]Implemented detection of additional list items in Chinese, Japanese and Korean language documents.
[office]Implemented the inclusion of producer information in conversion output.
Bugfixes:
[docx]Fixed a bug preventing the detection of text in a specific document.
[docx]Fixed a bug where the height of an empty textbox prevented the document from being opened in Microsoft Word.
[docx]Fixed a bug causing Type 3 text in a font bounding box to be clipped.
[docx]Fixed a bug causing the substitution of a symbol for a non-standard encoded character.
[docx]Fixed a bug causing performance issues in a document.
[docx]Fixed a bug causing underline style to be detected as a line shape.
[xlsx]Fixed a bug resulting in rows becoming merged on a document.
[pptx]Fixed a bug interfering with image transparency detection.
Language detection, page orientation detection and OCR character recognition have been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.
Next release expected on 1 Apr 26
Improvements:
[office] Enabled text detection support for PDFs with non-standard encoding and based on the following languages: Bengali, Gujarati, Hindi, Kannada, Malayalam, Manipuri (Meetei Meyah), Oriya, Punjabi, Santali, Tamil, Telugu and Thai.
[docx] Implemented a text filtering procedure for scanned pages when text presented with hexadecimal string.
[docx] Improved underline property detection.
[office] Implemented use of the font family from a PDF file to select fonts installed on the operating system.
Language detection, page orientation detection and OCR character recognition have been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.
Next release expected on 18 Feb 26
Improvements:
[pptx] Add an option to control the un-tiling of PowerPoint presentation handouts, resulting in one slide per handout page.
[docx] Improved detection of multiple separate tables on one page.
[docx] Improved paragraph detection when line spacing is larger than default line spacing.
[docx] Improved detection of hyperlinks.
Improved detection of paragraph styles.
[docx] Improved the detection of annotations as part of table and layout and detection.
Bugfixes:
[xlsx] Fixed a bug that caused certain text in a document to be detected outside of the table.
[pptx] Fixed a bug causing the last image of a slide to be omitted.
SOLID FRAMEWORK 10.0.19910
/in Release Notes /by Tammy RLanguage detection, page orientation detection and OCR character recognition have been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.
Next release expected on 27 May 26
Improvements:
Bugfixes:
SOLID FRAMEWORK 10.0.19752
/in Release Notes /by Tammy RLanguage detection, page orientation detection and OCR character recognition have been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.
Next release expected on 1 Apr 26
Improvements:
Bugfixes:
SOLID FRAMEWORK 10.0.19632
/in Release Notes /by Tammy RLanguage detection, page orientation detection and OCR character recognition have been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.
Next release expected on 18 Feb 26
Improvements:
Bugfixes: