Language detection, page orientation detection and OCR character recognition have been improved for non-Latin languages. To include these improvements, please update the required files using traineddata.zip from the Solid Framework downloads.
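The update step above can be scripted. The following is a minimal sketch only: it assumes traineddata.zip has already been downloaded, and the function name `update_traineddata` and the target directory are hypothetical and not part of Solid Framework. It unpacks the archive over an existing folder using only the Python standard library.

```python
import zipfile
from pathlib import Path


def update_traineddata(archive: Path, target_dir: Path) -> list[str]:
    """Extract every file in the archive into target_dir, overwriting older copies.

    Hypothetical helper: the real location of the required files depends on
    how Solid Framework is deployed in your application.
    """
    target_dir.mkdir(parents=True, exist_ok=True)
    root = target_dir.resolve()
    with zipfile.ZipFile(archive) as zf:
        # Refuse entries that would land outside the target directory.
        for name in zf.namelist():
            dest = (root / name).resolve()
            if dest != root and root not in dest.parents:
                raise ValueError(f"unsafe path in archive: {name}")
        zf.extractall(root)
        # Report the files (not directory entries) that were written.
        return sorted(n for n in zf.namelist() if not n.endswith("/"))
```

A call might look like `update_traineddata(Path("traineddata.zip"), Path("path/to/required/files"))`, where the second argument is a placeholder for wherever your deployment keeps these files.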
Improvements:
[office] Improved the shape detection of complex glyph structures in Chinese, Japanese and Korean language documents.
[office] Improved GNSE detection by expanding decision tree for lowercase letter ‘e’.
[docx] Support the detection of FreeText annotations.
[docx] Improved the order detection of overlapping shapes and text.
[docx] Improved borderless table detection.
[docx] Improved the detection of borderless tables within a two column layout.
[docx] Improved column detection.
[docx] Improved the detection of highlighted table content.
[docx] Improved the detection of a separate text layer when content is overlaid with a stamp image.
[docx] Improved the detection of multi-line cell content in hybrid tables.
Bugfixes:
[docx] Fixed a bug causing a transparent hyperlink to disrupt the visible text.
[docx] Fixed a bug preventing the detection of a Japanese character in a specific document.
[docx] Fixed a bug causing an image to be detected as part of the header.
[docx] Fixed a bug that caused footer content to be detected as body text.
[docx] Fixed a bug preventing accurate text detection.
[docx] Fixed a bug causing footer detection to interfere with page numbering.
Language detection, page orientation detection and OCR character recognition have been improved for non-Latin languages. To include these improvements, please update the required files using traineddata.zip from the Solid Framework downloads.
Improvements:
[docx] Extended the content detection algorithm to use PDF tags to bias the output.
[office] Improved detection of bound orientation for specific languages and of the preferred orientation for each of those languages.
[office] Improved the content detection of Korean language documents.
[docx] Improved the detection of textboxes with similar layout to a two-cell table.
[docx] Improved the detection of table header content.
[docx] Improved table detection when cells in multiple columns contain only one hyphen.
[docx] Improved the detection of line spacing.
[docx] Improved detection of textboxes between column breaks.
[docx] Improved the detection of grouped objects with similar layout to a table.
[docx] Improved the detection of page headers.
[json] Implemented the strikethrough property in JSON export.
[json] Improved the detection of text bounds.
[docx] Improved the detection of tables within columns of text.
[docx] Improved the detection of header content after page orientation change within a document.
[pptx] Improved detection of character spacing.
[office] Improved OCR analysis of non-Latin glyphs.
Bugfixes:
[docx] Fixed a bug preventing the detection of some tables in a certain document.
[docx] Fixed a bug causing four characters on a variant color background to be omitted from conversion output.
Fixed a bug causing additional text to be included in the target of a URL link.
[pptx] Fixed a bug causing a word to be mislocated on a slide.
[xlsx] Fixed a bug causing the cells of one row to become merged on a certain document.
SOLID FRAMEWORK 10.0.19130
/in Release Notes /by Tammy R
Solid Framework SDK has been updated.
Note:
Language detection, page orientation detection and OCR character recognition have been improved for non-Latin languages. To include these improvements, please update the required files using traineddata.zip from the Solid Framework downloads.
Improvements:
Bugfixes:
SOLID FRAMEWORK 10.0.18950
/in Release Notes /by Tammy R
Solid Framework SDK has been updated.
Note:
Language detection, page orientation detection and OCR character recognition have been improved for non-Latin languages. To include these improvements, please update the required files using traineddata.zip from the Solid Framework downloads.
Improvements:
Bugfixes:
SOLID FRAMEWORK 10.0.18816
/in Release Notes /by Tammy R
Solid Framework SDK has been updated.
Improvements:
Bugfixes: