SOLID FRAMEWORK 10.0.18950
Solid Framework SDK has been updated.
Note:
Language detection, page orientation detection and OCR character recognition has been improved for non Latin languages. To include these improvements please update required files using traineddata.zip from Solid Framework downloads.
Improvements:
- [docx] Content detection algorithm extended using pdf tags to bias output.
- [office] Improved detection of bound orientation for specific languages and the preferred orientation of that language.
- [office] Improve content detection of Korean language documents.
- [docx] Improved the detection of textboxes with similar layout to a two-cell table.
- [docx] Improved the detection of table header content.
- [docx] Improved table detection when cells in multiple columns contain only one hyphen.
- [docx] Improved the detection of line spacing.
- [docx] Improved detection of textboxes between column breaks.
- [docx] Improved the detection of grouped objects with similar layout to a table.
- [docx] Improved the detection of page headers.
- [json] Implemented strikethrough property in json export.
- [json] Improved the detection of text bounds.
- [docx] Improved the detection of tables within columns of text.
- [docx] Improved the detection of header content after page orientation change within a document.
- [pptx] Improved detection of character spacing.
- [office] Improved OCR analysis of non Latin glyphs.
Bugfixes:
- [docx] Fixed a bug preventing the detection of some tables in a certain document.
- [docx] Fixed a bug causing four characters on a variant color background to be omitted from conversion output.
- Fixed a bug causing additional text to be included in the target of a URL link.
- [pptx] Fixed a bug causing a word to be mislocated on a slide.
- [xlsx] Fixed a bug causing the cells of one row to become merged on a certain document.