SOLID FRAMEWORK 10.0.18708
Solid Framework SDK has been updated.
Improvements:
- [docx] Improved column and row detection of hybrid split tables.
- [office] Implemented the recognition of non standard encoded vertical Japanese characters.
- [office] Improved the precision of non standard encoded Arabic character coordinates.
- [docx] Improved detection of single column non-table content.
- [docx] Improved table detection.
- [office] Improved the rendering of Type 3 font glyphs.
- [docx] Improved the optical character recognition of large images on 32 bit platforms.
- [docx] Implemented list recognition in pdfs using image bullets.
- [json] Implemented the option to ignore the detection of tiled pages in json export.
- [json] Improved json export of pages that exceed Microsoft size
- [json] Support nested tables being placed inside corresponding cell contents.
- [office] Applied custom language string options for Chinese text recovery.
- [office] Improved initialization mode for use of Thai trained data language file.
- [docx] Improved the detection of list items to prevent inclusion of undesirable footnote content.
- [office] Implemented automatic rotation detection of Japanese and Korean documents using optical character recognition.
Bugfixes:
- [docx] Fixed a bug causing the misdetection of multiple glyph shapes representing a single letter “e” in a document.
- [docx] Fixed a bug preventing detection of a borderless table when the table contained extended spaces between rows.
- [docx] Fixed a bug causing the detection of unnecessary column breaks in a document with right to left aligned Arabic text.
- [docx] Fixed a relative height calculation issue preventing a very large document from opening in Microsoft Word.
- [docx] Fixed a bug preventing the detection of columns when ignoring the tagged table structure of a pdf.