SOLID FRAMEWORK 10.0.20070
Language detection, page orientation detection, and OCR character recognition have been improved for non-Latin languages. To include these improvements, please update the required files using traineddata.zip from the Solid Framework downloads.
Next release expected on 08 July 26
Improvements:
- [xlsx] Improved detection of vertical text in table headers.
- [json] Optimized the use of memory during the export of the internal database to JSON.
- [docx] Improved the detection of vector text.
- [pptx] Improved the order of objects to reflect reading order instead of PDF order.
- [docx] Improved detection of invalid Unicode characters in PDF encoding and their replacement with valid characters.
- [docx] Improved detection of strikethrough font effect.
- [pptx] Improved the detection of embedded fonts when available on the machine.
- [docx] Improved the detection of text with Type 3 fonts.
- [xlsx] Improved borderless table column detection.
Bugfixes:
- [Office] Fixed a bug interfering with the detection of a dark background color on a document.
- [docx] Fixed a bug causing an image to overlay the searchable text layer of a document.
- [docx] Fixed a bug preventing the accurate color detection of a transparent background element.
- [xlsx] Fixed a bug causing the height of a row to partially clip text.
- [docx] Fixed a bug causing certain cells in the header of a table to become merged.
- [docx] Fixed a bug that could cause font matching errors.
