SOLID FRAMEWORK 10.0.19130

Solid Framework SDK has been updated.

Note:

Language detection, page orientation detection and OCR character recognition have been improved for non-Latin languages. To include these improvements, please update the required files using traineddata.zip from the Solid Framework downloads.
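
The update step in the note above could be scripted along these lines. This is only a minimal sketch: the function name extract_traineddata and the destination directory are hypothetical, and the location where Solid Framework expects the updated files is an assumption you should confirm against your own installation.

```python
import sys
import zipfile
from pathlib import Path


def extract_traineddata(zip_path, dest_dir):
    """Unpack every file in the archive into dest_dir and return the file names.

    dest_dir is an assumption: point it at whichever folder your
    Solid Framework deployment reads its data files from.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        # testzip() returns the name of the first corrupt member, or None.
        corrupt = archive.testzip()
        if corrupt is not None:
            raise ValueError(f"corrupt archive member: {corrupt}")
        archive.extractall(dest)
        # Skip directory entries so only real files are reported.
        return [name for name in archive.namelist() if not name.endswith("/")]


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage: python update_traineddata.py traineddata.zip <destination folder>
    for name in extract_traineddata(sys.argv[1], sys.argv[2]):
        print("updated", name)
```

Existing files with the same names are overwritten, so back up the destination folder first if you may need to roll back.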

Improvements: 

  • [office] Improved the shape detection of complex glyph structures in Chinese, Japanese and Korean language documents.
  • [office] Improved GNSE detection by expanding the decision tree for the lowercase letter ‘e’.
  • [docx] Added support for the detection of FreeText annotations.
  • [docx] Improved the order detection of overlapping shapes and text.
  • [docx] Improved borderless table detection.
  • [docx] Improved the detection of borderless tables within a two-column layout.
  • [docx] Improved column detection.
  • [docx] Improved the detection of highlighted table content.
  • [docx] Improved the detection of a separate text layer when content is overlaid with a stamp image.
  • [docx] Improved the detection of multi-line cell content in hybrid tables.

Bugfixes: 

  • [docx] Fixed a bug causing a transparent hyperlink to disrupt the visible text.
  • [docx] Fixed a bug preventing the detection of a Japanese character in a specific document.
  • [docx] Fixed a bug causing an image to be detected as part of the header.
  • [docx] Fixed a bug that caused footer content to be detected as body text.
  • [docx] Fixed a bug preventing accurate text detection.
  • [docx] Fixed a bug causing footer detection to interfere with page numbering.

SOLID FRAMEWORK 10.0.18950

Solid Framework SDK has been updated.

Note:

Language detection, page orientation detection and OCR character recognition have been improved for non-Latin languages. To include these improvements, please update the required files using traineddata.zip from the Solid Framework downloads.

Improvements: 

  • [docx] Extended the content detection algorithm to use PDF tags to bias output.
  • [office] Improved detection of bound orientation for specific languages and the preferred orientation of each language.
  • [office] Improved content detection of Korean language documents.
  • [docx] Improved the detection of textboxes with similar layout to a two-cell table.
  • [docx] Improved the detection of table header content.
  • [docx] Improved table detection when cells in multiple columns contain only one hyphen.
  • [docx] Improved the detection of line spacing.
  • [docx] Improved detection of textboxes between column breaks.
  • [docx] Improved the detection of grouped objects with similar layout to a table.
  • [docx] Improved the detection of page headers.
  • [json] Implemented the strikethrough property in JSON export.
  • [json] Improved the detection of text bounds.
  • [docx] Improved the detection of tables within columns of text.
  • [docx] Improved the detection of header content after page orientation change within a document.
  • [pptx] Improved detection of character spacing.
  • [office] Improved OCR analysis of non-Latin glyphs.

Bugfixes: 

  • [docx] Fixed a bug preventing the detection of some tables in a certain document.
  • [docx] Fixed a bug causing four characters on a variant color background to be omitted from conversion output.
  • Fixed a bug causing additional text to be included in the target of a URL link.
  • [pptx] Fixed a bug causing a word to be mislocated on a slide.
  • [xlsx] Fixed a bug causing the cells of one row to become merged on a certain document.

SOLID FRAMEWORK 10.0.18816

Solid Framework SDK has been updated.

Improvements: 

  • [docx] Improved the detection of text when the left margin of the scanned document contains noise. 
  • [docx] Improved list detection in PDFs using image bullets.
  • [office] Improved the detection of graphic tables. 
  • [xlsx] Improved column detection of partially bordered tables.  
  • [docx] Implemented internal bookmarks to specific pages in the Word document to match the PDF links.
  • [json] Improved detection of small caps.  
  • [docx] Improved the order detection of overlapping shapes and images.  
  • [docx] Improved detection of column breaks.  
  • [docx] Improved detection of vertical Japanese text. 
  • [docx] Improved detection of borderless tables.  
  • [json] Improved detection of ‘Table of Contents’ bounds.  
  • [json] Improved handling of arbitrary text rotation in JSON export.

Bugfixes: 

  • [json] Fixed an issue causing partial detection of a line of text. 
  • [docx] Fixed an issue causing incorrect merging of separate tables on a page. 
  • [docx] Fixed an issue causing rows of certain table content to become merged.