SOLID FRAMEWORK 10.0.18370

Improvements: 

  • [docx] Improved the detection of standard office bar charts and variants. 
  • [docx] Improved the detection of Chinese language.  
  • [docx] Improved the optical character recognition preprocessing of vector text.  
  • [docx] Improved the column detection of left to right aligned text.  
  • [docx] Improved the stability of graphic color detection.  
  • [docx] Improved the detection of header content.  
  • [docx] Improved detection of white text located on a dark background.   
  • [docx] Improved handling of text where the text and background colour match.  
  • [docx] Improved table detection.  
  • [docx] Improved the detection of diagrams.  
  • [docx] Improved detection of black text located on a grey background.    
  • [office] Improved language and page orientation detection. 

Bugfixes: 

  • [docx] Fixed an issue causing Latin characters in a Chinese document to be misplaced. 
  • [docx] Fixed an issue where a large graphic element caused text recovery failure. 
  • [pdf] Fixed an issue preventing the marked property from being retained. 
  • [docx] Fixed an issue preventing the detection of the correct bounds of a graphic element. 
  • [docx] Fixed a performance issue where dense vector graphics prevented successful optical character recognition of a file. 
  • [docx] Fixed an issue causing conversion delay of complex one-page document. 
  • [docx] Fixed a bug preventing the rendering of the first page of a detected Table of Contents. 

SOLID FRAMEWORK 10.0.18270

Solid Framework SDK has been updated.

Improvements: 

  • [pdf] Introduce option to save pdf page orientation as tagged data instead of auto rotating.
  • [office] Improved algorithm for averaging text properties of a paragraph that contains unicode groups to describe a single Arabic glyph. 
  • [docx] Improved list detection. 
  • [docx] Improved the text line assembly of Arabic content with diacritics. 
  • [office] Improved detection of small caps text. 
  • [office] Improved detection of Arabic language when minimal English text is near the Arabic language. 
  • [docx] Improved detection of header content. 
  • [docx] Improved z-order placement of graphic in conversion output. 
  • [docx] Improved detection of narrow columns on borderless tables. 
  • [docx] Improved conversion result of self-interesecting glyph outlines. 
  • [pdf] Improved tag support of various layout options. 
  • [office] Improved page margin calculation to be multiples of 1/4 inch for the imperial measurement system and 1/4 centimetre for the metric measurement system. 
  • [docx] Improved the rendering of Type 3 fonts. 
  • [docx] Improved use of tab stops to space content on a single line.  
  • [docx] Improved conversion result when encoding of original pdf contains large, broken text areas.  
  • [json] Support detection of table headers. 
  • [json] Support rectangle span element. 
  • [json] Improved detection of even-odd page header bounds. 
  • [json] Improved detection of table headers. 
  • [json] Support XObject ID for annotation graphic groups and textboxes. 
  • [json] Improved detection of span bounds for line with small caps.

Bugfixes: 

  • [pdf] Fixed a compression algorithm issue that caused the corruption of data during conversion of a specific file. 
  • [docx] Fixed an issue causing the background of an image to become transparent.  

Security 

  • A limited number of third-party libraries have been updated to include the latest security fixes.

SOLID FRAMEWORK 10.0.18108

Solid Framework SDK has been updated.

Improvements: 

  • [Office] Improved algorithms required to layout right to left body paragraph text.  
  • [Office] Improvements to right to left text character matching and diacritic handling. 
  • [Office] Improved Tatweel (Arabic) language detection. 
  • [Office] Optimized detection performance for non-standard encoded characters using Tesseract.  
  • [docx] Improved detection of footnote text. 
  • [docx] Improved header detection. 
  • [docx] Column detection improvements. 
  • [Office] Improved rendering of specific Type3 font. 

Bugfixes: 

  • [docx] Fixed an issue preventing successful conversion of a file. 

Misc: 

  • All projects are now compiled using C++17 language features