SOLID FRAMEWORK 10.0.17360

Solid Framework SDK has been updated.

Feature Updates: 

  • Added the ability to set the text recovery language to any language when the corresponding Tesseract traineddata file is available. 

Bugfixes: 

  • [docx] Fixed an issue that caused an extra paragraph to be inserted after a specific graphic group. 
  • [docx] Resolved an issue preventing the detection of a first page header. 
  • [docx] Fixed an issue that caused an extra space to be inserted incorrectly affecting content layout. 
  • [docx] Resolved an issue preventing the detection of a small portion of red background color on a graphic. 
  • [docx] Fixed an issue converting invisible text as visible when OCR detection results in very few words. 
  • [docx] Fixed an issue preventing successful conversion of a file on Linux arm64 operating systems only. 
  • [docx] Fixed an issue preventing successful GNSE detection of text when Roboto font is used. 
  • [docx] Resolved an issue preventing the successful conversion of a file containing detailed images. 
  • [docx] Fixed an issue that caused header content to be shifted down one line on certain pages of a document. 
  • [docx] Resolved an issue that caused a document to be incorrectly clipped diagonally causing content loss.  
  • [docx] Fixed an issue preventing text from being detected on certain pages of a specific scanned document. 
  • [docx] Resolved an issue that caused a font style change in the empty space following an underlined descending letter. 

Office Fidelity: 

  • [docx] Improved detection of page X of Y page number format. 
  • [docx] Improved consistency of detection of multi-line headers. 
  • [docx] Improved detection of repeated table headers as body content instead of as header content. 
  • [docx] Improved the recovery of Korean text when GNSE is enabled. 
  • [docx] Improved borderless table detection. 
  • [docx] Improved detection of alternating headers supporting odd and even pages. 
  • [docx] Improved detection of alternating footers on odd and even pages. 
  • [docx] Improved table detection. 
  • [docx] Improved detection of merged cells. 
  • [docx] Improved detection of self intersecting glyph outlines when GNSE is enabled. 
  • [docx] Improved recognition of multi column layouts. 
  • [docx] Extend our character detection for non standard encoding to use tesseract OCR when required. 

Security: 

  • Security scanning of our codebase is automated as part of our compilation process. 

SOLID FRAMEWORK 10.0.17268

Solid Framework SDK has been updated.

Feature Update:

Our Windows releases are now compiled with Visual Studio 2022 build tools.

PDF to .DOCX conversion improvements include:

  1. Fixed an issue preventing the underline style from applying to all characters of a word in a specific document.  
  2. Improved our compliance to the OpenXML standard when handling malformed hyperlinks. 
  3. Improved our compliance to the OpenXML standard after text layout improvements. 
  4. Improved our detection of list hierarchy. 
  5. Resolved a page count issue on macOS caused by the Helvetica font. 
  6. Fixed an issue causing a false link to be detected. 
  7. Resolved an issue preventing the detection of a page number in the footer on specific layout styles.  
  8. Improved inconsistent footer detection on specific layout styles.
  9. Improved detection and implementation of inline small graphic groups. 
  10. Improved detection of headers and footers located unusually far from the page edge. 
  11. Improved detection of underline style for descending letters g, p and y. 
  12. Improved detection of different odd and even headers and footers. 
  13. Reduced false detections of section title body content as headers. 
  14. Improved detection of headers when watermark graphics cross the header content. 
  15. Improved detection of borderless tables. 

PDF to PDF/A conversion improvements include:

  1. Fixed an issue with annotations and fields preventing verification of compliance with PDF/A-2a and 2b standards. 
  2. Resolved an issue preventing conversion with PDF/A-1a standards due to the limit for real values.

SOLID FRAMEWORK 10.0.17072

Solid Framework SDK has been updated.

PDF to .DOCX conversion improvements include:

  1. Fixed an issue causing an unnecessary table row merge
  2. Fixed an issue preventing conversion on Unix for files with long paths filepaths.
  3. Improved scanned page detection to adjust when OCR is enabled.
  4. Fixed an issue interfering with the character placement of certain type 3 font characters.
  5. Resolved an issue resulting in inconsistent OCR of similar scanned pages.
  6. Fixed an issue disrupting the font style assigned to a list indent.
  7. Improved bordered table detection of scanned documents.
  8. Improved header detection.
  9. Resolved an issue preventing conversion of a specific pdf on macOS only.
  10. Fixed a memory leak.
  11. Improved detection of underline style when the graphic line partially crosses a character.
  12. Resolved an issue causing a conversion error on certain Windows 7 machines.
  13. Improved header detection when the body contains a repeated table header.
  14. Resolved an issue resulting in the last character in a text box being clipped.
  15. Improved detection of hyperlinks.
  16. Improved borderless table detection.
  17. Improved footer detection.

PDF to .HTML conversion improvements include:

  1. Fixed an issue stopping a bold font style from being applied.
  2. Fixed an issue resulting in the loss of the background color of a textbox.
  3. Improved text placement in text boxes.

PDF to Office conversion improvements include:

  1. Enabled the loading of the freetype cmap table to ensure correct character detection.
  2. Enabled support for relative path file links in conversion output.

PDF to .PPTX conversion improvements include:

  1. Fixed unnecessary slide rotation.
  2. Improved missing slide auto-rotation.