SOLID FRAMEWORK 10.0.17926

Solid Framework SDK has been updated.

Feature Update: 

  • Enable support of a licensed installation of IRIS. 

Bugfixes: 

  • [docx] Fixed an issue preventing successful conversion of a file. 
  • [docx] Fixed an issue preventing one image of many from being correctly rendered. 
  • [docx] Fixed an issue preventing successful conversion of a file on Linux operating systems only. 
  • [docx] Fixed an issue preventing the detection of a Table of Contents due to the text order of the file. 
  • [pdf] Fixed an issue preventing PDFA-2b validation of a document when certain font combinations are installed.  

Office Fidelity: 

  • [docx] Improved detection of breaks on scanned documents containing Arabic text. 
  • [office] Streamlined optical character recognition workflow of large documents containing non-standard encoded text. 
  • [office] Allowed page snapshot deletion where annotations exist. 
  • [office] Improved processing of non-standard encoded characters to unicode. 
  • [office] Improved detection of combined characters. 
  • [office] Improved detection of Arabic diacritic characters. 
  • [office] Improved detection of transparent watermarks over scanned pages. 
  • [docx] Improved detection of Table of Contents. 
  • [rtf] Improved detection of characters when converting to RTF. 
  • [docx] Improved detection of shapes when converting to DOCX. 
  • [docx] Improved detection of serial images that contain underlines.  

Security 

  • A limited number of third-party libraries have been updated to include the latest security fixes.

SOLID FRAMEWORK 10.0.17650

Solid Framework SDK has been updated.

Feature Update: 

  • Enabled language detection for Arabic language documents. 

Bugfixes: 

  • [docx] Improved detection of bold font style on a specific document. 
  • [docx] Resolved a page count issue on macOS. 
  • [docx] Improved detection of small caps font style effect. 

Office Fidelity: 

  • [office] Improved whitespace detection between Arabic words. 
  • [docx] Improved header and footer detection targeting one-page documents with a horizontal line below body content. 
  • [docx] Improved detection of tables that overlap with body content. 
  • [docx] Improved detection of word order with right-to-left aligned text. 
  • [docx] Improved detection of Arabic characters when the text layer does not match the character glyphs. 

Security 

  • A limited number of third-party libraries have been updated to include the latest security fixes.   

SOLID FRAMEWORK 10.0.17490

Solid Framework SDK has been updated.

Improvements: 

  • Improvements to pdf rendering clipping algorithms: 
    • Added method for detecting a polygon that had degenerated into a polyline. 
    • Added methods for detecting self-intersections of polygon contours. 
    • Added method for automatic error detection in the clipping algorithm. 
    • Added method for changing the direction of a polygon. 
    • Support for winding/alternate rules has been added prior to polygon clipping. 
    • Rewrote the method for finding polygon intersections. 
    • Rewrote the method for adding the found intersections of polygons to the polygon structures. 
    • All clipping algorithm methods have been updated to operate with the same tolerance. 
    • Improved the accuracy of determining the type of vertices found near a polygon. 
    • Improved processing of polygon edges located very close to each other. 
  • Header and footer improvements specifically targeting one-page documents: 
    • Improved the exclusion of graphic lines, images and labels. 
    • Improved the exclusion of large tables, images and footnotes. 
    • Improved the exclusion of headings and titles. 
    • Improved the exclusion of images or text located close to other page content. 

Bugfixes: 

  • [pdf] Fixed an issue preventing conversion with PDF/A-1a and A-2b standards due to a specific page structure. 
  • [docx] Resolved an issue with CAD source content where vertical text around architectural details is displaced. 
  • [docx] Fixed an issue resulting in the top of characters in one line of text to be clipped. 
  • [docx] Resolved an issue causing five rows of a table to be incorrectly merged. 
  • [docx] Fixed an issue causing two columns of a table to be merged into one. 
  • [docx] Fixed an issue in the clipping engine preventing successful rendering of a specific pdf. 
  • [docx] Fixed an issue causing the last line of a right-to-left direction paragraph to have a hanging indent. 
  • [docx] Resolved an issue where right-to-left text was incorrectly left aligned. 
  • [docx] Fixed an issue preventing the rendering of leader (tabbing) characters in table of contents containing right-to-left text. 
  • [docx] Fixed incorrectly wrapped right-to-left text causing a page overflow issue.  
  • [docx] Resolved an indentation and alignment issue at list items in a right-to-left document. .  

Office Fidelity: 

  • [docx] Improved table of contents detection by optimizing sections across pages. 
  • [office] Improved GNSE detection to independently recognize glyphs and unicodes in separate stages. 
  • [office] Improved support for Arabic diacritical marks using analysis of scale and character spacing. 
  • [docx] Improved border line termination in specific table cases 
  • Improved the left margin alignment of a document. 
  • [docx] Resolved an issue causing text misplacement when viewed on Office 2016 only. 
  • [docx] Fixed a hybrid table detection issue resulting in two additional columns. 
  • [docx] Fixed an issue causing line shapes to be rendered as underlines. 
  • [pptx] Fixed an issue resulting in a block of text in a table to be incorrectly divided into six rows. 
  • [docx] Resolved an issue that caused one table to be incorrectly split into two tables. 
  • [docx] Resolved an issue causing a textbox to be divided in two parts. 
  • [docx] Improved Arabic language character unicode detection 
  • [docx] Improved alignment and indentation of content with right-to-left text direction.

SOLID FRAMEWORK 10.0.17360

Solid Framework SDK has been updated.

Feature Updates: 

  • Added the ability to set the text recovery language to any language when the corresponding Tesseract traineddata file is available. 

Bugfixes: 

  • [docx] Fixed an issue that caused an extra paragraph to be inserted after a specific graphic group. 
  • [docx] Resolved an issue preventing the detection of a first page header. 
  • [docx] Fixed an issue that caused an extra space to be inserted incorrectly affecting content layout. 
  • [docx] Resolved an issue preventing the detection of a small portion of red background color on a graphic. 
  • [docx] Fixed an issue converting invisible text as visible when OCR detection results in very few words. 
  • [docx] Fixed an issue preventing successful conversion of a file on Linux arm64 operating systems only. 
  • [docx] Fixed an issue preventing successful GNSE detection of text when Roboto font is used. 
  • [docx] Resolved an issue preventing the successful conversion of a file containing detailed images. 
  • [docx] Fixed an issue that caused header content to be shifted down one line on certain pages of a document. 
  • [docx] Resolved an issue that caused a document to be incorrectly clipped diagonally causing content loss.  
  • [docx] Fixed an issue preventing text from being detected on certain pages of a specific scanned document. 
  • [docx] Resolved an issue that caused a font style change in the empty space following an underlined descending letter. 

Office Fidelity: 

  • [docx] Improved detection of page X of Y page number format. 
  • [docx] Improved consistency of detection of multi-line headers. 
  • [docx] Improved detection of repeated table headers as body content instead of as header content. 
  • [docx] Improved the recovery of Korean text when GNSE is enabled. 
  • [docx] Improved borderless table detection. 
  • [docx] Improved detection of alternating headers supporting odd and even pages. 
  • [docx] Improved detection of alternating footers on odd and even pages. 
  • [docx] Improved table detection. 
  • [docx] Improved detection of merged cells. 
  • [docx] Improved detection of self intersecting glyph outlines when GNSE is enabled. 
  • [docx] Improved recognition of multi column layouts. 
  • [docx] Extend our character detection for non standard encoding to use tesseract OCR when required. 

Security: 

  • Security scanning of our codebase is automated as part of our compilation process. 

 

SOLID FRAMEWORK 10.0.17268

Solid Framework SDK has been updated.

Feature Update:

Our Windows releases are now compiled with Visual Studio 2022 build tools.

PDF to .DOCX conversion improvements include:

  1. Fixed an issue preventing the underline style from applying to all characters of a word in a specific document.  
  2. Improved our compliance to the OpenXML standard when handling malformed hyperlinks. 
  3. Improved our compliance to the OpenXML standard after text layout improvements. 
  4. Improved our detection of list hierarchy. 
  5. Resolved a page count issue on macOS caused by the Helvetica font. 
  6. Fixed an issue causing a false link to be detected. 
  7. Resolved an issue preventing the detection of a page number in the footer on specific layout styles.  
  8. Improved inconsistent footer detection on specific layout styles.
  9. Improved detection and implementation of inline small graphic groups. 
  10. Improved detection of headers and footers located unusually far from the page edge. 
  11. Improved detection of underline style for descending letters g, p and y. 
  12. Improved detection of different odd and even headers and footers. 
  13. Reduced false detections of section title body content as headers. 
  14. Improved detection of headers when watermark graphics cross the header content. 
  15. Improved detection of borderless tables. 

PDF to PDF/A conversion improvements include:

  1. Fixed an issue with annotations and fields preventing verification of compliance with PDF/A-2a and 2b standards. 
  2. Resolved an issue preventing conversion with PDF/A-1a standards due to the limit for real values.

SOLID FRAMEWORK 10.0.17072

Solid Framework SDK has been updated.

PDF to .DOCX conversion improvements include:

  1. Fixed an issue causing an unnecessary table row merge
  2. Fixed an issue preventing conversion on Unix for files with long paths filepaths.
  3. Improved scanned page detection to adjust when OCR is enabled.
  4. Fixed an issue interfering with the character placement of certain type 3 font characters.
  5. Resolved an issue resulting in inconsistent OCR of similar scanned pages.
  6. Fixed an issue disrupting the font style assigned to a list indent.
  7. Improved bordered table detection of scanned documents.
  8. Improved header detection.
  9. Resolved an issue preventing conversion of a specific pdf on macOS only.
  10. Fixed a memory leak.
  11. Improved detection of underline style when the graphic line partially crosses a character.
  12. Resolved an issue causing a conversion error on certain Windows 7 machines.
  13. Improved header detection when the body contains a repeated table header.
  14. Resolved an issue resulting in the last character in a text box being clipped.
  15. Improved detection of hyperlinks.
  16. Improved borderless table detection.
  17. Improved footer detection.

PDF to .HTML conversion improvements include:

  1. Fixed an issue stopping a bold font style from being applied.
  2. Fixed an issue resulting in the loss of the background color of a textbox.
  3. Improved text placement in text boxes.

PDF to Office conversion improvements include:

  1. Enabled the loading of the freetype cmap table to ensure correct character detection.
  2. Enabled support for relative path file links in conversion output.

PDF to .PPTX conversion improvements include:

  1. Fixed unnecessary slide rotation.
  2. Improved missing slide auto-rotation.

SOLID FRAMEWORK 10.0.16864

Solid Framework SDK has been updated.

Breaking Change: 

PowerPoint slide extraction of a PowerPoint ‘handout view’ now respects the pdf page indexes when selecting a ‘page range’ (subset) of the pdf. Previously ‘page range’ respected the extracted slide index. 

PDF to .DOCX conversion improvements include:

  1. Optimized predefined cmap loading to improve rendering of certain pages of a file
  2. Fixed an issue interfering with header detection.
  3. Fixed an issue preventing a portion of a certain borderless table from being detected.
  4. Resolved issue blocking optical character recognition of certain pages of an internal document.
  5. Fixed an issue halting optical character recognition after the third page of a specific document.
  6. Fixed an issue resulting in a false table being detected.
  7. Improved detection of text background color.
  8. Resolved multiple issues preventing the successful conversion of a specific pdf on Linux only.
  9. Improved header detection.
  10. Resolved an issue preventing the successful conversion of a specific pdf on Linux and macOS only.
  11. Improved borderless table detection.
  12. Improved detection of multiple line spacing in a text box.
  13. Improved column detection in tables.
  14. Resolved an internal issue causing the separation of table content in a specific file.
  15. Updated included libraries with potential vulnerabilities.

PDF to Office conversion improvements include:

  1. Fixed an issue where local font detection was scheduled more often than necessary.
  2. Resolved an issue causing conversion failure when input files contained special.

SOLID FRAMEWORK 10.0.16570

Solid Framework SDK has been updated.

PDF to .DOCX conversion improvements include:

  1. Fixed an issue that generated an ellipse shape as a square shape
  2. Fixed an issue causing only the top portion of an embedded chart to be converted
  3. Fixed an issue causing an image near the top margin to be clipped incorrectly
  4. Fixed an issue where only a portion of a logo image was rendered on conversion 
  5. Fixed an issue causing content of a table to be partially excluded
  6. Fixed an issue where the detection of a watermark resulted in the cells of a table to be incorrectly merged
  7. Resolved an issue preventing the successful conversion of a specific source pdf on OSX and Linux
  8. Resolved an issue preventing the successful conversion of a specific source pdf on OSX only
  9. Resolved an issue preventing the successful conversion of a certain document on Linux only
  10. Resolved an issue preventing the successful conversion of a certain document on all platforms
  11. Resolved a z-order issue rendering a semi-transparent layer on top of an image
  12. Resolved an issue resulting in the detection of only a portion of a table  
  13. Resolved an issue causing a table to be clipped incorrectly, which excluded content
  14. Improved list detection
  15. Improved detection of graphic lines
  16. Improved clipping of partially coincident polygons
  17. Improved the vertical metrics for Apple Gothic on certain OSX files
  18. Improved detection of graph content improves the output of many files
  19. Improved font scaling in files converted on macOS where Microsoft Office is installed
  20. Improved image mask creation
  21. Improved conversion of inline graphic groups
  22. Improved the detection of headers to decrease page overflow errors
  23. Improved the detection of footers to decrease page overflow errors
  24. Improved detection of non-repeated footers
  25. Improved detection of headers and footers when the content shifts horizontally or vertically between pages

PDF to .PPTX conversion improvements include:

  1. Improved the z-order detection of filled text boxes with a border 

Solid Framework 10.0.16278

Solid Framework SDK has been updated.

PDF to .DOCX conversion improvements include:

  1. Improve the scale/spacing correction for certain RTF format breaks
  2. Optimize conversion of paragraph height for empty paragraphs
  3. Improve conversion to prevent overflow of text
  4. Fixed a bug that could break the link to a previous footer upon increase in page number digit
  5. Fixed a bug causing text with a paragraph style background color to be rendered as a text box
  6. Fixed a bug preventing the detection of the first page number of a certain document as a footer
  7. Fixed a bug rendering the location of the content of a pin incorrectly
  8. Fixed a bug causing text and graphics to be rendered in the wrong order
  9. Support the ability to disable table detection on conversion
  10. Improve the detection of spacing in lists
  11. Fixed a bug causing one line of text in a specific document to be garbled
  12. Improve detection of borderless tables
  13. Fixed a bug that could cause certain files containing annotations to be detected with unreadable content
  14. Improve the detection and order of graphics for isolated groups over background groups

PDF to .PPTX conversion improvements include:

  1. Improve detection of glyph width
  2. Improve conversion to prevent overflow of text

Solid Framework 10.0.16078

Solid Framework SDK has been updated.

PDF to .DOCX conversion improvements include:

  1. Resolved a structured exception handling issue in post process on certain files
  2. Fixed a bug that produced an additional page of content on conversion
  3. Fixed a bug that could result in a crash or error when embedded font ‘OXQFLY + Franklin Gothic Demi’ was extracted
  4. Fixed a bug disrupting the detection of z-order with architecture CAD pins
  5. Improved the conversion of right to left aligned text
  6. Improved scanned page detection
  7. Improved detection and importing of columns with graphics
  8. Improved the detection of right to left paragraphs
  9. Improved detection of column alignment based on paragraph formatting
  10. Improved detection of z-order when document contains graphic text
  11. Improved detection of graphic header elements
  12. Improved detection of z-order when document contains graphics and horizontal lines
  13. Improved the conversion of right to left aligned dates and numerical content

PDF to .HTML conversion improvements include:

  1. Fixed a bug that caused certain ordinal numbers to be detected incorrectly

PDF to .XLSX conversion improvements include:

  1. Improved the conversion of right to left aligned text
  2. Improved the detection of mixed left to right and right to left aligned text

PDF to .PPTX conversion improvements include:

  1. Fixed a bug that was misaligning graphic elements with table text
  2. Improved detection of z-order when document contains shapes and lines

PDF to Image conversion improvements include:

  1. Fixed a bug that caused highlighting to obscure text