There are loads of great features in Solid Framework!

To make it easier for you to find, we put links to documents that describe some of these features on this page.

Document Conversion Features

Our Solid Framework SDK reconstructs content from PDF files into reusable data. We automatically recognise and replace soft hyphens, de-skew scanned documents, recreate PowerPoint presentations from handouts and much more.

Convert PDF files to Microsoft Word documents*

+ Easily convert PDF files into fully editable Microsoft® Word documents. Convert scanned PDFs to well formatted, editable Word documents using Solid OCR. Advanced options for converting or removing headers and footers; reconstruct bordered and borderless tables as table objects, with formatting, in Word; PDF form fields are recognized and converted into text boxes for easy editing in exact reconstruction mode; convert text from your PDF, no matter the orientation.

+ Choose how to reconstruct your document:
Flowing reconstruction (recover page layout, columns, formatting, graphics and preserve text flow)
Continuous reconstruction (detect layout and columns but only recover formatting, graphics and text)
Exact reconstruction (recover exact page presentation using text boxes in Word)

Extract tables from PDF to Microsoft Excel*

+ Extract and re-use tables from PDF files into Microsoft Excel worksheets and .xlsx or .csv.

Convert PDF files into reflowed HTML*

+ Use advanced document reconstruction to convert PDF to formatted W3C-compliant XHTML. Formats columns. Remove headers, footers and images.

Convert PDF files to Microsoft PowerPoint*

+ Convert each page in your PDF to a slide in PowerPoint and then edit. Features also include reconstruction of a PowerPoint presentation from a “handout” style PDF .

Convert PDF files into data (.CSV, MySQL, MsSQL, and minimal Excel)*

+ Extract data into common formats

Convert PDF to reflowed plain text

+ Extract flowing text content from PDF. Header, footer and column options available.

Extract images from PDF files

+ Just extract images

*Document conversion features included in Professional and Professional+OCR only

Solid CGM and Solid OCR

Solid Framework has the ability to reconstruct both PDFs that contain embedded text or which are scanned. Detailed information on Solid OCR can be found here.

Document image clean up*

+ Features include automatic de-skewing of scanned documents.

Image segmentation*

Image compression*

OCR (Optical charater recognition with Solid OCR

+ Optical Character Recognition (OCR)*

*Solid CGM and Solid OCR included in Professional+OCR only

Solid NSE (Non Standard Encoding)

Solid Framework has the ability to recover PDF content that is encoded in a non-standard way (NSE).
We have put together a couple of videos that explain what NSE is, and how Solid Framework compares with other products.
Click on the links to see the videos

Non Standard Encoding – what it is and Solid Framework does a great job with badly encoded text.
NSE and why it beats OCR.

Logo glyphs as vector graphics

Automatic correction of font styles

Icon font glyphs as vector graphics

Automatic correction of common symbolic fonts

Barcode fonts as vector graphics

Automatic correction of all ligatures

AllCap font detection and correction

SmallCap font detection and correction

Automatic correction of alphanumeric glyphs

Automatic correction of ligatures with no Unicode equivalent

NSE features included in all versions of Solid Framework

Core Model API Features

Access all reconstructed text blocks from the PDF as Unicode through API*

Obtain original PDF bounds for text blocks in the reconstructed core model through API*

*Core model features included in Professional and Professional+OCR only

PDF/A Features

Robust PDF/A archival creation, conversion and validation with PDF/A-1b, PDF/A-2b, PDF/A-2u, PDF/A-3b and PDF/A-3u ISO 19005 formats supported.

PDF/A Validation (with and without detailed report)*

+ Validate PDF/A. Verify ISO 19005-1 and ISO 19005-2 compliance for existing PDF documents and repair common issues.

Convert PDF to PDF/A*

+ Convert existing normal or image PDF files into fully searchable ISO 19005-1 and ISO 19005-2 compliant archivable documents.

Convert scanned PDF to searchable text (OCR required)**

+ Add searchable text layer. Easy for indexing and archiving legacy and paper documents.

PDF/A validation (simple pass or fail)

Create PDF/A compliant files

+ Create PDF/A documents which are fully compliant with current ISO archiving standards.

* PDF/A features included in Professional and Professional+OCR only
** PDF/A feature included in Professional+OCR only

PDF Rendering Features

Render PDF pages as bitmaps (for thumbnails)

*PDF rendering features included in all versions of Solid Framework

PDF Editing Features

Browse content and internal structure of PDF files

Modify viewer preferences and PDF document info

+ Acrobat Reader Settings. Set default view including page layout, initial zoom and page thumbnail view.

Secure PDF files: encryption, permissions and passwords

+ You can restrict who can view, edit, copy, print or add comments to your document. 256 AES Encryption. This new high level encryption is supported in Adobe® Acrobat® 9 or higher. 128-bit RC4 or AES encryption algorithms also supported.

*PDF editing features included in all versions of Solid Framework