Solutions Document Management Document Conversion

Document and Content Management related applications often require existing documents to be imported into the system. Sometimes they are already in a compatible format, sometimes not. Quick and accurate integration of relevant documents into your new application can be critical to its success. Ockham provides an efficient and flexible service for document conversion. Formats include data formats, text formats, XML, PDF, SGML, HTML, Microsoft Reader, Adobe Reader and Palm Reader e-Book formats. Processes include scanning, digitization, indexing, key wording, conversion and content security. We integrate capabilities and experience in information architecture, data conversion and metadata development. This enables us to create solutions for customers needing to create large-scale web-content.

XML transformation

Ockham converts media-rich HTML, SGML, proprietary formatted documents and unstructured content to XML. We call this our content architecture service. We strongly advocate the use of XML publishing, as this rich format provides great flexibility in the modification, adaptation and re-use of on-line documents. XML is being embraced as the core technology for the next generation Internet. It will fundamentally change the way we receive and search for information. As part of its service, Ockham provides complete documentation known as the tag library. This details proper usage, syntax and client specific definition of each DTD element, attribute and entity. It also includes a graphical representation of the entire document hierarchy.

PDF conversion

Ockham creates full-resolution PDF files which are suitable for printing or on-line delivery. PDF files are created from any source material to any specifications. Navigational aids, indexes, security and links to external resources are included. Using a unique process we can combine PDF conversions of text and images combined on a single page.

Text conversion

Hard copy or PDF files are scanned using Optical Character Recognition (OCR). Complex documents with non-standard or handwritten characters are entered manually, using double or triple keying techniques.

Data conversion

Ockham can convert either electronic files or hard copy documents. The documents are indexed, structured, edited and laid out in various formats such as Microsoft Access, Microsoft Excel, CSV, and tab or pipe delimited formats.


Items in this article:

XML transformation

PDF conversion

Text conversion

Data conversion

Subscribe here for our free WEBISO and STARJET
Workshops!