Chemnitz University of Technology, Germany, Institute for Print & Media Technology
How can historic document scans be automatically coded and processed? What options are available if optical character recognition (OCR) is not possible? One answer is given by the Venod system. A document-specific font is created in a vector format (SVG) and the document is re-typeset with these fonts, creating a scalable, compressed rendering with special attention to all symbols printed on the pages. An XML output format allows the automatic conversion to different media, preserving all textual details. Some of the challenges and future possibilities of this method will be addressed in this session.