What is the best way to click a picture of an ID and extract text from it for further processing?
I am creating a mobile app, that reads bar codes + can click pictures of Driving License etc. and extract text from the ID.
While the bar-code scanning part is easy, I'm confused about the OCR part. Any suggestions on how to achieve this?
Several options here: http://sourceforge.net/directory/?q=OCR&sort=score
It would helped if you told us which platform(s) you are targeting with the mobile app and which coding tools you will be using. I'll assume you will code the app with standard tools (Xcode, Eclipse, etc.). I suggest to find an open source OCR library which can be used by the application. There are lot's of OCR libraries you can use to add to your app. You will also need to build some form of parsing after the OCR had returned the text string to eliminate the "garbage" strings which will be recognised from the picture. The IDs usually have a "start" stings ex: ">>>>") which you can use to recognise that part of the text.
There are some is freeware software out there which should be easy to integrate into what you are doing. Hope that helps, BTW I found a few with a quick google search. SO it should be easy to find one which works for you.
Exporting to HTML is an easy way to get your InDesign content into web-ready form. When you export content to HTML, you can control how text and images are exported. InDesign preserves the names of paragraph, character, object, table, and cell styles applied to the exported contents by marking the HTML contents with CSS style classes of the same name. Using Adobe Dreamweaver or any CSS-capable HTML editor, you can quickly apply formatting and layout to the contents.
What gets exported
InDesign exports all stories, linked and embedded graphics, SWF movie files, footnotes, text variables (as text), bulleted and numbered lists, internal cross-references, and hyperlinks that jump to text or web pages. Tables are also exported, but certain formatting, such as table and cell strokes, is not exported. Tables are assigned unique IDs, so they can be referenced as Spry data sets in Dreamweaver. Placed audio and h.264 video files are enclosed in HTML5 and tags.
What doesn’t get exported
InDesign does not export objects you draw (such as rectangles, ovals, and polygons), hyperlinks (except for links to web pages and links applied to text that jump to text anchors in the same document), pasted objects (including pasted Illustrator images), text converted to outlines, XML tags, books, bookmarks, SING glyphlets, page transitions, index markers, objects on the pasteboard that aren’t selected and don’t touch the page, or master page items (unless they’re overridden or selected before export).