Intelligence in Document Imaging
TechVision's innovative products grew out of research and development in the areas of document
image analysis as well as automated image data capture. Our research in these areas continues to
lead to exciting product development.
Below are brief descriptions of TechVision AccuForm recognition engine,
the core technology used in our solution products, and is also made available for developers and
system integrators who want to add intelligent forms processing capabilities into their own
Below are brief descriptions of TechVision AccuForm recognition engine, the core technology used in our solution products, and is also made available for developers and system integrators who want to add intelligent forms processing capabilities into their own applications.
In AccuForm, form recognition and identification are based on advanced neural network and fuzzy
logic technologies that require neither user input nor pre-knowledge about forms.
Form documents are identified via global fuzzy analysis that compares similarities of form GLS
(Graphics & Line Structure) between the document forms and a target reference.
^Top Form Registration
Once the form has been identified, a complete registration process must be carried out to find
the positional difference between the reference form and the input form. In AccuForm, the
following registration parameters are detected automatically for each incoming form, which
requires neither user input nor pre-printed anchor marks on forms:
^Top Data Image Extraction
After form registration parameters are detected for each input form, data
images can be extracted by applying geometric transformations on each data
region defined on the reference form:
This recognition module is used for recognizing optical marks in extracted check box zones.
Typical application areas are in questionnaires, educational tests and in reporting or ordering sheets,
where the documents to be processed are form-like and filled with check marks for selected choices.
The frame of optical mark zone can be a rectangle, a hexagon, a circle, or an ellipse. It can be filled
in by any method (x, tick, hatching, etc.). The frame may be visible or invisible (scanner dropout) in the
image sent for recognition.
AccuForm has a reliable built-in bar code engine that can recognize
most popular bar code symbologies used in business documents, including
Code 39, Code 128, Codabar, Int-2of5, UPC-A, UPC-E, EAN-8 and EAN-13.
Linear Mark Recognition
Linear mark is another special data type suitable for automated data recognition, in which a user-drawn vertical bar crosses a horizontal line to indicate where the user feels within a given range. The value of the linear mark is defined as the length from the beginning of the horizontal line to the user-drawn mark position over the total length of the line, which is automatically captured and linearly interpreted as a floating point value between 0.0 and 1.0.
Below are examples of using linear marks in customer survey and medical treatment forms:
In the above example, the user-drawn mark value is recognized as 0.8, which indicates that the customer feels 80% satisfied with the provided service.
In this example, a patient feels a "0.6 headache" on a given scale of none (0.0) to strong headache (1.0).