Augmented Paper Form Translation
Total Page:16
File Type:pdf, Size:1020Kb
University of Cape Town Honours project Augmented Paper Form Translation Author: Supervisor: Rickert Mulder Prof. Gary Marsden Category Chosen Total 1 Requirement Analysis and Design 15 20 2 Theoretical Analysis 0 25 3 Experiment Design and Execution 10 20 4 System Development and Implementation 15 15 5 Results, Findings and Conclusion 10 20 6 Aim Formulation and Background Work 10 15 7 Quality of Report Writing and Presentation 10 10 8 Adherence to Project Proposal and Quality of Deliverables 10 10 9 Overall General Project Evaluation 0 10 Total Marks 80 October 31, 2011 Abstract There are many inefficiencies associated with paper based data collection. While the world is moving on to more efficient digital data capture systems, a shortage of skilled technicians prevents the developed world from adopting such systems. Existing systems that attempt to enable the transition, fall short in terms of usability. To address this issue, a web based system was proposed that recognises existing paper forms, allows users to correct any inaccuracies, and subsequently produces web accessible versions for easy data capture. A formative evaluation of the interface found it to be very simple and easy to use. The recognition system was however, only a proof of concept and is not suitable for general use. Contents 1 Introduction 4 1.1 Proposed solution . .4 1.2 Ethics . .5 1.3 Intellectual property . .5 1.4 Outline . .5 2 Background 6 2.1 Context of form processing in the developing world . .6 2.1.1 NGO data processing needs . .6 2.1.2 Existing systems . .7 2.1.3 Summary . .8 2.2 Image processing . .8 2.2.1 Filtering . .9 2.2.2 Binary morphology . .9 2.2.3 Edge detection . 10 2.2.4 Thresholding . 10 2.2.5 Hough transforms . 11 2.2.6 Connected components . 11 2.2.7 Contour finding . 12 2.2.8 Polygonal approximation . 12 2.3 Interaction design . 14 2.3.1 Understanding users . 14 2.3.2 Design . 15 2.3.3 Evaluation . 20 2.3.4 Conceptual model extraction . 20 2.3.5 Heuristic Evaluation . 21 2.3.6 Cognitive walkthroughs . 21 2.3.7 Pluralistic walkthroughs . 21 3 Design 22 3.1 Design considerations . 23 3.1.1 Platform . 23 3.2 Use cases . 24 3.3 Form recognition engine . 25 3.3.1 Form analysis . 25 3.3.2 Design . 27 3.4 Interface design . 28 3.4.1 Common translation errors . 28 3.4.2 Technology probe . 28 1 3.4.3 Interface design . 28 4 Implementation 35 4.1 Recognition engine . 35 4.1.1 Libraries . 35 4.1.2 Preprocessing . 36 4.1.3 Segmentation . 36 4.1.4 Data structures . 39 4.1.5 Parsing . 40 4.1.6 Serialisation . 41 4.1.7 Evaluation . 42 4.2 Web application . 42 4.2.1 Web framework considerations . 42 4.2.2 Overview . 42 4.2.3 Translation interface . 42 4.2.4 Form access and administration . 43 5 Evaluation 44 5.1 Methodology . 44 5.2 Participants . 44 5.3 Ethics . 45 5.4 Bias . 45 5.5 Pilot study . 45 5.6 Procedure . 45 5.7 Findings . 46 5.7.1 Element representations . 46 5.7.2 Navigation . 46 5.7.3 Expectations . 46 5.7.4 Manipulation . 47 5.7.5 Errors . 47 5.7.6 Unintended uses . 47 5.7.7 Viewing the final form . 48 5.8 List of changes . 48 5.9 Discussion . 48 6 Conclusion 49 6.1 Future work . 49 A Screenshots 53 B Test forms 58 2 List of Figures 2.1 Erosion . 10 2.2 Dilation . 10 2.3 Thresholding . 11 2.4 Connectivity . 12 2.5 Contour detection . 13 2.6 Douglas-Peucker algorithm . 13 2.7 Push-Pull Door . 17 2.8 Hick’s law . 18 3.1 Sequence diagram . 22 3.2 Use cases . 25 3.3 Interface objects: text, label, textbox, checkbox . 29 3.4 Normal toolbar & expanded toolbar . 30 3.5 Throbber animation . 31 3.6 Selected textbox . 31 3.7 Drag select . 32 3.8 Element transparency . 32 3.9 Unlinked error animation . 33 4.1 Example text fields . 41 A.1 Before loading a form . 53 A.2 After loading a form . 54 A.3 After processing a form . 55 A.4 Preview . 56 A.5 Created . 57 B.1 Form 1 . ..