dc.contributor.advisor | George, K. M. | |
dc.contributor.author | Penny, Latoyia Devonne | |
dc.date.accessioned | 2014-04-15T18:33:08Z | |
dc.date.available | 2014-04-15T18:33:08Z | |
dc.date.issued | 2009-07-01 | |
dc.identifier.uri | https://hdl.handle.net/11244/8221 | |
dc.description.abstract | The conversion of portable document structures into an editable format is formally described. Conversion of paper-based documents to electronic form is a necessity encountered by the public and private sectors. The converted electronic form may not be editable. Several applications need documents in editable or plain-text form. In this thesis we address this problem with the design and implementation of a conversion tool, P2X. The tool was developed to automatically convert batches of PDF tabular data to an editable spreadsheet format using a novel approach. We show that significant improvements to the quality of data conversion can be achieved at insignificant cost and with minimal complexity. | |
dc.format | application/pdf | |
dc.language | en_US | |
dc.publisher | Oklahoma State University | |
dc.rights | Copyright is held by the author who has granted the Oklahoma State University Library the non-exclusive right to share this material in its institutional repository. Contact Digital Library Services at lib-dls@okstate.edu or 405-744-9161 for the permission policy on the use, reproduction or distribution of this material. | |
dc.title | Design & Implementation of a PDF to Excel Conversion Tool (P2X) | |
dc.type | text | |
dc.contributor.committeeMember | Hedrick, G. E. | |
dc.contributor.committeeMember | Park, Nophill | |
osu.filename | Penny_okstate_0664M_10005.pdf | |
osu.college | Arts and Sciences | |
osu.accesstype | Open Access | |
dc.description.department | Computer Science Department | |
dc.type.genre | Thesis | |
dc.subject.keywords | conversion | |
dc.subject.keywords | data | |
dc.subject.keywords | delimiter | |
dc.subject.keywords | excel | |
dc.subject.keywords | pdf | |
dc.subject.keywords | unicode | |