1

I need to organize a large number of scanned receipts. The receipts are scanned in PDF, and I need is software that will intelligently look at a document and parse out date, location / vendor, total amount paid, etc. The key is that these fields need to extracted from each document for tabulation; it isn't enough to just have them OCRed for search purposes.

I think I am essentially looking for similar functionality as is in the NeatReceipts scanners, but without the need to do the scanning. Are there any 3rd party tools for doing this sort of specialized parsing or OCR?

Ramon
  • 113

2 Answers2

2

Have a look at ABBYY FlexiCapture, it's made for specialized OCR of single-type documents: forms, invoices, recepiets etc

ABBYY FlexiCapture 10 is intelligent, accurate and scalable document capture and data extraction software. It provides a single entry point to automatically transform the stream of different forms and documents of any structure and complexity to usable and accessible data ready to be exported into your business applications and databases

Ivo Flipse
  • 24,894
Nikolay
  • 151
1

Well, the NeatWorks software can import PDFs as if you scanned them (there's an option to import in the Quick Scan UI).

Depending on volume, time sensitivity, data sensitivity, etc. you might also check out Shoeboxed, an online service that lets you either mail, scan, upload photos, etc. of receipts and business cards, then provides the information from them. It's kind of like NeatWorks in the cloud.

Shoeboxed pricing is high enough that if you're going to be in an office scanning then you're better off with the Neat products, but if you want to be able to simply take pictures of receipts, etc. while on the go then Shoeboxed may be a better bet for you (if more expensive over time). Shoeboxed may also be more amenable to truly batch processing since you can email in documents.

fencepost
  • 1,106