Can Alfresco be used to manage scanned paper documents?

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
‎05-02-2006 12:20 AM
Seems Alfesco looks great as a broader CMS, could you outline why Alfresco would be suitable for managing scanned in documents, along the normal lines as required for any office's general administration.
Ie. All the mail arrives in the morning, the receptionist feeds it into the fancy feeder scanner, then indexes/files it online into the document management software. Mail includes, client correspondence, tax office letters, seminar invites, bills to pay etc.
Users can view new documents 'marked for their attention' as well as everything is essentially pre-filed and available using a search as well as under standard categories/folder like retrieval structure.
Could anyone outline if & how we could achive something along these lines using Alfresco. Also any references to tutorials/resources on how this is done, ie From scanner through to Alfresco, would be much appreciated.
Thanks in advance.
Evolve2k
- Labels:
-
Archive
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
‎07-18-2006 06:03 PM
Thanks,
Sean

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
‎07-19-2006 06:29 PM
Alfresco integrates with Kofax and eCopy; leading scanning and capture solutions. This means that scanned documents can be added to Alfresco automatically.
Alfresco can then categorise and file those documents according to user defined "Rules". These are like Inbox rules in MS Office.
Search can be performed against the content, content meta-data, folder location and category. The scanning solutions can extract important values from scanned documents which may be used as content meta-data for advanced Alfresco searches e.g. find tax office letter with reference number 12456.
Workflows or notifications (such as an e-mail) may be triggered on addition of new content or an rss feed may be subscribed to.
All of this available by configuring scanning integration and rules. No coding is required.
I suggest you send an e-mail to info@alfresco.com with your requirements where more information about how to get the scanning integration can be made available.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
‎08-04-2006 04:13 AM
Are you aware of anything similar to the Kofax release script planned or available for Nuance's OmniPage Professional 15? I see that Nuance are not on your partners list.
thanks
Jason
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
‎04-23-2007 11:16 PM
Is it licensed by scanning workstation?
************************
Is Alfresco going to consider implementing capture features in the near future?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
‎04-24-2007 09:37 AM
Just gives you some ideas on how the cost would be impacted.

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
‎04-24-2007 11:32 PM
I recently purchased the "Alfresco book" by Munwar Shariff, and there is a Chapter (13) dedicated to Implementing Imaging and Forms Processing, which runs through an example showing how a French bank scans and processes 20,000 documents / hour… nice.
I recommend you have a read of this book. I cannot comment anymore cause I haven't implemented any scanning yet and I do not want to infringe on the books Copyright protection.
Cheers,
Bradley

- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
‎12-26-2007 10:27 AM
eg:
http://code.google.com/p/tesseract-ocr/
http://code.google.com/p/ocropus/
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
‎12-29-2007 10:30 AM
But wouldn't it be nice to integrate open source OCR software into alfresco? That would be completely into the Alfresco philisophy and would save a lot of money for medium sized business without large volume production scanners. I can imagine flat bed scanners at every medium sized department.
eg:
http://code.google.com/p/tesseract-ocr/
http://code.google.com/p/ocropus/
I don't think there's yet a Tesseract based package to create a searchable PDF, or at least one that's free. (I have my suspicions that ScanWiz may be Tesseract based).
I guess Alfresco could index PDFs without making them searchable using Tesseract in the way that DocMGR does : http://docmgr.sourceforge.net/install.php . Still, the real solution is to make the PDFs searchable in the first place, and then Alfresco would index them quite happily.
As an aside it would be nice if Alfresco could pass search-words to Acrobat Reader so that PDFs open with search-words already highlighted. This can be done through Acrobat Readers "Open Parameters" via a URL.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
‎02-28-2011 03:38 AM
But wouldn't it be nice to integrate open source OCR software into alfresco? That would be completely into the Alfresco philosophy and would save a lot of money for medium sized business without large volume production scanners. I can imagine flat bed scanners at every medium sized department.
Sorry to resurrect an old thread, but I am trying to achieve this, and it does not look very difficult.
I wrote a few lines of code to add invisible text to an existing PDF (using the Open Source Java PDFBox library).
So now I guess I have all of the pieces, and it becomes an Alfresco question: How to best architecture this?
Maybe an Alfresco action that calls Tesseract via command line and then inserts the OCR'd text into the PDF?
Or the same as a transformer?
Thanks for any feedback!
Nicolas Raoul
