The Industry.Net site received documents for Web publication from outside clients in diverse formats. Handling these documents quickly and cost-effectively was critical.
I developed a proof-of-concept and feasibility evaluation for semi-automated acquisition of announcements and press releases received on paper. I led a team of four that built a system of two PCs with scanners to scan documents, categorize them by company, and OCR them, then FTP the results automatically to a Solaris box for indexing (via Excite) and automated Web publishing. The system was used at two trade shows, where it showed very good throughput and cost figures.
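As a rough illustration of the categorize-then-FTP step, here is a minimal sketch in present-day Python using the standard `ftplib` module. It is not the original 1997 implementation, whose details are not recorded here. The `incoming/` directory, the company-name slug scheme, and the function names `company_slug`, `remote_path`, and `upload_documents` are all hypothetical.

```python
import re
from ftplib import error_perm
from pathlib import Path


def company_slug(company: str) -> str:
    """Turn a company name into a lowercase directory name (hypothetical scheme)."""
    slug = re.sub(r"[^a-z0-9]+", "-", company.lower()).strip("-")
    return slug or "unknown"


def remote_path(company: str, filename: str) -> str:
    """Build a remote path under an assumed incoming/ directory, grouped by company."""
    return f"incoming/{company_slug(company)}/{filename}"


def upload_documents(ftp, company: str, text_files) -> list:
    """Upload OCR output files for one company and return the remote paths used.

    `ftp` is a connected ftplib.FTP instance, or any object offering the same
    mkd() and storbinary() methods. The incoming/ directory is assumed to
    already exist on the server.
    """
    directory = f"incoming/{company_slug(company)}"
    try:
        ftp.mkd(directory)
    except error_perm:
        # The server refused to create it, most likely because it already exists.
        pass

    uploaded = []
    for local in text_files:
        local = Path(local)
        target = remote_path(company, local.name)
        with local.open("rb") as handle:
            ftp.storbinary(f"STOR {target}", handle)
        uploaded.append(target)
    return uploaded
```

In use, a scanning PC would open a connection with `ftplib.FTP(host)`, log in, and call `upload_documents` once per company batch. Grouping files by company on the remote side leaves the indexing and publishing steps free to treat each directory as one client's submissions.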
Rujith de Silva 1997-05-13