Intelligent Enterprise featuring Transform
START NEWS & ANALYSIS OPINION CHANNELS PRODUCT GUIDES REVIEWS TECHWEBCASTS
CONTACTS ARCHIVES ADVANCED SEARCH

November 2000

FIRST LOOKS:

Server-Side File Conversion for the Web

By Lowell Rapaport

More and more companies are communicating with their branch offices, suppliers and customers using Web technology. But ordinary browsers need help handling the wide variety of file formats used in business. Last month's First Looks examined a pair of client-based file and image viewers that integrate with just about any browser. Once installed on the user's PC, they make viewing, downloading and/or printing almost any type of file a seamless, browser-based activity.

This month we look at three products that approach Web-based file viewing from the server side. Why would you want a server-based solution for file delivery? The main advantage is that you won't have to install viewing software at the client level to make information accessible. This is particularly important if you have large deployments, multiple locations or applications extending outside your enterprise - to suppliers, service providers or customers. Massive internal client installs are costly and time consuming, and it's much harder to guarantee success if the desktops are outside your enterprise.

Server-based file delivery gives you near universal compatibility with any Web client. Preprocessing files into a Web-compatible format at the server will ensure that all your users and customers will be able to get the information they need, without installing viewers or plug-ins.

The downside of server-side file processing is that your server needs to have much more horsepower than an ordinary file or Web server. In addition to serving the files, the server also has to convert them to a form that the browser will understand.

Another challenge with server-based viewing is the browser's limited features as a document viewer. There are no zoom, thumbnail or annotation functions like you get in a full-blown file viewer. If you want to provide these features, they have to be implemented on the server via scripts. Two scripting alternatives are to use either downloadable Java applications or Javascript. These tools let the server turn file processing and viewing functions over to the client machine. Since Java and Javascript are built into most browser platforms, users won't have to install helper applications or plug-ins.

In developing NetVue, Java was the technology of choice for Accusoft, Westborough, MA (www.accusoft.com). The NetVue client is a Java application with an assortment of common image-viewer tools, and it will automatically download and cache within the browser. On the server, NetVue maintains a document repository in a SQL or open database connectivity database. This lets it integrate with existing document management systems with a compatible database.

NetVue converts more than 100 file formats, including an add-on module for Microsoft Office files. It is popular with application service providers (ASPs) because it lets them support their customers' existing archives without converting the documents for Web compatibility. It also allows them to support any hardware on the client side. ASP Network Computing Devices of Mountain View, CA (www.ncd.com), uses NetVue to support an array of financial services and manufacturing clients. Datahub of Westlake Village, CA (www.datahubonline.com), uses NetVue as part of a service that allows municipalities to distribute government documents such as title deeds and birth certificates online.

NetVue uses a high-speed streaming protocol to download documents faster than conventional HTTP. The product also offers an annotation module that supports up to 256 layers of annotations. According to AccuSoft, annotations can be used to turn NetVue into an ad-hoc workflow solution.

The latest version of NetVue, version 3.1, adds a number of enhancements. Released in September, version 3.1 offers an optional Content Manager that can search NetVue's document repository for keywords and/or phrases. An optical character recognition (OCR) engine lets you search content in nontext files such as TIFF images. AccuSoft says these documents are automatically processed in batches during system idle time, and the search metadata is saved. This prevents users from having to wait for OCR processing when they initiate a search. The content manager flags keywords found within documents and allows easy navigation between occurrences of those words.

Scheduled for future release are Unix and Linux ports for the NetVue server. The company hopes to have the Linux version ready this winter and full Unix support available next spring. Current prices for the NT version of NetVue start at $5,000 per server plus $1,600 to $5,000 for the PDF, CAD, Microsoft Office and Content Manager add-on modules

The venerable Outside-In server is another alternative for delivering a range of file types through ordinary browsers. Developed by the Chicago-based Information Exchange Division of IntraNet Solutions (www.outsideinserver.com), Outside-In converts all documents to HTML text and all images to GIF, JPEG or PNG formats that can be directly rendered by a Web browser. There are no plug-ins, Java applications or Javascript applets, but there are also no annotation, image navigation or image processing features.

Outside-In has been around since the days of Windows 3.X, and for many years it was embedded within Lotus Notes. Today it is positioned as a Web publishing solution for more than 200 file formats, including, Microsoft Office, WordPerfect and other leading desktop application formats. The emphasis is on HTML formatting, and the server software includes templates for all formats and browser types supported. This makes Outside-In very flexible. The HTML code it generates can be custom designed for Internet appliances like Web TV, handheld computers using Wireless Markup Language (WML), and standard Netscape and Internet Explorer browsers. An upgrade specifically supporting handheld computers is expected to bow by the end of this year.

One caveat: Outside-In server has to co-exist with the Web server on the same machine. Because conversion and rendering are processor intensive, scaling up can be difficult, but it is compatible with load-balancing clustering software, a popular solution for scaling-up servers.

Outside-In is file-system-based; integrating it with a document management system's database is possible through the use of scripts, but it is awkward. The HTML rendering engine is available fully integrated with Filenet, Documentum and PC/Docs document management solutions, so a careful look at the Outside-In product will give you an idea how those document management systems have been Web enabled.

Outside-In is of particular interest to businesses that encounter a wide range of document types. Yahoo!, for example, uses Outside-In as the online document viewer for Yahoo Mail. It is also useful in environments such as schools and hospitals, where there is a wide variety of client computers. Outside-In is also useful for businesses that need to make unstructured documents, like human resources information, resumes and freeform documents made in a wide variety of application software available to online users. Outside-In is available at $10,000 per server license or through software leasing plans.

One of the simplest server-side solutions available is the Doc-to-Net image conversion tool from Skyline Tools, Woodland Hills, CA (www.skylinetools.com). Doc-to-Net is a file format filter that converts scanned document images such as Group 3 compressed TIFFs into JPEGs, GIFs or PNGs that can be displayed by a Web browser. Like Outside-In, it is a pure server-side product; no scripts or applet are downloaded to the client. Doc-to-Net does include image-processing tools, like deskew, despeckle and zoom, that are not offered through Outside-In.

Doc-to-Net comes in two versions. A CGI script application that can be added to a Web server and a dynamically linked library (DLL) for conventional application development. On the Web a client accesses Doc-to-Net's image-processing features by clicking on links to CGI or ASP scripts embedded in a Web page.

"Doc-to-Net is a tool for Web developers," says Skyline President Jan Dekkers. "Our customers include a number of Internet fax companies that use Doc-to-Net to make Group 3 compressed faxes available to Web users."

Doc-to-Net is also available as a traditional DLL object to be added to any document imaging application. The tool gives developers an easy way to Web-enable a document archive without converting an archive of scanned document images into browser-compatible formats.

Because Doc-to-Net relies entirely on server-side processing, it will even work with browsers older than version 4.X. It won't format Microsoft Office application files for the Web, but at $600 per server license, it is highly affordable. If your Web publishing needs revolve around scanned document images, it's the perfect solution.

"Doc-to-Net is a tool for Web developers," says Skyline President Jan Dekkers. "Our customers include a number of Internet fax companies that use Doc-to-Net to make Group 3 compressed faxes available to Web users."

Doc-to-Net is also available as a traditional DLL object to be added to any document imaging application. The tool gives developers an easy way to Web-enable a document archive without converting an archive of scanned document images into browser-compatible formats.

Because Doc-to-Net relies entirely on server-side processing, it will even work with browsers older than version 4.X. It won't format Microsoft Office application files for the Web, but at $600 per server license, it is highly affordable. If your Web publishing needs revolve around scanned document images, it's the perfect solution.




Channels
Business Process Management
Content Storage
Content Management
Compliance
Enterprise Solutions
Document Scanning & Capture
Content Delivery & Publishing
Collaboration & Knowledge Management
Search and Classification
Locate an article from our print magazine. Just enter your Locator ID Number below.
ID#


NEWS FROM THE PIPELINE

OpenOffice.org 2.0 Closes On Final

New Study Finds Steep Growth For Smartphones

PalmSource Sale Cleared By Federal Agency

CTIA Panel Examines Enterprise Security Risks

[more]






HOME | ARCHIVE | REALWARE AWARDS

A Publication of the Network Computing Enterprise Architecture Group
Brought to you by CMP Media LLC, Copyright © 2005
Privacy Statement | Your California Privacy Rights | Terms Of Service