PDFZone Ziff-Davis Enterprise
Google Desktop Plug-In Enables PDF Search
By Don Fluckinger

ScanSoft provides a plug-in that can scan text PDFs as well as index and search image-based documents such as faxes and paper scans.

The Google Desktop utility went live Monday after a beta cycle of about six months. The 1.0 release supports PDF search for the first time. In addition, ScanSoft has released a beta plug-in called OmniPage Search Indexer that not only supports PDFs containing text but can also OCR and index image-based PDFs of scanned text, returning the results on a locally served Google-type page.

"We're very pleased to be one of the first developers to work with Google and their new API to enable this," said Robert Weideman, senior vice president of marketing and product strategy for ScanSoft's Productivity Applications Division.

Weideman added that OmniPage Search Indexer also handles other image file formats such as BMP, MAX and TIFF. "We see this as an important event [for ScanSoft] and one we're evaluating, should we decide to support OmniPage Search Indexer with other desktop search products."


That's likely to happen, Weideman said, because there's a need. Most companies that offer desktop search utilities—like Google, Yahoo!, Ask Jeeves and Microsoft Corp.'s MSN—live in the Internet search space, where there is little call for image-based document search, as most companies don't post faxes and scans of paper documents to the Web. So search vendors don't develop tools to search them.


On the desktop, however, where users need to tap into archives of image documents stored on hard drives or on company intranets, an image-based document search tool suddenly becomes important.

"While it's rare that there's [scanned text documents] on the Web, it's actually quite common that it's on a person's PC," Weideman said. "There's a big gap right now when the companies traditionally participating in the public Web search arena came to the desktop. ... If you're a lawyer, you're not posting contracts you've scanned in on your public Web site, but you definitely have them on your PC and in your network environment."

Neither a spokesperson for Google—expressing a desire to display equal enthusiasm for all third-party plug-in developers—nor ScanSoft, bound by non-disclosure, offered much information about how the two companies came together.

Weideman did say, however, that the companies have a "mutual interest" in seeing scanned paper publications and speech content made visible to Internet search engines beyond the Google Desktop.

He also said that the two companies have worked together for some time to bring the OmniPage Google Desktop plug-in to market.

"We worked with Google on their definition of the API; we were [involved] very early in the process and provided them feedback on how they can make the API better," Weideman said.

Currently, only an English version of OmniPage Search Indexer is available. ScanSoft says it plans to make Dutch, French, German, Italian, Portuguese and Spanish localized versions available at the same download sites within 30 days.

The current beta version of OmniPage Search is free. ScanSoft may charge for the commercial release version—which will come after what Weideman estimates will be a 30- to 60-day beta cycle—but he also added that a free, time-limited demo will remain available for download from the Google site.

In addition, he pointed out that while ScanSoft may be first to market with a PDF search tool for the Google Desktop, the arrangement is not exclusive. In time there could be competing tools. ScanSoft was able to customize a search tool fastest, Weideman said, in part because the company owns six different OCR engines. Each involves different trade-offs among file size, accuracy and speed; the one the company settled on turned into about a 5MB runtime download.

"I would expect to see some of our competitors come to the party with their own offerings," Weideman said. "Their challenge is ... getting their runtimes down to 5MB or smaller."


