PDFZone Ziff-Davis Enterprise
Data Warehouse Guru Offers Pointers on Unstructured Data
By W.H. Inmon

Pressure builds to deal effectively with documents, email and other non-traditional data formats.

Unstructured data has been around for a long time, certainly longer than the computer. Consider the Bible, Egyptian hieroglyphics, and the Kama Sutra, all of which long predate the silicon chip. Search engines have been around for a while as well, though not as long as the printed word. Yet when it comes to unlocking the valuable information contained in unstructured data, the world has not come very far, even with sophisticated search engines. Why is this the case?

A missing ingredient must be present before search engines can unlock the real value of unstructured data. To help explain that missing ingredient, consider the oldest information technology conundrum of all: GIGO, or “Garbage In, Garbage Out.” What happens when a powerful search engine is used against textual data that is essentially unscrubbed, unwashed and unintegrated? The results returned to the end user are just as unscrubbed, unwashed and unintegrated.


For a search of text to be really powerful, the text being searched needs to be integrated before the search is done. Once that integration is complete, you no longer start with garbage in, so you should no longer expect garbage out.

Read the rest of this article on eWEEK.com.




