Book Description
What developers need to know about the rapidly growing technologies of document warehousing and text mining
This unique book shows warehouse developers and managers how to build this new type of warehouse, how to organize free-form text for easy access, and, most importantly, how to exploit text mining techniques to provide timely and accurate information for decision-makers. The author covers the complete process of building and managing a document warehouse, including examples of actual implementations, a review of security issues and tools such as XML and Wide Area Information Servers and their selection criteria, and how text mining techniques are different from data mining techniques.
Back Cover Copy
"This book combines a thorough introduction to document warehousing with an in-depth technical tutorial for implementation. Dan Sullivan truly leaves no stone unturned. This book is my de-facto document warehousing resource!"Jill Dyche, Baseline Consulting Group
Most business information isnt neatly stored in databases. Its more likely found in a swirl of millions of Web pages, e-mails, and free-form text documents. To capture and tame this flood of information for decision making, businesses are now turning to document warehousing and text mining techniques.
This book provides database and data warehouse developers and managers with complete guidance on how to build and manage a document warehouse, how to organize unstructured text for easy storage and retrieval, and how to leverage text mining techniques to provide timely and accurate information for decision-makers.
Youll learn how document warehousing and text mining technologies compare with their numeric data warehousing and data mining counterparts, and discover essential tools for tapping into information resources within your company and across the Internet.
- Scripts for common tasks
- Document templates used in the design stage of document warehouse development
- Links to related sites
See all Editorial Reviews