Dominant Systems - Michigan Network Solutions Provider

Text Mining Application Programming (Programming Series)

by Manu Konchady
Sales Rank: 139602
Discount: 40 %
$30.73
At Amazon

  • Paperback: 432 pages
  • Publisher: Charles River Media; 1st edition (May 4, 2006)
  • Language: English
  • ISBN-10: 1584504609
  • ISBN-13: 978-1584504603
  • Product Dimensions: 9 x 7.3 x 1.1 inches
  • Shipping Weight: 1.8 pounds

    Book Description
    Text Mining Application Programming teaches software developers how to mine the vast amounts of information available on the Web, internal networks, and desktop files and turn it into usable data. The book helps developers understand the problems associated with managing unstructured text, and explains how to build their own mining tools using standard statistical methods from information theory, artificial intelligence, and operations research. Each topic is thoroughly explained, and then a practical implementation is provided. The book begins with a brief overview of text data, where it can be found, and the typical search engines and tools used to search and gather this text. It details how to build tools for extracting and using the text, and covers the mathematics behind many of the algorithms used in building these tools. From there you'll learn how to build tokens from text, construct indexes, and detect patterns in text. You'll also find methods to extract the names of people, places, and organizations from an email, a news article, or a Web page. The next portion of the book teaches you how to find information on the Web, how the Web is structured, and how to build spiders to crawl it. Text categorization is also described in the context of managing email. The final part of the book covers information monitoring, summarization, and a simple Question & Answer (Q&A) system. The code used in the book is written in Perl, but knowledge of Perl is not necessary to run the software. Developers with an intermediate level of experience with Perl can customize the software. Although the book is about programming, methods are explained with English-like pseudocode, and the source code is provided on the CD-ROM. After reading this book, you'll be ready to tap into the bevy of information available online in ways you never thought possible.

    About The Author
    Manu Konchady (Oakton, VA) is a consultant working on open source text mining software. Previously, he worked at Mitre Corp., where he designed and developed software to mine the Internet. He received his Ph.D. in Information Technology from George Mason University, and his articles have appeared in Dr. Dobb's Journal and Linux Journal.

    Customer Reviews & Comments
    There is an old expression that half of knowing anything is knowing where to find it. And there is little more frustrating than looking at 'My Computer' trying to find what you know you have stored in a file somewhere. Well, perhaps just as frustrating is going to one of the search engines and trying to find something that you know is there when you just don't know the proper words to find it. In this book Dr. Konchady talks about how to find data that is in text form on your system, on your network, or out on the web somewhere. He talks about search engines, but also about other techniques that are available only through programming. The CD that comes with the book contains several Perl software snippets that help find named entities, parts of speech, and phrases, and that summarize text documents. This area includes developing web crawlers that individual users can adapt to go out and find specialized information. The CD further contains an open source software package called Text Mine that is designed for mining operations. In addition, it has utilities to build and enhance Text Mine and utilities to build and manage MySQL database tables. This is an excellent book on everything from the basic hints and types through some of the mathematics that underlies text mining. His section on the nature of an English language Question and Answer system is the best I've ever seen.


    Copyright © 2008, Dominant Systems Corporation
