Taming Text: How to Find, Organize, and Manipulate It
Taming Text: How to Find, Organize, and Manipulate It
Book Details
- Author: Grant S. IngersollThomas S. MortonDrew Farris
- Binding: Paperback
- Publisher: Manning Publications
- Published: 2013-01-24
- Edition: 1st
Regular price
$6.95 USD
Regular price
Sale price
$6.95 USD
Unit price
/
per
Summary
Taming Text, winner of the 2013 Jolt Awards for Productivity, is a hands-on, example-driven guide to working with unstructured text in the context of real-world applications. This book explores how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. The book guides you through examples illustrating each of these topics, as well as the foundations upon which they are built.
About this Book
There is so much text in our lives, we are practically drowningin it. Fortunately, there are innovative tools and techniquesfor managing unstructured information that can throw thesmart developer a much-needed lifeline. You'll find them in thisbook.
Taming Text is a practical, example-driven guide to working withtext in real applications. This book introduces you to useful techniques like full-text search, proper name recognition,clustering, tagging, information extraction, and summarization.You'll explore real use cases as you systematically absorb thefoundations upon which they are built.Written in a clear and concise style, this book avoids jargon, explainingthe subject in terms you can understand without a backgroundin statistics or natural language processing. Examples arein Java, but the concepts can be applied in any language.
Written for Java developers, the book requires no prior knowledge of GWT.
Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book.
Winner of 2013 Jolt Awards: The Best Books—one of five notable books every serious programmer should read.
What's Inside
Grant Ingersoll is an engineer, speaker, and trainer, a Lucenecommitter, and a cofounder of the Mahout machine-learning project. Thomas Morton is the primary developer of OpenNLP and Maximum Entropy. Drew Farris is a technology consultant, software developer, and contributor to Mahout,Lucene, and Solr.
"Takes the mystery out of verycomplex processes."—From the Foreword by Liz Liddy, Dean, iSchool, Syracuse University
Table of Contents
Taming Text, winner of the 2013 Jolt Awards for Productivity, is a hands-on, example-driven guide to working with unstructured text in the context of real-world applications. This book explores how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. The book guides you through examples illustrating each of these topics, as well as the foundations upon which they are built.
About this Book
There is so much text in our lives, we are practically drowningin it. Fortunately, there are innovative tools and techniquesfor managing unstructured information that can throw thesmart developer a much-needed lifeline. You'll find them in thisbook.
Taming Text is a practical, example-driven guide to working withtext in real applications. This book introduces you to useful techniques like full-text search, proper name recognition,clustering, tagging, information extraction, and summarization.You'll explore real use cases as you systematically absorb thefoundations upon which they are built.Written in a clear and concise style, this book avoids jargon, explainingthe subject in terms you can understand without a backgroundin statistics or natural language processing. Examples arein Java, but the concepts can be applied in any language.
Written for Java developers, the book requires no prior knowledge of GWT.
Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book.
Winner of 2013 Jolt Awards: The Best Books—one of five notable books every serious programmer should read.
What's Inside
- When to use text-taming techniques
- Important open-source libraries like Solr and Mahout
- How to build text-processing applications
Grant Ingersoll is an engineer, speaker, and trainer, a Lucenecommitter, and a cofounder of the Mahout machine-learning project. Thomas Morton is the primary developer of OpenNLP and Maximum Entropy. Drew Farris is a technology consultant, software developer, and contributor to Mahout,Lucene, and Solr.
"Takes the mystery out of verycomplex processes."—From the Foreword by Liz Liddy, Dean, iSchool, Syracuse University
Table of Contents
- Getting started taming text
- Foundations of taming text
- Searching
- Fuzzy string matching
- Identifying people, places, and things
- Clustering text
- Classification, categorization, and tagging
- Building an example question answering system
- Untamed text: exploring the next frontier
The More Than Words double bottom line: Every purchase provides hands on job training opportunities, and all revenue supports our nonprofit to empower youth to take charge of their lives.
Shipping Info
Shipping Info
We offer standard and express shipping starting at $5.99. Live local? We offer local pickup on select items at our Boston Store and Mobile Bookstore location.
Returns
Returns
We accept returns within 30 days of purchase for a refund. Simply reach out to us if you have any trouble with your order.
Care Instructions
Care Instructions
We are here to help with any questions or concerns you may have at any time. Reach out to us and we can't wait to help!
can't find what you are looking for...
We Can HelpJoin our list to learn more about our mission We promise never to sell your information to another party and we promise we won't spam your inbox!