Taming Text is released!

A new exciting book just published from Manning, with the catchy title Taming Text, by Grant S. Ingersoll (fellow Apache Lucene committer), Thomas S. Morton, and Andrew L. Farris.

Text processing has become vital for businesses to remain competitive in this digital age, with the amount of online unstructured content growing exponentially with time. Yet, text is also a messy and therefore challenging science: the complexities and nuances of human language don’t follow a few simple, easily codified rules and are still not fully understood today.

The book describe search techniques, including tokenization, indexing, suggest and spell correction. It also covers fuzzy string matching, named entity extraction (people, places, things), clustering, classification, tagging, and a question answering system (think Jeopardy).

Share the knowledge

You Might Also Like

The History of MCP and ACP: Where Did These Ideas Come From and Who’s Driving Adoption?

In the past year, two acronyms have quietly rewritten the playbook for...

Read More

AI Search Is Disrupting Everything. Here’s What B2B Marketing Leaders Should Do First.

Generative AI didn’t just change search. It changed how every buyer, seller,...

Read More

Will ACP Become the “New Checkout Button”? What Enterprises Need to Know

In digital commerce, every few decades, a single innovation reshapes the entire...

Read More

Quick Links