Taming Text is released!

Manning has just published an exciting new book with the catchy title Taming Text, by Grant S. Ingersoll (a fellow Apache Lucene committer), Thomas S. Morton, and Andrew L. Farris.

Text processing has become vital for businesses that want to stay competitive in this digital age, as the amount of unstructured online content keeps growing rapidly. Yet text is messy and therefore challenging to work with: the complexities and nuances of human language don’t follow a few simple, easily codified rules and are still not fully understood today.

The book describes search techniques, including tokenization, indexing, suggestions, and spelling correction. It also covers fuzzy string matching, named entity extraction (people, places, things), clustering, classification, tagging, and a question answering system (think Jeopardy).
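
To give a flavor of two of those topics, here is a minimal sketch of tokenization and fuzzy string matching used for spelling suggestions. It is not taken from the book: the function names (`tokenize`, `edit_distance`, `suggest`), the regular expression, and the `max_distance` threshold are all assumptions chosen for illustration, and a real system would more likely rely on a library such as Lucene.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into simple word tokens (illustrative rule)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def edit_distance(a, b):
    """Levenshtein distance computed row by row with dynamic programming."""
    prev = list(range(len(b) + 1))  # distance from the empty prefix of a to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def suggest(word, vocabulary, max_distance=2):
    """Return vocabulary entries within max_distance edits, closest first (hypothetical helper)."""
    scored = sorted((edit_distance(word, v), v) for v in vocabulary)
    return [v for d, v in scored if d <= max_distance]


if __name__ == "__main__":
    print(tokenize("Taming Text is released!"))
    print(suggest("tamng", ["taming", "text", "timing"]))
```

Comparing a misspelled word against every vocabulary entry is fine for a toy word list, but it does not scale to a large index, which is one reason dedicated search libraries exist.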
