Lucidworks CTO Grant Ingersoll’s latest column on gives you a rundown on five of the most popular and powerful open source projects for taming text and processing natural language in both queries and indexing. Highlights projects from Stanford and the Apache Software Foundation:

“Thankfully, open source is chock full of high-quality libraries to solve common problems in text processing like sentiment analysis, topic identification, automatic labeling of content, and more. More importantly, open source also provides many building block libraries that make it easy for you to innovate without having to reinvent the wheel. If all of this stuff is giving you flashbacks to your high school grammar classes, not to worry—we’ve included some useful resources at the end to brush up your knowledge as well as explain some of the key concepts around natural language processing (NLP). To begin your journey, check out these projects.

Read all of Grant’s columns on or follow him on Twitter.

About Lucidworks

Read more from this author


Contact us today to learn how Lucidworks can help your team create powerful search and discovery applications for your customers and employees.