Lucene is a gem in the open-source world‹-a highly scalable, fast search engine. It delivers performance and is disarmingly easy to use. Lucene in Action is the authoritative guide to Lucene. It describes how to index your data, including types you definitely need to know such as MS Word, PDF, HTML, and XML. It introduces you to searching, sorting, filtering, and highlighting search results.
Lucene powers search in surprising places‹-in discussion groups at Fortune 100 companies, in commercial issue trackers, in email search from Microsoft, in the Nutch web search engine (that scales to billions of pages). It is used by diverse companies including Akamai, Overture, Technorati, HotJobs, Epiphany, FedEx, Mayo Clinic, MIT, New Scientist Magazine, and many others. Adding search to your application can be easy. With many reusable examples and good advice on best practices, Lucene in Action shows you how.
What's Inside - How to integrate Lucene into your applications - Ready-to-use framework for rich document handling - Case studies including Nutch, TheServerSide, jGuru, etc. - Lucene ports to Perl, Python, C#/.Net, and C++ - Sorting, filtering, term vectors, multiple, and remote index searching - The new SpanQuery family, extending query parser, hit collecting - Performance testing and tuning - Lucene add-ons (hit highlighting, synonym lookup, and others)
Customer Reviews
Average Rating:
Rating: - Lucene best practices, distilled
Albeit outdated, 'Lucene in Action' remains a must read for anyone who's looking for a fast and scalable solution for a full-text indexing and search solution. Instead of stumbling through the API documentation all by yourself, the authors provide a high-level overview of the core components of the Lucene API, how to extend them, and best practices associated with each component.
For our application, we've adopted Solr, which abstracts the majority of Lucene API's, but the contents ... Read More
Rating: - A must read if you're working with Lucene
This book is a MUST read if you're working with Lucene. While there is information on the internet about Lucene, I don't know how we can have succeeded on a project without this book. The authors know Lucene inside and out and every single page contains information that cannot be skimmed over.
Rating: - Must-read for anyone using Lucene
Lucene In Action is well-written and easy to understand. It provides in-depth coverage in key areas such as indexing (tokenizing) and retrieval. Definitely a must-read for anyone learning or working with Lucene.
Rating: - Not as useful as I hoped
Beware that this book is quite outdated by now: some of the APIs described in the book (e.g. "Field.Keyword") don't work anymore in recent versions of Lucene, lots of new stuff isn't covered, some information (such as that IndexWriter can't delete documents) is no longer true.
It's still useful as a "Getting Started" tutorial, because there's no such thing in Lucene's official documentation, but the price is a bit high for that.
Rating: - Missing the big picture
This is a good technical book, it has plenty of examples and code bits. I was hoping for more examples of implementation - how to apply it to customer systems. It also had little information regarding hardware and how to scale. I think it's a good book, but I might have had other expectations.