java-user@lucene.apache.org
[Top] [All Lists]

Re: Mixing Case and Case-Insensitive Searching

Subject: Re: Mixing Case and Case-Insensitive Searching
From: "Erick Erickson"
Date: Tue, 17 Apr 2007 14:59:05 -0400
Yeah, what Hoss said. That's a much more elegant solution
than I suggested. If you use the same filter for indexing and searching,
it'll all "just happen" for you.

Erick

On 4/17/07, Chris Hostetter <hossman_lucene@xxxxxxxxx> wrote:


: I've run into a case where we want to search for the acronym 'LET',
: however this three letter word occurs very frequently in quite a
: number of documents.
:
: What I'm looking to do is a query that's case insensitive _except_ for
: that specific term.

it sounds like you need to create a customized bastard stepshild of
StopFilter and LowercaseFilter ... take in a dictionary of known
capitalized acronimes in the constructor, then for each Token lowercase it
unless:
   - it's in all caps
   - it's in your acronym list.



-Hoss


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@xxxxxxxxxxxxxxxxx
For additional commands, e-mail: java-user-help@xxxxxxxxxxxxxxxxx


<Prev in Thread] Current Thread [Next in Thread>