java-user@lucene.apache.org
[Top] [All Lists]

Re: How to search in metadata? (filename, document title, cocument creat

Subject: Re: How to search in metadata? filename, document title, cocument creator, ...
From: "Erick Erickson"
Date: Mon, 20 Oct 2008 11:32:18 -0400
If Grant's suggestions don't help you, some examples of your search
code would be helpful to further pinpoint things...

Best
Erick


On Mon, Oct 20, 2008 at 11:29 AM, Grant Ingersoll <gsingers@xxxxxxxxxx>wrote:

>
> On Oct 20, 2008, at 10:32 AM, mil84 wrote:
>
>
>> I've a problem witch searching. I need to search not only in file
>> contents,
>> but also in metadata. But I don't know how to do it. My code:
>>
>> Document doc = new Document();
>> doc.add(new Field("contents", new FileReader(f)));
>> writer.addDocument(doc);
>> ...
>> QueryParser parser = new QueryParser("contents", new StandardAnalyzer());
>> Query q = parser.parse(list.get(i));
>> Hits hits = is.search(createQuery(searchText));
>>
>> Everything is ok, it searches document's contents. But how to search for
>> example in filenames? I I add this:
>>
>> doc.add(new Field("filename", f.getCanonicalPath(), Field.Store.YES,
>> Field.Index.UN_TOKENIZED));
>>
>
>
> You'll have to tokenize the filename in order for it to match anything
> other than an exact query.  I believe the Lucene demo shows adding other
> fields.  Lucene doesn't have a concept of "metadata".  It only has a concept
> of Document and Fields.  It's up to you to put in your metadata, just as you
> have suggested here.  Try just making that filename Field a tokenized field
> and see what happens.
>
> If you're looking for tools to get metadata from files, then have a look at
> Tika or Aperture, which can easily be paired with Lucene.  Aperture, I
> believe, even comes with a Lucene example.
>
>
>>
>> it doesn't work. Same problem is searching in metadata...Anyone knows how
>> to
>> do it? I browsed whole google 3 hours, but found nothing, not even just
>> one
>> working example, simple nothing....Thanx a lot.
>> --
>> View this message in context:
>> http://www.nabble.com/How-to-search-in-metadata--%28filename%2C-document-title%2C-cocument-creator%2C-...%29-tp20069124p20069124.html
>> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@xxxxxxxxxxxxxxxxx
>> For additional commands, e-mail: java-user-help@xxxxxxxxxxxxxxxxx
>>
>>
> --------------------------
> Grant Ingersoll
> Lucene Boot Camp Training Nov. 3-4, 2008, ApacheCon US New Orleans.
> http://www.lucenebootcamp.com
>
>
> Lucene Helpful Hints:
> http://wiki.apache.org/lucene-java/BasicsOfPerformance
> http://wiki.apache.org/lucene-java/LuceneFAQ
>
>
>
>
>
>
>
>
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@xxxxxxxxxxxxxxxxx
> For additional commands, e-mail: java-user-help@xxxxxxxxxxxxxxxxx
>
>
<Prev in Thread] Current Thread [Next in Thread>