java-user@lucene.apache.org
[Top] [All Lists]

Re: highlighter / fragmenter performance for large fields

Subject: Re: highlighter / fragmenter performance for large fields
From: Brian Beard
Date: Mon, 20 Oct 2008 17:42:48 -0400
Karsten,

Thanks, I will look into this.

>Hi Brian,
>
>I don't know the internals of highlighting („explanation“) in lucene.
>But I know that XTF (
>http://xtf.wiki.sourceforge.net/underHood_Documents#tocunderHood_Documents5
>) can handle very large documents (above 100 Mbyte) with highlighting very
>fast. The difference to your approach is, that xtf devide the document in
>small (overlapping) chunks and store the original text as xml separately
>with connection to lucene indexed fields via numbered xml-nodes.
>For large texts (above 200 KByte), it is the best tool I know.
>
>Best regards
>  Karsten


_________________________________________________________________
Store, manage and share up to 5GB with Windows Live SkyDrive.
http://skydrive.live.com/welcome.aspx?provision=1?ocid=TXT_TAGLM_WL_skydrive_102008
<Prev in Thread] Current Thread [Next in Thread>