Thanks, I will look into this.
>I don't know the internals of highlighting („explanation“) in lucene.
>But I know that XTF (
>) can handle very large documents (above 100 Mbyte) with highlighting very
>fast. The difference to your approach is, that xtf devide the document in
>small (overlapping) chunks and store the original text as xml separately
>with connection to lucene indexed fields via numbered xml-nodes.
>For large texts (above 200 KByte), it is the best tool I know.
Store, manage and share up to 5GB with Windows Live SkyDrive.