|Subject:||Re: [jira] Commented: (LUCENE-2392) Enable flexible scoring|
|Date:||Mon, 12 Apr 2010 04:31:18 -0400|
I disagree. I think what Mike has defined here is way beyond a baby-step: its all the stats needed to support modern IR models in Lucene: BM25, additional vector space algorithms, divergence from randomness, and language modelling.
I think the ability to calculate your own random statistics and shove them into the index (this would be messy like how to get access to the aggregates you need anyway) is something different entirely, best left to research systems.
You can't even do that with Terrier now.
On Mon, Apr 12, 2010 at 3:35 AM, Shai Erera (JIRA) <jira@xxxxxxxxxx> wrote:
|<Prev in Thread]||Current Thread||[Next in Thread>|
|Previous by Date:||[jira] Commented: (LUCENE-2373) Change StandardTermsDictWriter to work with streaming and append-only filesystems, Shai Erera (JIRA)|
|Next by Date:||Re: [jira] Commented: (LUCENE-2392) Enable flexible scoring, Shai Erera|
|Previous by Thread:||[jira] Commented: (LUCENE-2392) Enable flexible scoring, Shai Erera (JIRA)|
|Next by Thread:||Re: [jira] Commented: (LUCENE-2392) Enable flexible scoring, Shai Erera|
|Indexes:||[Date] [Thread] [Top] [All Lists]|