Mailing List Archive

How to extract highest TF-IDF terms from Lucene index?
Hi,

Assuming that there is a large lucene collection, and I want to extract top
N terms with highest TF/IDF scores from some field.
The collection does not have term vectors stored. Does Lucene have some
utility to do this?

Thanks!
Michael
Re: How to extract highest TF-IDF terms from Lucene index? [ In reply to ]
There is a tool named HighFregTerms, in contrib/misc that does this...

Mike

Sent from my iPad

On May 9, 2012, at 4:18 PM, Michael Berkovsky <michael.berkovsky@gmail.com> wrote:

> Hi,
>
> Assuming that there is a large lucene collection, and I want to extract top
> N terms with highest TF/IDF scores from some field.
> The collection does not have term vectors stored. Does Lucene have some
> utility to do this?
>
> Thanks!
> Michael

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: How to extract highest TF-IDF terms from Lucene index? [ In reply to ]
Thanks!

On Wed, May 9, 2012 at 2:01 PM, Mike McCandless
<lucene@mikemccandless.com>wrote:

> There is a tool named HighFregTerms, in contrib/misc that does this...
>
> Mike
>
> Sent from my iPad
>
> On May 9, 2012, at 4:18 PM, Michael Berkovsky <michael.berkovsky@gmail.com>
> wrote:
>
> > Hi,
> >
> > Assuming that there is a large lucene collection, and I want to extract
> top
> > N terms with highest TF/IDF scores from some field.
> > The collection does not have term vectors stored. Does Lucene have some
> > utility to do this?
> >
> > Thanks!
> > Michael
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>