Corpora: Using a relational database to store conc pointers

Chris Brew cbrew at ling.ohio-state.edu
Fri Mar 31 18:09:57 UTC 2000


There's a discussion of compressing inverted indices
in the excellent "Modern Information Retrieval" by Baeza-Yates and
Ribeiro-Neto, pp 184 ff.


>
> Hi Mickel,
>
> can you post the reference to that Moffat article? Sounds interesting.
>
> Thanks & regards,
> Jochen
>
> > It is also possible to sensibly reduce the above mentioned array
> > by compressing ordered lists of occurrence positions. I found a
> > paper by Alistair Moffat at the Dept. of Computer Science of Univ.
> > of Melbourne describing a method for compressing ordered list of
> > numbers.
> [...]
> > 18 bits on average, which is almost half of the 32 bits you would need
> > when storing such a list of numbers in the obvious way.
> > If you want, I can send you the program to have a look at it.
>
>
> --



More information about the Corpora mailing list