I'm Albert, a Spanish programmer.
I'm developing a C++ version of mg, and I have been looking at mgpp.
I have a question about the implementation of the NEAR operator.
The Greenstone implementation doesn't store word + document number + offset in
the collection files;
it only stores word and offset. This layout is good for
NEAR but bad for exact queries, which are very slow.
Why not use the implementation explained in Managing:
Word <freq, number Document1 [offset1, offset2, ...], number Document2
[offset1, offset2, ...], ..., n>
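
To make the question concrete, here is a minimal in-memory sketch of the layout I mean. All names (Posting, PositionalIndex, near_docs) are my own inventions for illustration and are not taken from mg or mgpp; it ignores compression and on-disk format, and it assumes occurrences are added in ascending (document, offset) order. The "freq" field of the layout above would simply be the length of each word's posting list.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Hypothetical types for illustration only; not mg or mgpp code.
struct Posting {
    std::uint32_t doc;                   // document number
    std::vector<std::uint32_t> offsets;  // word positions inside that document, ascending
};

class PositionalIndex {
public:
    // Record one occurrence of `word`. Assumes calls arrive in ascending
    // (doc, offset) order for each word, as an indexer scanning the text would.
    void add(const std::string& word, std::uint32_t doc, std::uint32_t offset) {
        std::vector<Posting>& list = postings_[word];
        if (list.empty() || list.back().doc != doc) {
            assert(list.empty() || list.back().doc < doc);
            list.push_back(Posting{doc, {}});
        }
        list.back().offsets.push_back(offset);
    }

    // Document-level query: reads only the document numbers, never the offsets.
    std::vector<std::uint32_t> docs(const std::string& word) const {
        std::vector<std::uint32_t> out;
        auto it = postings_.find(word);
        if (it == postings_.end()) return out;
        for (const Posting& p : it->second) out.push_back(p.doc);
        return out;
    }

    // NEAR query: documents in which `a` and `b` occur at most `window` words apart.
    std::vector<std::uint32_t> near_docs(const std::string& a, const std::string& b,
                                         std::uint32_t window) const {
        std::vector<std::uint32_t> out;
        auto ia = postings_.find(a);
        auto ib = postings_.find(b);
        if (ia == postings_.end() || ib == postings_.end()) return out;
        const std::vector<Posting>& la = ia->second;
        const std::vector<Posting>& lb = ib->second;
        std::size_t i = 0, j = 0;
        while (i < la.size() && j < lb.size()) {  // merge on document number
            if (la[i].doc < lb[j].doc) {
                ++i;
            } else if (lb[j].doc < la[i].doc) {
                ++j;
            } else {
                if (within(la[i].offsets, lb[j].offsets, window)) out.push_back(la[i].doc);
                ++i;
                ++j;
            }
        }
        return out;
    }

private:
    // Two-pointer scan over two ascending offset lists: true if any pair of
    // offsets differs by at most `window`.
    static bool within(const std::vector<std::uint32_t>& x,
                       const std::vector<std::uint32_t>& y, std::uint32_t window) {
        std::size_t i = 0, j = 0;
        while (i < x.size() && j < y.size()) {
            std::uint32_t lo = std::min(x[i], y[j]);
            std::uint32_t hi = std::max(x[i], y[j]);
            if (hi - lo <= window) return true;
            if (x[i] < y[j]) ++i; else ++j;
        }
        return false;
    }

    std::map<std::string, std::vector<Posting>> postings_;
};
```

With this layout a plain word query only walks the document numbers, while NEAR first intersects on document number and compares offsets only inside the documents that contain both words.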
Thanks for your help.