mg_invf_rebuild - Rebuild an mg inverted file with or without skips.
mg_invf_rebuild [ -0 | -1 [ -k num ] | -2 [ -s num ] [ -m num ] ] [ -d directory ] -f name
mg_invf_rebuild builds a new inverted file, with or without skipping, from an old inverted file. The program uses *.invf.ORG and *.invf.idx.ORG as the source from which it builds *.invf and *.invf.idx. If *.invf.ORG or *.invf.idx.ORG does not exist, the program renames *.invf or *.invf.idx to *.invf.ORG or *.invf.idx.ORG as appropriate. The old inverted file may itself contain skipping, so it is possible to delete the *.ORG files after the new inverted file is built.
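The renaming step described above can be sketched in shell. This is only an illustration of the documented behaviour, not the program's actual code; the function name prepare_sources and its two arguments (a directory and a collection name) are assumptions made for the example.

```shell
#!/bin/sh
# Sketch of the source-file preparation described above (not the mg source).
# prepare_sources DIR NAME: make sure DIR/NAME.invf.ORG and
# DIR/NAME.invf.idx.ORG exist, renaming the current files into place
# when a .ORG copy is missing.
prepare_sources() {
    dir=$1
    name=$2
    for ext in invf invf.idx; do
        cur="$dir/$name.$ext"
        org="$cur.ORG"
        if [ ! -e "$org" ]; then
            # No saved original yet: the current file becomes the source.
            mv "$cur" "$org" || return 1
        fi
    done
}
```

For a hypothetical collection called demo in the current directory, prepare_sources . demo would leave demo.invf.ORG and demo.invf.idx.ORG in place as the inputs for the rebuild. An existing .ORG file is never overwritten.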
Options may appear in any order.
- This displays a usage line on stderr.
- -0
- This generates a non-skipped inverted file. This option is normally only needed if the *.ORG files have been deleted.
- -1
- This generates a skipped inverted file. The -k num argument specifies the number of pointers hopped over with each skip.
- -2
- This generates a skipped inverted file that is built to be `optimal' for ranking with a specific number of accumulators. Each term in the inverted file gets its own skip length. The arguments -s and -m control the sizes of the skips.
- -k num
- This specifies the number of pointers that should be hopped over with each skip. This option is only valid if -1 is specified.
- -m num
- This specifies the intended number of accumulators that will be used when ranking queries are done on the collection.
- -s num
- This specifies the minimum size for skips. If the calculation of the optimal skip size results in a number smaller than num, the skip size is set to num.
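The minimum-skip rule for -s and the three build modes can be sketched as follows. The sketch assumes that -0, -1 and -2 correspond, in synopsis order, to the non-skipped, fixed-skip and accumulator-tuned forms. The helper names, the collection name demo and all numeric values are made up for illustration, and the calculation of the optimal skip size itself is not shown because this page does not describe it.

```shell
#!/bin/sh
# effective_skip OPTIMAL MIN: apply the -s rule to one term's skip size.
# OPTIMAL stands in for whatever the real calculation yields (not shown);
# MIN is the value that would be given with -s.
effective_skip() {
    if [ "$1" -lt "$2" ]; then
        echo "$2"   # computed size is below the minimum, so use the minimum
    else
        echo "$1"
    fi
}

# Example command lines for the three modes. "demo" is a hypothetical
# collection name and the numbers are arbitrary.
example_commands() {
    echo "mg_invf_rebuild -0 -f demo"
    echo "mg_invf_rebuild -1 -k 16 -f demo"
    echo "mg_invf_rebuild -2 -s 8 -m 5000 -f demo"
}
```

With a minimum of 8, a computed skip size of 3 is raised to 8, while a computed size of 20 is left alone.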
MGDATA If this environment variable exists, then its value is used as the default directory where the mg(1) collection files are. If this variable does not exist, then the directory "." is used by default. The command line option -d directory overrides the directory in MGDATA.
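The precedence between -d, MGDATA and the default directory can be sketched as a small shell function. The name resolve_dir is an assumption for the example; its optional argument stands for the value given with -d.

```shell
#!/bin/sh
# resolve_dir [DIR]: pick the collection directory.
# DIR stands for the value given with -d; leave it out if -d was not used.
resolve_dir() {
    if [ -n "${1-}" ]; then
        echo "$1"              # -d overrides everything else
    else
        # ${MGDATA-.} yields "." only when MGDATA is not set at all,
        # matching "if this environment variable exists".
        echo "${MGDATA-.}"
    fi
}
```

With MGDATA unset and no argument, resolve_dir prints "."; once MGDATA is set, its value is printed unless a directory is passed explicitly.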
- *.invf
- Inverted file.
- *.invf.ORG
- Original inverted file.
- *.invf.idx
- The index into the inverted file.
- *.invf.idx.ORG
- The original index into the inverted file.
- Compressed stemmed dictionary.