
130 Chapter 9: Indexing Collections with Verity Spider
See also -regexp.
-indinclude
Syntax:
-indinclude exp_1 [exp_n] ...
Specifies that only those files and paths in URLs that match the expressions be followed and
indexed. If you use backslashes, you must double them so that they are properly escaped; for
example:
C:\\test\\docs\\path
You can use wildcard expressions, where the asterisk (*) is for text strings and the question mark
(?) is for single characters; for example:
'/my_doc*/year199?'
In Windows, include double-quotation marks around the argument to protect the special
characters, such as the asterisk (*). On UNIX, use single-quotation marks. This is only required
when you run the indexing job from a command line. Quotation marks are not necessary within
a command file (the
-cmdfile option).
To use regular expressions, also specify the
-regexp option.
Where the
-include option prevents Verity Spider from even following anything that does not
match the specified expressions, the
-indinclude option allows Verity Spider to follow anything
while only indexing that which matches the specified expressions.
Example
If you want to index all documents that include "search" in the URL at http://web.verity.com,
you cannot use the following:
vspider -collection collname -start http://web.verity.com
-include '*search*'
This is because the starting point does not match the -include option criteria. Instead, use the
-indinclude option to follow all documents (unless you have specified any of the exclude
options) and index only those documents that match your criteria. Replace the
-include option
with the
-indinclude option in the preceding example.
Note: When specifying a URL, you must use full, absolute paths using the same format that appears
in the HTML hyperlink. If the link is relative, you must change it to absolute to use it with the
-indinclude option.
See also -regexp.
-indmimeexclude
Syntax:
-indmimeexclude mime_1 [mime_n] ...
Specifies that only those MIME types that match the expressions be followed but not indexed.
Commentaires sur ces manuels