Previous Topic: Classification in EmailsNext Topic: Parameter 6 Functions


Classification Parameters

The parameters for classifying documents are:

Parameter 1

This defines a list of compulsory words or phrases. All MUST be present for CA DataMinder to confirm a document classification. If validated, this parameter raises the document score by +1. In effect, this parameter specifies a Boolean AND condition.

Parameter 2

This defines a list of required words or phrases, one of which MUST be present for CA DataMinder to confirm a document classification. If validated, this parameter raises the document score by +1. In effect, this parameter specifies a Boolean OR condition.

Parameter 3

This defines a list of preferred words or phrases. These are positive-indicators. If any are present, this increases the probability that the document matches the specified classification. CA DataMinder raises the document score by +1 for each occurrence of a listed word or phrase.

Parameter 4

This defines a list of words or phrases that imply a possible non-match. These are negative-indicators. If any are present, this lowers the probability that the document matches the specified classification. CA DataMinder reduces the document score by -1 for each occurrence of a listed word or phrase.

Parameter 5

This defines a list of words or phrases that indicate a definite non-match. If any are present, the document classification fails. If none are present, CA DataMinder raises the document score by +1. In effect, this parameter specifies a Boolean NOT condition.

Parameter 6

You can include functions to modify the document score that CA DataMinder uses to confirm a document classification. For example: MinScore(n) and MaxScore(n) specify respectively the minimum and maximum scores necessary to confirm document classification.

Parameter 7

This defines a list of key words or phrases that you can use to search for events captured by a Document Classifier trigger. If a word is found in the document when the Document Classifier trigger activates, the word is saved as an attribute of the capture or control event. You can then run searches that focus exclusively on documents containing this word or phrase.

For example, if you define a document classification for sales proposals, you can add a list of products to parameter 7. If CA DataMinder detects one of these product names (say, Product X), in a captured sales proposal, the term 'Product X' is saved as an attribute of this event. This enables you to generate a report focusing on all sales proposals for Product X.

Note: For details about the extensive search text variables available when defining parameter 7, see the Search Text Syntax appendix.

Parameter 8

This defines a list of 'definite match' words or phrases. If any are present, CA DataMinder effectively confirms the document classification.

This parameter is useful if you need to detect specific types of Web site, typically those with limited text content. For example, a Web-based email site may contain very little text except for the term 'webmail'.

Note: In technical terms, this parameter overrides the Boolean conditions of Parameters 1, 2 and 5 and adds 100 onto the document score.

Parameters 9 and 10

These are not currently used in generic classifications.

More information:

Example Document Classification