Lexical Expressions

You can configure a content rule to look for particular words, phrases or character patterns sent to or from your organization. These phrases, or Lexical Expressions, could include confidential information, sensitive terminology, profanities or user-defined expressions.

Lexical expressions provide a powerful way of preventing specific content from leaving or arriving at your Gateway. They are grouped and managed in Lexical Expression Lists, which are then used by content rules.

You can also configure specific lexical expressions called Text Entities. Text entities are lexical expressions which detect character patterns such as credit card numbers, identity numbers, or user-defined regular expressions.

Tell me about...

Lexical Expressions and Text Entities
You can create your own lexical expressions directly, or you can use text entities to build them. A text entity is a predefined or user-defined component or building block that you can include in a lexical expression.

There are three types of text entity:
- Predefined Entities
  
  Predefined Entities are pre-configured, standard lexical patterns which are frequently used. For example, predefined entities can match against credit card numbers or identification numbers from different regions (identity card, driving license, passport and Japanese My Number). Predefined entities are fixed patterns and cannot be edited.
- User Defined Entities
  
  You can configure your own reusable text entities. These user-defined entities are displayed in a list and are available for use in lexical expressions. See User Defined Entities for more information.
- Lexical Expression Qualifiers
  
  Lexical expression qualifiers are specific values you might want to detect, rather than general lexical patterns. For example, you might have a particular list of identification numbers you want to redact or block. You can import the list as a set of qualifiers and then use them in a lexical expression. See Lexical Expression Qualifiers for more information.

Thresholds, Weighting and If matched

You can assign lexical expressions a weighting score between +1 and +10 using the If matched option. The weight of an expression determines its impact on a content rule when the expression is detected. Expressions with a larger weight contribute more to the total score, so they are more likely to violate the content security policy and trigger the What To Do? actions.

Expression Lists are configured with a Threshold, which corresponds to the minimum total weighting score required to trigger the policy to which it has been added.
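
The relationship between weights and the Threshold can be sketched in Python. This is only an illustrative model, not the Gateway's implementation: the function names, the sample phrases and the simple case-insensitive substring matching are all assumptions made for the example.

```python
def total_weight(text, weighted_expressions):
    """Sum the weights of every expression found in the text (hypothetical model)."""
    lowered = text.lower()
    return sum(
        weight
        for phrase, weight in weighted_expressions.items()
        if phrase.lower() in lowered
    )


def list_triggers(text, weighted_expressions, threshold):
    """Return True when the total weight reaches the list's threshold."""
    return total_weight(text, weighted_expressions) >= threshold


# Hypothetical expression list: phrase -> "If matched" weight (1 to 10).
expressions = {"confidential": 3, "project falcon": 5, "do not forward": 4}

message = "CONFIDENTIAL: Project Falcon budget attached."
print(total_weight(message, expressions))        # 8
print(list_triggers(message, expressions, 10))   # False
print(list_triggers(message, expressions, 8))    # True
```

In this model, two of the three phrases match, giving a total of 8. A list with a Threshold of 10 would not trigger its content rule, while a Threshold of 8 would.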

PERL/POSIX Regular Expressions

You can use PERL/POSIX regular expressions to create more flexible and powerful user-defined lexical expressions. For example, you might want to create user defined entities which detect telephone numbers, identity numbers beginning with a fixed character, or repeated words or phrases.
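
The three cases mentioned above might look like the following patterns. They are written for Python's `re` module, which uses Perl-style syntax, purely so that they can be tried out. The number formats are hypothetical, and the exact syntax accepted by the Gateway should be checked against the Regular Expressions topic.

```python
import re

# Telephone number in a hypothetical local format, such as 555-0100 or 555 0100.
PHONE = re.compile(r"\b\d{3}[- ]\d{4}\b")

# Identity number beginning with the fixed character K, followed by 8 digits
# (an invented format, used only for illustration).
IDENTITY = re.compile(r"\bK\d{8}\b")

# A word that is immediately repeated, using a Perl-style backreference (\1).
REPEATED = re.compile(r"\b(\w+)\s+\1\b", re.IGNORECASE)

sample = "Call 555-0100 about ID K12345678, it is is urgent."

print(PHONE.findall(sample))              # ['555-0100']
print(IDENTITY.findall(sample))           # ['K12345678']
print(REPEATED.search(sample).group(0))   # is is
```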

See Regular Expressions for more information.

Redaction

Adaptive Redaction enables you to hide sensitive information by finding and obscuring lexical expressions. Rather than blocking or stopping the communication, redaction ensures the message is delivered, or content transferred, with the offending expressions hidden by * characters.

You can enable redaction of individual expressions within a list. You can also enable redaction for an entire list.

For more information, see About Adaptive Redaction.
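
The idea of obscuring a matched expression with * characters can be sketched as follows. This is a simplified model rather than the Gateway's redaction engine: the `redact` function, the card number pattern and the choice to use one * per matched character are assumptions made for the example.

```python
import re


def redact(text, patterns):
    """Replace every match of every pattern with * characters.

    One * per matched character is an assumption of this sketch.
    """
    for pattern in patterns:
        text = re.sub(pattern, lambda m: "*" * len(m.group(0)), text)
    return text


# Hypothetical pattern: four groups of four digits separated by - or space.
CARD_PATTERN = r"\b(?:\d{4}[- ]){3}\d{4}\b"

message = "Card 4111-1111-1111-1111 was charged."
print(redact(message, [CARD_PATTERN]))
```

The rest of the message is left untouched, so the communication can still be delivered with only the matched text hidden.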

How do I...

Create a Lexical Expression List?

Navigate to Policy > Policy References > Lexical Expressions. The Lexical Expressions page is displayed.
Select the Lexical Expression Lists tab. All the existing lexical expression lists are displayed.
Click New. An editing page for the new lexical expression list is displayed.
In the Overview panel, click Click here to change these settings. Edit the Name and Notes of the lexical expression list as required, and click Save.

In the Lexical Expression panel, click Click here to change these settings. Configure a Threshold for your lexical expression list. This indicates the minimum total weight required to trigger any content rule to which this list is added.

Click Save.

Each Expression may trigger only once for each part of the message

This option ensures that each expression scores only once per message part (subject, body or attachment), even when more than one piece of text in that part matches the expression.

Even if different portions of text match a particular expression, for example two different account numbers match an account number pattern, this expression still scores only once with this option enabled. If you would prefer to score each unique occurrence, consider disabling this option and enabling the Ignore duplicate occurrences option on the expression instead.
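
The difference between these scoring modes can be illustrated with a small Python model. The `score_part` function and the account number format are hypothetical, and the behavior when both options are disabled (scoring every occurrence) is an assumption of this sketch rather than something stated above.

```python
import re


def score_part(text, pattern, weight, once_per_part=True, ignore_duplicates=False):
    """Score one message part for one expression (illustrative model only)."""
    matches = re.findall(pattern, text)
    if not matches:
        return 0
    if once_per_part:
        # The expression scores once, however many matches there are.
        return weight
    if ignore_duplicates:
        # Each distinct piece of matching text scores once.
        return weight * len(set(matches))
    # Assumed default: every occurrence scores.
    return weight * len(matches)


# Hypothetical account number format: ACC followed by four digits.
ACCOUNT = r"\bACC\d{4}\b"
body = "Transfer from ACC1001 to ACC2002, then ACC1001 again."

print(score_part(body, ACCOUNT, 2))                                                # 2
print(score_part(body, ACCOUNT, 2, once_per_part=False, ignore_duplicates=True))   # 4
print(score_part(body, ACCOUNT, 2, once_per_part=False))                           # 6
```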

Click New to add expressions, if required. Set the scoring weight using the If matched drop-down menu. Click Add to add the expression to the list.

User-defined expressions can be configured as Case sensitive by selecting the check box. Case sensitive expressions are indicated in the Lexical Expression List.

For more information on how to create Lexical Expressions, see Create a lexical expression.

Apply the configuration.

Import expressions into a Lexical Expression List?

You can import expressions into a Lexical Expression List using a Unicode .txt file.

Each expression must be listed on a separate line in the .txt file. Blank lines and lines beginning with # are ignored.
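
If you want to check an import file before uploading it, the rules above can be approximated in a few lines of Python. This is a sketch under stated assumptions, not the Gateway's import routine: the function name and sample content are invented, and treating whitespace-only lines as blank is a guess.

```python
def parse_expression_lines(lines):
    """Return the expressions from an import file, one per non-ignored line."""
    expressions = []
    for line in lines:
        line = line.rstrip("\r\n")
        if not line.strip():
            continue  # blank line (whitespace-only is assumed to count as blank)
        if line.startswith("#"):
            continue  # line beginning with # is ignored
        expressions.append(line)
    return expressions


# Hypothetical file content, shown in memory so that the example is self-contained.
sample = """# Project code names
Project Falcon

Project Osprey
#Project Retired
"""

print(parse_expression_lines(sample.splitlines()))
# ['Project Falcon', 'Project Osprey']
```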

Use an expression to detect specific values?

Secure Email Gateway uses pattern matching technology to detect character patterns. You can also qualify a regular expression or predefined text entity to look for specific data. For example, you might want to detect a unique set of account numbers, names from an address list or credit card numbers which are stored in an external data source.

See Lexical Expression Qualifiers for more information.
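
Conceptually, a qualifier narrows a general pattern down to a known set of values. The following Python sketch shows that idea only: the eight-digit account format, the sample values and the `find_qualified` function are hypothetical and do not represent how the Gateway stores or applies qualifiers.

```python
import re

# General pattern: any eight-digit number (hypothetical account number format).
ACCOUNT_PATTERN = re.compile(r"\b\d{8}\b")

# Specific values, for example imported from an external data source.
qualifiers = {"10293847", "56473829"}


def find_qualified(text, pattern, allowed_values):
    """Return only the pattern matches that are also in the set of qualifiers."""
    return [match for match in pattern.findall(text) if match in allowed_values]


text = "Accounts 10293847 and 99999999 were updated."
print(ACCOUNT_PATTERN.findall(text))                      # ['10293847', '99999999']
print(find_qualified(text, ACCOUNT_PATTERN, qualifiers))  # ['10293847']
```

Both numbers match the general pattern, but only the one that appears in the qualifier set is reported.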

Apply an expression list to a content rule?

Content rules use policy references (such as lexical expression lists) to look for content which violates your security policy. When you have configured an expression list, you can configure a content rule to detect the expressions it contains.

Example: English swear words

I want to create a content rule which detects and blocks any communication containing English swear words.

Swear words and profanities are defined by managed lists. Add the Swear Words: English managed list to a content rule.

Click Policy > Content Rules > New to create a content rule.

Select Detect Lexical Expression from the list of templates.

Configure the Lexical Expression section of the What To Look For? actions.

Select Swear Words: English from the Expression list drop-down menu.

Configure your What To Do? actions to block or hold the communication.

Apply the configuration.

For more information on configuring a content rule, see Content rules.
