Selecting the data to fingerprint
Administrator Help | Forcepoint DLP | Version 8.7.x
Fingerprinting is a powerful means of data monitoring and protection, but the processing can be time-consuming. For this reason, carefully consider what information to fingerprint.
When selecting the data to fingerprint, follow the rules below to achieve the right balance between optimal performance and accurate detection of your sensitive data.
1. Avoid fingerprinting short values
Fingerprinting columns with short field values can lead to multiple false-positive incidents.
For numeric fields, we recommend that you fingerprint only values with 5 or more digits (>= 10000).
Note: The validation script template removes numbers with values less than the configured minimum (see Patterns & Phrases for more details).
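The minimum-value rule for numeric fields can be sketched as follows. This is a minimal illustration in Python, not the Forcepoint validation script template itself. The function name `keep_numeric_value` and the constant `MIN_NUMERIC_VALUE` are hypothetical, and the threshold simply mirrors the 5-digit recommendation above.

```python
MIN_NUMERIC_VALUE = 10000  # assumed threshold: values with 5 or more digits


def keep_numeric_value(field: str, minimum: int = MIN_NUMERIC_VALUE) -> bool:
    """Return True if a numeric field is large enough to fingerprint."""
    text = field.strip()
    if not (text.isascii() and text.isdigit()):
        # Not a plain non-negative integer, so this sketch skips it.
        return False
    return int(text) >= minimum


# Hypothetical column values; only the last two numeric ones pass the filter.
values = ["42", "9999", "10000", "123456789", "N/A"]
kept_values = [v for v in values if keep_numeric_value(v)]
```

A value such as `9999` is dropped because short numbers are likely to appear in unrelated content and trigger false positives.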
For non-numeric fields, we recommend that you fingerprint only values with 4 or more characters.
Note: The validation script template removes non-numeric fields shorter than the configured length in characters.
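A comparable length check for non-numeric fields might look like the sketch below. The names `keep_text_value` and `MIN_TEXT_LENGTH` are hypothetical, and trimming surrounding whitespace before measuring the length is an assumption of this example, not documented product behavior.

```python
MIN_TEXT_LENGTH = 4  # assumed threshold: 4 or more characters


def keep_text_value(field: str, min_length: int = MIN_TEXT_LENGTH) -> bool:
    """Return True if a non-numeric field is long enough to fingerprint."""
    # Surrounding whitespace does not count toward the length in this sketch.
    return len(field.strip()) >= min_length


# Hypothetical column values; padding alone does not make "Al" long enough.
names = ["Li", "Ann", "Anna", "Johnson", "  Al  "]
kept_names = [n for n in names if keep_text_value(n)]
```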
2. Avoid fingerprinting columns with repetitive values
Columns with repetitive values are quite common in databases. Fingerprinting such columns may cause performance issues both during the fingerprinting stage and during real-time analysis. Fingerprinted repetitive fields can cause a large number of records to match an analyzed transaction, and the policy engine then needs time to go over all of the results.
For now, Forcepoint recommends that you avoid fingerprinting columns with repetitive values. Such columns often have a very limited range of values, so they can instead be turned into a dictionary and attached to other policy rules in a database policy.
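One way to spot repetitive columns before fingerprinting is to measure how many distinct values each column holds. The sketch below is an illustration under stated assumptions: the data is available as CSV, the helper `distinct_ratio` and the cutoff `MIN_DISTINCT_RATIO` are hypothetical, and the cutoff value is arbitrary and should be tuned for your own data.

```python
import csv
import io
from collections import Counter

MIN_DISTINCT_RATIO = 0.8  # hypothetical cutoff, not a Forcepoint setting


def distinct_ratio(rows, column):
    """Return distinct values divided by total rows for one column."""
    counts = Counter(row[column] for row in rows)
    total = sum(counts.values())
    return len(counts) / total if total else 0.0


# Made-up sample data standing in for a database export.
SAMPLE = """ssn,state
123-45-6789,CA
987-65-4321,CA
555-12-3456,NY
111-22-3333,CA
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))

for column in ("ssn", "state"):
    ratio = distinct_ratio(rows, column)
    if ratio >= MIN_DISTINCT_RATIO:
        verdict = "candidate for fingerprinting"
    else:
        verdict = "repetitive, consider a dictionary"
    print(f"{column}: {ratio:.2f} distinct -> {verdict}")
```

In this sample every `ssn` value is unique, while `state` has only two distinct values across four rows, so `state` is the kind of column better handled as a dictionary.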
3. Avoid fingerprinting uninteresting/irrelevant values
Some database tables or CSV files may contain values that should be ignored and excluded from fingerprinting. For example, a table may contain the value 'N/A' in place of a valid SSN. Reviewing incidents after the data has been fingerprinted may reveal additional values to ignore.
The validation script template (described under Creating a validation script) allows you to ignore values that are specified in an external "ignored dictionary" file. If preferred, you can write your own scripts that filter any custom type of irrelevant data.
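The idea of an ignored dictionary can be sketched as a simple set lookup. This example does not reproduce the validation script template. The function names `load_ignored` and `keep_value` are hypothetical, as is the assumed file format of one ignored value per line matched case-insensitively.

```python
def load_ignored(lines):
    """Build a lowercase set of ignored values, one per non-empty line."""
    return {line.strip().lower() for line in lines if line.strip()}


def keep_value(field, ignored):
    """Return True if the field is non-empty and not in the ignored set."""
    text = field.strip()
    return bool(text) and text.lower() not in ignored


# Stand-in for the lines of an ignored-dictionary file (assumed format).
ignored = load_ignored(["N/A\n", "unknown\n", "\n", "000-00-0000\n"])

# Hypothetical SSN column containing placeholder values.
ssns = ["123-45-6789", "N/A", "n/a ", "000-00-0000", ""]
kept_ssns = [s for s in ssns if keep_value(s, ignored)]
```

To read the ignored values from an actual file, the same function accepts an open file object, for example `load_ignored(open(path, encoding="utf-8"))`, where `path` is whatever location you choose for the dictionary.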

Copyright 2020 Forcepoint. All rights reserved.