/
Pattern Matching

Pattern Matching

The pattern matching functionality allows users to identify particular pieces of information in a document. A pattern is considered as a RegEx that will be matched and the rules that may or may not apply once detected.

Pattern Matching page

Select Administration and then Pattern Matching:

 

image-20240628-154811.png

 

A selection of pre-configured patterns available:

image-20240628-154846.png

If these patterns are detected during a scan they will be presented using the “Keyword Hits” to the user. On the Pattern Matching page, users have access to various Filters and Options:

 

  • Filters and options explained:

    • Search: Enter text here to filter patterns based in name

    • Classification: Filter by classification tags associated with patterns

    • Compliance: Filter by compliance tags associated with patterns

    • Distribution: Filter by distribution tags associated with patterns

    • Categories: Filter by file categories associated with patterns

    • Subcategories: Filter by file subcategories associated with patterns

    • Enabled: Filter by patterns that have been enabled or disabled.

    • Published: Filter by patterns that have been published or unpublished

    • Add New Pattern: Create a custom pattern

    • Publish: Push changes to the pattern matching system for start using

    • Clear filters: Remove all previously selected filters

 

Create a New Pattern

 

 

  • Options to create pattern matching explained:

    • Pattern Name: identifies the RegEx when it is found by the software

    • Regular Expression: the sequence to be matched

    • Enabled: whether the pattern will be searched for by the software

    • Hide RegEx in UI: obfuscates the regular expression

    • Tag Overrides: when the RegEx is found these tags will be written to the file

    • Classifications: security levels

    • Compliance: regulations that apply to data

    • Distribution: policies on how data should distributed

    • Category: data grouping

    • Subcategory: data subgrouping

    • Cancel: exit without saving

    • Create: save pattern information and exit

Glossary of Pattern Matching terms

RegEx: Regular Expression, a sequence or pattern that is searched for in text. Ex-ID uses Java RegEx notation.

Rules: Instructions for Ex-ID about what to do when a RegEx is detected in a file.

Pattern: The RegEx and rules associated with its detection.

Pattern Name: Used to identify the pattern when it is detected.

Classification: Tags that help secure documents and other files. e.g. Public, Internal, and Confidential.

Compliance: Tags that help organisations conform to certain regulatory regimes. By applying compliance tags such as GDPR/PII to RegEx such as Social Security number, organisations can identify all related documents.

Distribution: Tags that specify how a files should be moved either within or outside an organisation.

Category: From Getvisibility’s ML model. These are groupings of information based on their use. e.g. Finance, HR, or Technical Documents.

Subcategory: From Getvisibility’s ML model. These are sub-groupings of information based on their particular use. e.g. CV (resume), Code, or Sales Agreement.

Publish: The action of pushing the enabled patterns to be used. As some parts of the system need to be restarted in order to take on a new pattern matching configuration, we allow users to chose when to enact the configuration so as not to impact the workflow of others.

Unpublished: A pattern that has been created, changed, or edited but has not been pushed to the pattern matching system.

Published: A pattern that is currently part of the pattern matching configuration.

Disabled: A pattern that is currently part of the pattern matching configuration but is not to be detected.

Enabled: An active pattern. One that is part of the configuration and will be used by the pattern matching system.

Hide RegEx: Ex-ID allows for RegEx notations to be obfuscated for security and intellectual property reasons.

Related content

Ex-ID 1.0 (Pattern Matching Software from Getvisibility)
Ex-ID 1.0 (Pattern Matching Software from Getvisibility)
More like this
Agent not receiving ML based suggestions
Agent not receiving ML based suggestions
Read with this
Agent 4.5.0 - Windows and Mac/O365
Agent 4.5.0 - Windows and Mac/O365
Read with this
[22.0.4+] Keycloak User Federation Configuration (LDAP/AD)
[22.0.4+] Keycloak User Federation Configuration (LDAP/AD)
Read with this

Classified as Getvisibility - Partner/Customer Confidential