Privacy PII Detection Architecture and Flow
The File Access Manager Data Privacy feature is powered by SpaCy, an open-source library for advanced natural language processing (NLP). Using a SpaCy AI model, File Access Manager detects names and addresses through contextual analysis of the contents of scanned documents.
Text PII Detection
File Access Manager uses the following tools to detect PII:
- Name and Address – the SpaCy NER (named entity recognition) module, used with a SailPoint custom-trained model, detects both names and addresses.
- Identifications (Social Security numbers, employee ID numbers, etc.) – File Access Manager uses regular expressions to identify IDs.
- Emails – File Access Manager uses the SpaCy built-in email regular expression pattern.
- Phone Numbers – File Access Manager uses the SpaCy pattern matching feature to detect phone numbers based on predefined formats.
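The regular-expression approach used for identifications can be illustrated with a minimal sketch. The patterns below are assumptions made for this example (a US Social Security number written as NNN-NN-NNNN, and a hypothetical employee ID of the form EMP- followed by six digits); the expressions File Access Manager actually ships with are not listed here, and the name find_ids is likewise hypothetical.

```python
import re

# Assumed patterns, for illustration only; not the product's real expressions.
ID_PATTERNS = {
    # US Social Security number written as NNN-NN-NNNN
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    # Hypothetical employee ID: "EMP-" followed by six digits
    "employee_id": re.compile(r"\bEMP-\d{6}\b"),
}


def find_ids(text):
    """Return (label, matched_text) pairs in order of appearance in text."""
    hits = []
    for label, pattern in ID_PATTERNS.items():
        for match in pattern.finditer(text):
            hits.append((match.start(), label, match.group()))
    hits.sort()  # order by position in the scanned text
    return [(label, value) for _, label, value in hits]
```

Calling find_ids on a line of extracted document text returns each identifier together with the label of the pattern that matched it, which is the kind of result a scan could record as a PII finding.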
File Access Manager's Privacy PII detection engine attempts to match both local and international phone numbers, provided they follow the standard format of the relevant country.
Supported Formats     Unsupported Formats
+1.253.215.8782       1300030886
(212) 465-6471        22.4389483
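As a rough illustration of format-based phone matching, the sketch below checks candidates against two patterns inferred only from the supported examples listed above. It uses plain Python regular expressions as a stand-in and is not the SpaCy pattern matching feature itself; the pattern list and the name is_supported_phone are assumptions made for this example.

```python
import re

# Assumed formats, inferred only from the supported examples listed above.
PHONE_FORMATS = [
    re.compile(r"\+\d{1,3}\.\d{3}\.\d{3}\.\d{4}"),  # e.g. +1.253.215.8782
    re.compile(r"\(\d{3}\) \d{3}-\d{4}"),           # e.g. (212) 465-6471
]


def is_supported_phone(candidate):
    """Return True if the whole candidate string matches one predefined format."""
    return any(pattern.fullmatch(candidate) for pattern in PHONE_FORMATS)
```

Under these assumed patterns, the formatted examples are accepted, while bare digit runs such as 1300030886 or 22.4389483 are rejected because they carry none of the separators that a predefined format requires.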