Privacy PII Detection Architecture and Flow

The File Access Manager Data Privacy feature is powered by spaCy, an open-source library for advanced natural language processing. Using a spaCy model, File Access Manager detects names and addresses through contextual analysis of the contents of scanned documents.

Text PII Detection

File Access Manager uses several techniques to detect PII in text:

  • Name and Address – File Access Manager uses the spaCy NER (named entity recognition) module with a SailPoint custom-trained model to detect both names and addresses

  • Identifications (Social Security numbers, employee ID numbers, etc.) – File Access Manager uses regular expressions to identify IDs

  • Emails – File Access Manager uses the spaCy built-in email pattern

  • Phone Numbers – File Access Manager uses the spaCy pattern matching feature to detect phone numbers based on predefined formats

    File Access Manager's Privacy PII detection engine attempts to match both local and international phone number patterns, provided they comply with the standard format of the relevant country.
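    The pattern-based detectors in the list above can be illustrated with a minimal sketch. It uses Python's standard re module as a stand-in, not spaCy and not the product's actual rules. The names SSN_RE, EMAIL_RE, PHONE_RES and find_pii are hypothetical, and the patterns are assumptions: a US-style Social Security number layout, a simplified email shape, and the two supported phone layouts shown in this section.

```python
import re

# Hypothetical patterns for illustration only; the product's real rules are not shown here.
SSN_RE = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")                # assumed layout: 123-45-6789
EMAIL_RE = re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b")   # simplified email shape

# Phone layouts assumed from the supported examples in this section:
#   +1.253.215.8782  and  (212) 465-6471
PHONE_RES = [
    re.compile(r"(?<!\w)\+\d{1,3}\.\d{3}\.\d{3}\.\d{4}(?!\d)"),
    re.compile(r"(?<!\w)\(\d{3}\) \d{3}-\d{4}(?!\d)"),
]


def find_pii(text):
    """Return a sorted list of (label, matched_text) pairs found in text."""
    hits = [("SSN", m.group()) for m in SSN_RE.finditer(text)]
    hits += [("EMAIL", m.group()) for m in EMAIL_RE.finditer(text)]
    for pattern in PHONE_RES:
        hits += [("PHONE", m.group()) for m in pattern.finditer(text)]
    return sorted(hits)


if __name__ == "__main__":
    sample = "Call (212) 465-6471 or mail jane.doe@example.com."
    for label, value in find_pii(sample):
        print(label, value)
```

    Because each phone pattern encodes one concrete layout, a bare digit run such as 1300030886 matches none of them. This mirrors the idea of detection based on predefined formats, although the real engine may accept more layouts than this sketch does.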

     

    Supported Formats    Unsupported Formats
    +1.253.215.8782      1300030886
    (212) 465-6471       22.4389483

    (212) 465-6471