Optical Character Recognition (OCR)
File Access Manager can identify text from within image files either directly or embedded in other files – such as a scanned documents or a collection of scans stored in a zip file. Files less than 1000 pixels across will not be scanned to avoid less reliable results from low resolution images.
The data privacy engine can analyze files containing sensitive data in image form.
Note: The optical character recognition process is resource intensive and should be configured carefully taking the run-time into consideration. It is disabled by default.
OCR Capability can be added to the scope selected in the DSAR Scope screen.
Enabling Optical Character Recognition
By default, optical character recognition is disabled on the entire scope of the DSAR. To enable optical character recognition on a resource, edit the application scope line.
-
Find the desired application from the DSAR Scope screen.
-
Click Edit.
-
Click Optical Character Recognition (OCR) to enable OCR analysis for this application.