Crawling Overview

Crawling is the process that discovers the business resources (BRs) of a specific application type. It is the first task involving an application, since BRs are required for many other activities involving applications, such as Permissions Collection and Access Certification.

For example, a crawler may discover folders (BR) on a file server (an application type), or mailboxes and folders (BRs) on Exchange (an application type).

Before beginning the crawling process, you must install and run the permissions collection service for each application.

The crawling process involves the following:

  • Discovery of business resources and the population of a BR tree
  • Business resource size calculation

    File Name                   File Type                  Size
    Finance Balance Sheet.xls   Excel (*.xls)              2M
    Finance Salaries.docx       Word (*.docx)              1M
    Finance Departments.txt     Text (*.txt)               3M
    Finance Organization.ppt    PowerPoint (*.ppt)         5M
    Finance Other Files         (An uncommon file type)    4M

  • Summary of business resource size by file type

    Category Name         Size
    Office Files          (2M + 1M + 5M) = 8M
    Text Files            3M
    Finance Other Files   4M
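
The size summary in the example above can be reproduced with a short script. This is only an illustrative sketch, not how File Access Manager computes the summary: the extension-to-category mapping, the "Other Files" fallback name, and the function name are assumptions made for the example. Sizes are the values in M from the table.

```python
import os
from collections import defaultdict

# Hypothetical mapping of file extensions to summary categories.
# The product's real category definitions are not given in this document.
CATEGORY_BY_EXTENSION = {
    ".xls": "Office Files",
    ".docx": "Office Files",
    ".ppt": "Office Files",
    ".txt": "Text Files",
}

# (file name, size in M) pairs copied from the example table.
FILES = [
    ("Finance Balance Sheet.xls", 2),
    ("Finance Salaries.docx", 1),
    ("Finance Departments.txt", 3),
    ("Finance Organization.ppt", 5),
    ("Finance Other Files", 4),  # no recognized extension
]


def summarize_by_category(files):
    """Sum file sizes per category; unknown extensions go to 'Other Files'."""
    totals = defaultdict(int)
    for name, size in files:
        extension = os.path.splitext(name)[1].lower()
        category = CATEGORY_BY_EXTENSION.get(extension, "Other Files")
        totals[category] += size
    return dict(totals)


summary = summarize_by_category(FILES)
print(summary)
```

The three Office file types add up to 2 + 1 + 5 = 8, matching the "Office Files" row of the summary.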

The Business Resource Trees display the results of crawling in various locations in File Access Manager.

Interaction of Crawling with Permissions Analysis

The permissions analysis process, in brief:

  • The crawling process collects the application's BRs.

  • In parallel, the Identities Collector collects users and groups. This may happen before the crawler collects the BRs, since the two collections are independent of each other.

  • The Permissions Collector collects the BRs, users, and groups, and associates them with permission types to create permissions.
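
The flow above can be pictured with a small sketch. Everything in it is hypothetical and for illustration only: the function names, the sample paths and identities, and the rule that entries referring to an uncollected BR or identity are skipped are assumptions, not the product's actual behavior or API.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class Permission:
    identity: str
    resource: str
    permission_type: str


def crawl_business_resources():
    """Hypothetical stand-in for the crawler: returns discovered BRs."""
    return {"/Finance", "/Finance/Reports"}


def collect_identities():
    """Hypothetical stand-in for the Identities Collector."""
    return {"alice", "finance-admins"}


# Hypothetical raw access entries: (identity, resource, permission type).
RAW_ENTRIES = [
    ("alice", "/Finance", "Read"),
    ("finance-admins", "/Finance/Reports", "Write"),
    ("bob", "/HR", "Read"),  # neither bob nor /HR was collected
]


def collect_permissions(resources, identities, raw_entries):
    """Associate known BRs and identities with permission types."""
    return [
        Permission(identity, resource, permission_type)
        for identity, resource, permission_type in raw_entries
        if identity in identities and resource in resources
    ]


# The two collections do not depend on each other, so they can run
# in parallel and finish in either order.
with ThreadPoolExecutor() as pool:
    resources_future = pool.submit(crawl_business_resources)
    identities_future = pool.submit(collect_identities)
    resources = resources_future.result()
    identities = identities_future.result()

permissions = collect_permissions(resources, identities, RAW_ENTRIES)
for permission in permissions:
    print(permission)
```

The point of the sketch is the dependency order: permissions can only be built once both the BRs and the identities are available, while those two inputs can be gathered independently.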