Features - Data Classification Enabler

Features - Data Classification Enabler

Table of Contents Overview System Requirements - Data Classification Enabler Installation z Install Data Classification - Windows z Install Data Classification - Unix Setting Up Data Classification Enabler DC Client Command Line Tool for Unix Data Classification Console - Windows DC Client Command Line Tool for Unix Rules and Queries Users and User Groups Services Frequently Asked Questions

Page 1 of 65 Features - Data Classification Enabler

Overview - Data Classification Enabler

Topics | Support

Choose from the following topics:

z Introduction z Supported Data Types z Tree Levels in Data Classification z Change Journal z License Requirement

Introduction The Data Classification Enabler is a feature whose purpose is to enhance or "enable" agent capabilities beyond their standard scope. The enhancements that the enabler brings to the agent include:

z Enhanced and improved scan capabilities and speeds z Ability to extend the rules for which data archiving takes place beyond traditional files and folder paths. Supported for the File Archiver for Windows Agent (Local instance) and File Archiver for Unix Agent Data Classification is designed to be robust and fault-tolerant. Some of the ways the enabler accomplishes this include the following:

z It automatically recreates its metadata if the database is deleted or compromised z It automatically re-scans if the initial scan does not complete z It automatically resynchronizes with the system data if the services are interrupted z It automatically detects and scans new volumes as they come online. Data Classification on Windows can attempt to scan all the affected volumes even if the Data Classification scan fails on one volume. Therefore, the fallback scan methods (including Classic File Scan and Change Journal) if available will be used only on those volumes where Data Classification is not accessible. The following components can be used with the Data Classification Enabler on Unix:

z File Archiver for Unix Agent z Unix File System iDataAgents The following components can be used with the Data Classification Enabler on Windows:

z Exchange Mailbox Archiver z File Archiver for Windows Agent (Local File System instance) z Microsoft Exchange 2003/2007 Mailbox iDataAgents z Online Content Indexing for Exchange z Online Content Indexing for Windows File System iDataAgent z Windows File System iDataAgents z SRM Windows File System Agent

Supported Data Types For Windows, NTFS volumes on local disks are supported. For Exchange, all mailbox contents except journal mailbox contents are supported.

Page 2 of 65 Features - Data Classification Enabler

The Data Classification Enabler on Unix supports the following file system types:

Supported File System(s) Platform Associated with the Enabler Enhanced Journal File System (JFS2) AIX Extended 2 File System (ext2) Linux Extended 3 File System (ext3) Linux Global File System 2 (GFS2) Linux Unix File System (UFS) Solaris VERITAS File System (VxFS) Solaris 'X' File System (XFS) Linux Zettabyte File System (ZFS) Solaris

For Unix, you can add or delete the file system types to be monitored.

Tree Levels in Data Classification To use Data Classification with a supported agent, you must install the agent along with the Data Classification software. Once installed, several agents require configuration from the agent tree in the CommCell Console. These agents are discussed in this section.

To use the Data Classification Enabler with an eligible Exchange 2007 agent, both the enabler and the agent must be installed on a proxy.

File Archiver for Windows Once you install the required items for a File Archiver for Windows Agent, your CommCell Browser may be displayed as follows:

cocoa71: Client

File Archiver: Agent

z To administer local Windows data for the File Archiver for Windows Agent, you must create a Local File System Instance type. Once you do this, a DataClassSet (backup set) will be displayed below the instance. You must then create a DataClassSet Subclient. To enable the agent to use Data Classification, configure the agent as discussed in Enable and Configure the Agents: File Archiver for Windows Agent: Local File System Instance.

File Archiver for Unix For the File Archiver for Unix Agent, a DataClassSet (backup set) is displayed once the agent and enabler are installed. To administer data for this agent, you must create a DataClassSet Subclient. To enable the agent to use Data Classification, configure the agent as discussed in Enable and Configure the Agents:

Page 3 of 65 Features - Data Classification Enabler

File Archiver for Unix Agent.

Change Journal ContinuousDataReplicator on Windows, the Data Classification Enabler on Windows, and the Windows File System iDataAgent use Change Journal to track updates made to Windows File Systems. On very large or very busy file systems, it may be necessary to increase the size of the change journal in cases where the agent or enabler is performing full scans too frequently. You can control the amount of volume space that is allocated for Change Journal when it is created by using the dwCJSizeAsPercentOfVolumeSize registry key value.

License Requirement The Data Classification Enabler does not consume a license. Some agents or components that support the enabler require configuration. For more information, see Tree Levels in Data Classification and Enable and Configure the Agents. Back to Top

Page 4 of 65 Features - Data Classification Enabler

System Requirements - Data Classification Enabler

The following requirements are for Data Classification:

Operating System Processor

AIX AIX 5.2 with maintenance level 7 (or higher) and Power PC (Includes IBM System p) runtime library xlC.rte 8.0.0.0 or higher AIX 5.3 with technology level 6 (or higher) and runtime library xlC.rte 8.0.0.0 or higher AIX 6.1

Linux Red Hat Enterprise Linux Red Hat Enterprise Linux AS 4.0 with kernel 2.6.9- Intel Pentium or compatible minimum 34 (Update 3) required Red Hat Enterprise Linux AS 4.0 with kernel 2.6.9- 42 (Update 4) Red Hat Enterprise Linux AS/ES 4.0 (Update 5) Red Hat Enterprise Linux AS 4.0 with kernel 2.6.9- 78 (Update 7) Red Hat Enterprise Linux 5 Advanced Platform with glibc 2.5.x Red Hat Enterprise Linux 5 Advanced Platform with kernel 2.6.18-92 (Update 2) Red Hat Enterprise Linux 5 Advanced Platform with kernel 2.6.18-128 (Update 3)

Red Hat Enterprise Linux AS 4.0 with kernel 2.6.9- x64 34 (Update 3) Red Hat Enterprise Linux AS 4.0 with kernel 2.6.9- 42 (Update 4) Red Hat Enterprise Linux AS/ES 4.0 (Update 5) Red Hat Enterprise Linux AS 4.0 with kernel 2.6.9- 78 (Update 7) Red Hat Enterprise Linux 5 Advanced Platform with glibc 2.5.x Red Hat Enterprise Linux 5 Advanced Platform with kernel 2.6.18-92 (Update 2) Red Hat Enterprise Linux 5 Advanced Platform with kernel 2.6.18-128 (Update 3)

For Red Hat Linux, GFS is supported on single nodes. Clustered file systems are not supported.

SuSE Linux Intel Pentium or compatible minimum required SuSE Linux 9.0 Enterprise Server Edition with glibc

Page 5 of 65 Features - Data Classification Enabler

2.3.x (Patch Level 3) SuSE Linux 9.0 Enterprise Server with kernel 2.6.16.46-0.12 (Patch Level 3) SuSE Linux 10 Enterprise Server with kernel 2.6.16.60-0.21 (Patch Level 1) SuSE Linux 10 Enterprise Server with kernel 2.6.16.60-0.21 (Patch Level 2) SuSE Linux 10 Enterprise Server with kernel 2.6.16.60-0.54.5 (Patch Level 3)

SuSE Linux 9.0 Enterprise Server with kernel x64 2.6.16.46-0.12 (Patch Level 3) SuSE Linux 10 Enterprise Server with kernel 2.6.16.60-0.21 (Patch Level 1) SuSE Linux 10 Enterprise Server with kernel 2.6.16.60-0.21 (Patch Level 2)

Solaris Solaris 8 with a minimum of Service Packs 108528- Sun Sparc5 or higher recommended 13 Solaris 9 with a minimum of Service Packs 111711- 02 Solaris 10.x

Solaris 10.x x64

For Solaris, the Zettabyte File System (ZFS) is required to ensure that the FSF Driver install will work.

Windows Windows XP All Windows-compatible processors supported XP Professional 32-bit and x64 Editions

Windows 2003 Microsoft Windows Server 2003 32-bit, 64-bit and x64 Editions with a minimum of Service Pack 1

Windows Vista Microsoft 32-bit and x64 Editions

Windows 2008 Microsoft Windows Server 2008 32-bit and x64 Editions* Microsoft Windows Server 2008 R2 Editions*

*Core Editions not supported Windows 7 Microsoft Windows 7 32-bit and x64 Editions

Cluster Support See Clustering - Support

Page 6 of 65 Features - Data Classification Enabler

Supported Applications The Data Classification Enabler for Exchange is supported for the following applications:

z Microsoft Exchange 2003 Server z Microsoft Exchange 2007 64-bit Server

Data Classification for Exchange is not supported only if Exchange 2007 Cluster Continuous Replication (CCR) Configuration is installed on Windows 2003.

SRM Support See System Requirements - SRM Windows File System Agent

Processor All Windows-compatible processors supported

Memory 8 MB plus an additional 8-13 MB for each NTFS volume minimum required

Hard Disk For the software, a file stored at the root of each NTFS volume of approximately 40 MB for every 100,000 files/folders on that volume with the default Indexes. The DC_CREATE_INDEX registry key allows you to administer indices for Data Classification databases. The metafiles (databases) created by Data Classification usually consume about 5% of the total space on the hard disk. Depending on the type of data and folder layout, the metafiles may consume additional space.

Peripherals DVD-ROM drive Network Interface Card

Miscellaneous TCP/IP Services configured on the computer. Task Scheduler services configured on the computer The Data Classification Enabler on Unix supports the following file system types:

Supported File System(s) Platform Associated with the Enabler Enhanced Journal File System (JFS2) AIX Extended 2 File System (ext2) Linux Extended 3 File System (ext3) Linux Global File System 2 (GFS2) Linux Unix File System (UFS) Solaris VERITAS File System (VxFS) Solaris 'X' File System (XFS) Linux Zettabyte File System (ZFS) Solaris

Data Classification can be installed independently of any agent. For System Requirements and install information specific to the File System iDataAgents, refer to:

Page 7 of 65 Features - Data Classification Enabler

z System Requirements - File Archiver for Windows Agent z System Requirements - Windows File System iDataAgent Microsoft Visual C++ 2008 Redistributable Package is automatically installed. Note that Visual C++ 2008 Redistributable Package can co-exist with other versions of this software. .NET Framework 2.0 is automatically installed. Note that .NET Framework 2.0 can co-exist with other versions of this software. If you have SELinux enabled on the client computer, create the SELinux policy module as a root user before performing a backup. The SELinux Development package must be installed on the client. To create SELinux policy module, perform the following steps as user "root": 1. Create the following files in the /usr/share/selinux/devel directory:

File Name Content of the file /.te The content of the file should be as follows:

where: policy_module(,) is /usr/share/selinux/devel ##############################

is the name of the Unix file, created to where: save the policy module statement. It is a good idea to is the name of the policy module. You can use the same name for policy module and the file. give any unique name to the policy module, such For example: When you are creating a policy module as a or application name. for backup_IDA application, you can use the following is the version of the policy module. It file name: backup_IDA.te can be any number, such as 1.0.0.

For Example: While creating a policy module for the backup_IDA application, you can use the following content. policy_module(backup_IDA,1.0.0) /.fc The content of the file should be as follows:

where: Note that the following list of files is not exhaustive. If the process fails to launch, is /usr/share/selinux/devel check /var/log/messages. Also, if required, add is the name of the Unix file, created to it to the following list of files. save the policy module statement. It is a good idea to /opt//Base/libCTreeWrapper.so -- gen_context For example: When you are creating a policy module (system_u:object_r:texrel_shlib_t,s0) for backup_IDA application, you can use the following file name: backup_IDA.fc /opt//Base/libCVMAGuiImplgso -- gen_context (system_u:object_r:texrel_shlib_t,s0) /opt//Base/libdb2locale.so.1 -- gen_context (system_u:object_r:texrel_shlib_t,s0) /opt//Base/libdb2osse.so.1 -- gen_context (system_u:object_r:texrel_shlib_t,s0) /opt//Base/libDb2Sbt.so --

Page 8 of 65 Features - Data Classification Enabler

gen_context (system_u:object_r:texrel_shlib_t,s0) /opt//Base/libdb2trcapi.so.1 -- gen_context (system_u:object_r:texrel_shlib_t,s0) /opt//Base/libDrDatabase.so -- gen_context (system_u:object_r:texrel_shlib_t,s0) /opt//Base/libIndexing.so -- gen_context (system_u:object_r:texrel_shlib_t,s0) /opt//Base/libSnooper.so -- gen_context (system_u:object_r:texrel_shlib_t,s0)

2. Create the policy file from command line. Use the following command. Ensure that you give the following commands in the /usr/share/selinux/devel directory.

[root]# make backup_IDA.pp Compiling targeted backup_IDA module /usr/bin/checkmodule: loading policy configuration from tmp/backup_IDA.tmp /usr/bin/checkmodule: policy configuration loaded /usr/bin/checkmodule: writing binary representation (version 6) to tmp/backup_IDA.mod Creating targeted backup_IDA.pp policy package rm tmp/backup_IDA.mod tmp/backup_IDA.mod.fc [root]# semodule -i backup_IDA.pp [root]#

3. Execute the policy module. Use the following command:

[root]# restorecon -R /opt/

SELinux is now configured to work with this application.

DISCLAIMER

Minor revisions and/or service packs that are released by application and vendors may, in some cases, affect the working of our software. Although we may list such revisions and/or service packs as “supported” in our System Requirements, changes to the behavior of our software resulting from an application or operating system revision/service pack may be beyond our control. However, we will make every effort to correct such disruption as quickly as possible. When in doubt, please contact your software provider to ensure support for a specific application or operating system.

Additional considerations regarding minimum requirements and End of Life policies from application and operating system vendors are also applicable.

Page 9 of 65 Features - Data Classification Enabler

Install Data Classification - Windows

Click on a link below to go to a specific section of the software installation:

z Install Requirements z Install Checklist z Before You Begin z Install Procedure { Getting Started { Select Components for Installation { Data Classification Destination Folder { Administer Data Classification { Verify Summary of Install Options { Install Remaining Cluster Nodes { Setup Complete z Post-Install Considerations

Install Requirements

As a standalone module, Data Classification can be installed on the computer independently or along with other eligible agents. Data Classification can be installed on a computer that either has or does not have Multi Instancing. If the computer has Multi Instancing, Data Classification can be installed on an instance other than Instance 001, but it cannot be installed on more than one instance. Data Classification does not require a Windows File System iDataAgent. In a clustered environment, Data Classification is installed only on the physical nodes. The Data Classification software itself will run just on the physical nodes and keep the metafile information for each volume on the same volume. By installing the Data Classification Enabler on the physical nodes of the cluster, Data Classification will be automatically enabled on the virtual node(s). The Data Classification – Windows install can be used to enable the Data Classification Enabler for the following agents:

z Exchange Mailbox Archiver z Microsoft Exchange Server 2003/2007 Mailbox iDataAgents z File Archiver for Windows Agent (Local File System instance) z Online Content Indexing for Exchange z Online Content Indexing for Windows File System iDataAgent z SRM Windows File System Agent z Windows File System iDataAgent Verify that the computer in which you wish to install the software satisfies the requirements specified in System Requirements - Data Classification. The following procedure describes the steps involved in installing Data Classification. If you choose to install multiple components simultaneously, refer to the appropriate procedures for installation requirements and steps specific to the component. Note that when you install multiple components, the sequence of the install steps may vary. Review the following Install Requirements before installing the software: General z Agents should only be installed after the CommServe and at least one MediaAgent have already been installed in the CommCell. Also, keep in mind that the CommServe and MediaAgent must be installed

Page 10 of 65 Features - Data Classification Enabler

and running (but not necessarily on the same computer), before you can install the Agent. z This version of the software is intended to be installed in a CommCell where the CommServe and MediaAgent(s) version is 8.0.0. z Close all applications and disable any programs that run automatically, including anti-virus, screen savers and operating system utilities. Some of the programs, including many anti-virus programs, may be running as a service. Stop and disable such services before you begin. You can re-enable them after the installation. z Ensure there is an available license on the CommServe for the Agent. z Verify that the software installation disc is appropriate to the operating system of the computer on which the software is being installed. Make sure that you have the latest software installation disc before you start to install the software. If you are not sure, contact your software provider. Firewall

z If the CommServe® Server, MediaAgent and/or Clients communicate across two-way firewall(s): { Ensure port 8400 is allowed connections through the firewall. { In addition a range of bi-directional ports (consecutive or discrete) must also be allowed connections through the firewall. For information on configuring the range of ports, see Port Requirements for Two-way Firewall.

z If the CommServe Server, MediaAgent and/or Clients communicate across one-way firewall(s): { Identify a range of outbound ports (consecutive or discrete) for use by the software. For information on configuring the range of ports, see Port Requirements for One-way Firewall.

z If the MediaAgent/Client communicates with the CommServe Server across a one-way firewall, you must add the MediaAgent/Client host name (or the IP address) in the CommServe computer before installing the necessary software on the MediaAgent/Client computer. Exchange z For Data Classification for Exchange, install the affected components as follows: { For the supported Exchange 2003 agents, install the Data Classification Enabler on the Exchange Server. For the supported Exchange 2007 agents, install the Data Classification Enabler on an Exchange Proxy. See Installation for more information regarding your agent. { Install the affected Exchange agent on the Exchange Proxy. See Installation for more information regarding your agent. { Ensure that the Exchange database is installed on the Exchange Server. z For Exchange 2007, be sure to use no more than on Exchange Proxy to prevent Data Classification database inconsistencies (since Data Classification logs can be consumed by only one proxy).

Install Checklist

Collect the following information before installing the software. Use the space provided to record the information, and retain this information in your Disaster Recovery binder.

1. Data Classification Destination Folder: ______See Data Classification Destination Folder for more information.

2. Data Classification Service Start Date and Time:______See Administer Data Classification for more information.

Before You Begin

Page 11 of 65 Features - Data Classification Enabler

z Log on to the client as local Administrator or as a member of the Administrators group on that computer.

Install Procedure

Getting Started

1. Place the software installation disc for the Windows platform into the disc drive. After a few seconds, the installation program is launched. If the installation program does not launch automatically:

z Click the Start button on the Windows task bar, and then click Run. z Browse to the installation disc drive, select Setup.exe, click Open, then click OK. NOTES

z If you are installing on a x64 version of Windows 2008 R2, go to the AMD64 folder and run Setup.exe.

2. In this screen, you choose the language you want to use during installation. Click the down arrow, select the desired language from the pull-down list, and click Next to continue.

3. Select the option to install software. NOTES

z This screen will only appear when the bAllow32BitInstallOn64Bit registry key has been created and enabled on this computer.

4. Select the option to install software on this computer. NOTES

z The options that appear on this screen depend on the computer in which the software is being installed.

5. Read the Welcome screen. Click Next to continue, if no other applications are running.

6. Read the virus scanning software warning. Click OK to continue, if virus scanning software is disabled.

7. Read the license agreement, then select I accept the terms in the license agreement. Click Next to continue.

Select Components for Installation

Page 12 of 65 Features - Data Classification Enabler

8. Select the component(s) to install. NOTES

z Your screen may look different from the example shown. z Components that either have already been installed, or which cannot be installed, will be dimmed. Hover over the component for additional details. z The Special Registry Keys In Use field will be enabled when GalaxyInstallerFlags registry keys have been enabled on this computer. Hover over the field to see which keys have been set, as well as their values. For more information, see Registry Keys. Click Next to continue. To install Data Classification, expand the DataArchiver Agents folder and select Data Classification Enabler.

Data Classification Destination Folder

9. Specify the location where you want to install the software. NOTES

z Do not install the software to a mapped network drive. z Do not use the following characters when specifying the destination path: / : * ? " < > | It is recommended that you use alphanumeric characters only. z If you intend to install other components on this computer, the selected installation directory will be automatically used for that software as well. z If a component has already been installed, this screen may not be displayed if the installer can use the same install location as previously used.

z If you intend to use the SnapProtect™ feature for Windows File System iDataAgent, the agent should be installed on a non-system drive and not a filer volume. Click Browse to change directories. Click Next to continue.

Administer Data Classification

10. Specify when you prefer to start Data Classification Service. NOTES

z If this option is not selected, the Data

Page 13 of 65 Features - Data Classification Enabler

Classification service will start scanning the system as soon as the installation is complete. This may cause increased I/O and CPU usage. Also, if there is activity on the system during the initial scan (e.g., keyboard use, mouse use), the initial scan will not run until 30 seconds after such activity stops. If this option is selected, you can schedule the service to start at a later date and time of your convenience and therefore avoid these issues. (Alternatively, you can avoid using the computer for some time depending on the amount of data in your system.) Click Next to continue.

11. Click Yes if you have multiple computers with Data Classification in your domain and you want to control these computers from a single computer. Otherwise, click No.

12. Select Download Pack(s) and Install to download and install the latest service packs and post packs from the software provider. NOTES

z Internet connectivity is required to download updates. z This step is applicable when installing on the first instance. z Updates are downloaded to the following directory: /Base/Temp/DownloadedPacks. They are launched silently and installed automatically for the first instance. Click Next to continue.

Verify Summary of Install Options

13. Verify the summary of selected options. NOTES

z The Summary on your screen should reflect the components you selected for install, and may look different from the example shown. Click Next to continue or Back to change any of the options. The install program now starts copying the software

Page 14 of 65 Features - Data Classification Enabler

to the computer. This step may take several minutes to complete.

14. The System Reboot message may be displayed. If so, select one of the following: z Skip Reboot This option will be displayed if the install program finds any files belonging to other applications, that need to be replaced. As these files are not critical for this installation, you may skip the reboot and continue the installation and reboot the computer at a later time. z Reboot Now If this option is displayed without the Skip Reboot option, the install program has found files required by the software that are in use and need to be replaced. If Reboot Now is displayed without the Skip Reboot option, reboot the computer at this point. The install program will automatically continue after the reboot. z Exit Setup If you want to exit the install program, click Exit Setup.

Install Remaining Cluster Nodes

15. If you are installing/upgrading the software on the physical node in a clustered environment, use this option to install/upgrade the software on the remaining physical nodes of the cluster. z To install/upgrade the software on the remaining nodes of the cluster, click Yes. z To complete the install/upgrade for this node only, click No. See Install/Upgrade Remaining Cluster Nodes for step-by-step instructions.

Setup Complete

16. Setup displays the successfully installed components. NOTES

z The Setup Complete message displayed on

Page 15 of 65 Features - Data Classification Enabler

your screen will reflect the components you installed, and may look different from the example shown. z If you install an Agent with the CommCell Console open, you need to refresh the CommCell Console (F5) to see the new Agents. z If Reboot Now button is displayed make sure to reboot the computer before performing any other operations from the computer. Click Finish to close the install program. The installation is now complete.

Post-Install Considerations

General z Install post-release updates or Service Packs that may have been released after the release of the software. If you are installing a Service Pack, verify and ensure that it is the same version as the one installed in the CommServe Server. Alternatively, you can enable Automatic Updates for quick and easy installation of updates in the CommCell component. z After installing the Agent, you may want to configure the Agent before running a data protection operation. The following list includes some of the most common features that can be configured: { Configure your subclients - see Subclients for more information. { Schedule your data protection operations - see Scheduling for more information. { Configure Alerts - See Alerts and Monitoring for more information. { Schedule Reports - See Reports for more information. The software provides many more features that you will find useful. See the Index for a complete list of supported features. Agent Specific z To ensure that the Data Classification Enabler is installed on both the physical nodes, refer to the following: { Verify the driver file [volume]_db.db (e.g., for C:\, it would be c_db.db) exists on the install root directory. For mount points, ensure that the driver file is located at the root of each mount point. { You can also verify by review of on each physical node. At this registry location, the list of volumes being monitored by Data Classification Enabler will be displayed. z To verify that the Data Classification Enabler has successfully built the database on each physical node, see Database Considerations. z To enable and configure the Data Classification Enabler for Agents, see Enable and Configure the Agent or Components. z The following configuration tasks are required before performing an Archive Operation using the Data Classification Enabler: { Create a storage policy (see Storage Policies for more information). { Verify that the DataArchiver Agent is already installed in order to see the DataClassSet. { Ensure that the DataClassSet is viewable in the CommCell Browser. { Create a subclient. See Subclients: DataClassSet Subclients for more information. Disaster Recovery Considerations

Page 16 of 65 Features - Data Classification Enabler

z Before you use your agent, be sure to review and understand the associated full (or disaster recovery) procedure. The procedure for some agents may require that you plan specific actions or consider certain items before an emergency occurs. See Disaster Recovery for more information regarding your agent.

Page 17 of 65 Features - Data Classification Enabler

Install Data Classification - Unix

Click on a link below to go to a specific section of the software installation:

z Install Requirements z Install Checklist z Before You Begin z Install Procedure { Getting Started { Select Components for Installation { Base Software Installation { Administer Data Classification Directory { Administer Volumes { Administer Volume Filters { Setup Complete z Post-Install Considerations

Install Requirements

As a standalone module, Data Classification can be installed on the computer independently or along with other eligible agents. Data Classification does not require a Unix File System iDataAgent. In a clustered environment, Data Classification is installed on the physical node. The Data Classification – Unix install can be used to enable the Data Classification Enabler for the following agents:

z File Archiver for Unix Agents z Unix File System iDataAgents You can install the software in monitor mode. This means that you can choose to monitor only the Data Classification volumes that you specify. In such a case, any new volumes that are included in the Data Classification database will not be recognized by the system. On the other hand, if you choose to monitor all the affected volumes, all the new volumes will be recognized. See Administer Volumes for the appropriate steps. Verify that the computer in which you wish to install the software satisfies the requirements specified in System Requirements - Data Classification. The following procedure describes the steps involved in installing Data Classification. If you choose to install multiple components simultaneously, refer to the appropriate procedures for installation requirements and steps specific to the component. Note that when you install multiple components, the sequence of the install steps may vary. Review the following Install Requirements before installing the software: General z Verify that the software installation disc is appropriate to the operating system of the computer on which the software is being installed. Make sure that you have the latest software installation disc before you start to install the software. If you are not sure, contact your software provider. Agent Specific z Do not install the software on a destination volume that is being used for replication by ContinuousDataReplicator. z Before you install the software, consider which volumes you want Data Classification to monitor and

Page 18 of 65 Features - Data Classification Enabler

not monitor. In most cases, you will want Data Classification to monitor only selective volumes (monitor mode). You can specify these volumes during the install procedure; alternatively, you can specify these volumes post-install by using the DcClient -start command (see DC Client Command Line Tool for Unix for more information). z It is recommended that you create a volume (e.g., /home/DCTemp) to hold a record of Data Classification changes and also to hold Data Classification databases. This volume should never be monitored by Data Classification. For example, during the install, create /home/DCTemp/cache to hold the record of changes. Post-install, use the DcClient -relocate command to create /home/DCTemp/DcDbs to hold the Data Classification databases (see DC Client Command Line Tool for Unix for more information).

Install Checklist

Collect the following information before installing the software. Use the space provided to record the information, and retain this information in your Disaster Recovery binder.

1. Install directory location:______The default is /opt, but you may designate any location you want.

See Base Software Installation for more information.

2. Log files directory location:______The default is /var/log, but you may designate any location you want.

See Base Software Installation for more information.

3. Data Classification directory location: ______See Administer Data Classification Directory for more information.

4. Data Classification volumes to monitor: ______See Administer Volumes for more information.

5. Data Classification volumes to filter: ______See Administer Volume Filters for more information.

Before You Begin

z Log on to the client as root. z The install package requires super-user permissions to execute.

Install Procedure

Getting Started

1. Place the software installation disc for the Unix platform into the disc drive. You can also install the product using a disc drive mounted on another computer on the network. z On Solaris, double-click the cvpkgpush program

Page 19 of 65 Features - Data Classification Enabler

from the window. z On other Unix platforms, open the Terminal window, navigate to the installation disc and then enter cvpkgpush. NOTES

z If you are installing on Solaris 2.6, an Action Run window is displayed. Click OK to close the window and continue the install process.

2. The product banner and other information is displayed. Press Enter to continue.

3. Read the license agreement. Type y and press Enter to continue.

Select Components for Installation

4. Enter the number corresponding to the CVGxDC Install Calypso on physical module. machine client.company.com NOTES Select the Calypso module that you would like to install z Your screen may look different from the example shown. 1) Media Agent <= CVGxMA z Components that either have already been installed, 2) FileSystem iDataAgent <= or which cannot be installed, will not be shown. CVGxIDA z In addition, the list of modules that appear depends 3) Exit this menu on the specific Unix File System in which the package Module number: [1] is installed. (e.g., CVGxWA will appear only when the installation package is run on a Solaris computer.) Press Enter to continue.

Base Software Installation

5. Press Enter to install the file system driver and the Here is a list and status of the Calypso Base0 module. dependent modules: 1) File System Filter Driver NOT INSTALLED 2) Calypso Base0 Module NOT INSTALLED If there are any modules listed as not installed/upgraded, they will be installed/upgraded first. Press to proceed ...

6. Specify the location where you want to install the Please specify where you want us software. to install Calypso binaries. NOTES It must be a local directory and there should be at least 176MB of z The amount of free space required depends on the free space available. All files components selected for install, and may look will be installed in a "calypso" different from the example shown. subdirectory, so if you enter "/opt", the files will actually Press Enter to accept the default path and continue, or

Page 20 of 65 Features - Data Classification Enabler

Enter a path and then press Enter to continue. be placed into "/opt/calypso". Press Enter again to confirm the path. Installation Directory: [/opt] .. Calypso will be installed in /opt/calypso. Press ENTER to continue ...

7. Specify the location for the log files. Please specify where you want to keep Calypso log files. NOTES It must be a local directory and z All the modules installed on the computer will store there should be at least 100MB of the log files in this directory. free space available. All log z The amount of free space required depends on the files will be created in a components selected for install, and may look "calypso/Log_Files" subdirectory, different from the example shown. so if you enter "/var/log", the logs will actually be placed into Press Enter to accept the default path and continue, or "/var/log/calypso/Log_Files". Enter a path and then press Enter to continue. Log Directory: [/var/log] Press Enter again to confirm the path. .. Calypso log files will be created in /var/log/calypso/Log_Files. Press ENTER to continue ...

Administer Data Classification Directory

8. Specify the location of the Data Classification directory Calypso DC needs a dedicated and then press Enter. directory where it will maintain the cache of file system changes. The amount of free space in that directory should be at least 512MB (you will be able to customize it later.) DC Cache Directory: /dcache

Administer Volumes

9. To monitor only specific volumes from the Data By default DC would monitor all Classification database now, accept yes in response to volumes except system volumes the question, press Enter, and go to the next step. (/, /opt, /usr, /tmp ...) and Otherwise, type no, press Enter, and go to Administer filtered volumes. You can alter Volume Filters. this behavior by choosing to monitor only specific volumes. Do you want to monitor only specific volumes? [yes]

10. To specify the volumes you want to monitor now, type You can give list of volumes to yes and press Enter. Then type the appropriate volume monitor now or you can specify paths at the prompt. Use a space to separate each those volumes later after entry. To specify the volumes after the install, accept install. the no default and press Enter. Do you want to specify the volumes now? [no] DC Volumes to monitor:

11. To delay creating Data Classification databases instead Calypso DC can start creating the of creating the databases immediately, specify the time DBs immediately after install, or

Page 21 of 65 Features - Data Classification Enabler

in minutes after which the databases should be created you can specify a time to delay and then press Enter. Otherwise, just press Enter. the creating. Creating DB delayed in minutes: [0]

Administer Volume Filters

12. To specify the volumes you want to filter out now, type You can filter out certain yes and press Enter. Then type the appropriate volume volumes now or you can add those paths at the prompt. Use a space to separate each filters later after install. entry. To specify the volumes to be filtered out after the Do you want to specify the install, accept the no default and press Enter. filters now?: [no] NOTES DC Volumes to filter out: z System volumes are automatically filtered out. z It is highly recommended that you filter out the volume where the Data Classification cache directory resides to prevent errors and to keep volume monitoring from stopping.

13. To delay creating Data Classification databases instead Calypso DC can start creating the of creating the databases immediately, specify the time DBs immediately after install, or in minutes after which the databases should be created you can specify a time to delay and then press Enter. Otherwise, just press Enter. the creating. Creating DB delayed in minutes: [0]

Setup Complete

14. The install program now starts copying the software to ..... the computer. The progress of the operation is ..... displayed...... Press Enter to continue. Successfully copied xx files ...... Successfully installed .

Press ENTER to continue ...

15. This menu may be displayed only when you are Select the Calypso module that installing on AIX, Linux, or Solaris computers. If this is you would like to install. the last package that you wish to install/upgrade, enter 1) Proxy FileSystem iDataAgent <= the number corresponding to the Return option and CVGxProxyIDA then press Enter to continue. 2) Oracle iDataAgent <= NOTES CVGxOrIDA 3) DB2 iDataAgent <= z Only modules that are not installed/upgraded appear CVGxDB2 in the list. 4) Return z Your screen may appear different from the example Module number: [1] shown. z If you are installing on AIX, FreeBSD, IRIX or Tru64 computers, if this module was the last possible module to install, you are automatically exited from the program. Otherwise, type the number for the

Page 22 of 65 Features - Data Classification Enabler

Exit option and then press Enter. The installation is completed.

16. Enter Yes to download and install the latest service Download and Install Latest packs and post packs from the software provider. Service Pack NOTES If you choose to download the latest service pack from the z Internet connectivity is required to download software provider website now, updates. please make sure you have internet connectivity at this z This step is applicable for multi instancing. time. This process may take some Press Enter to continue. time depending on the internet connectivity. Do you want to download the latest service pack now ? [no] Press to continue ...

17. This prompt is displayed only when you are installing on Certain Calypso packages can be HP-UX, Linux, or Solaris computers. Enter the number associated with a virtual IP, or corresponding to the Exit option and then press Enter in other words, installed on a to continue. "virtual machine" belonging to some cluster. At any given time The installation is now complete. the virtual machine's services and IP address are active on only one of the cluster's servers. The virtual machine can "fail-over" from one server to another, which includes stopping services and deactivating IP address on the first server and activating the IP address/services on the other server. Currently you have Calypso installed on physical node stone.company.com. Now you have a choice of either adding another package to the existing installation or configure Calypso on a virtual machine for use in a cluster. 1) Add another package to stone.company.com 2) Install Calypso on a virtual machine 3) Exit Your choice: [1]

Post-Install Considerations

Agent Specific z The following configuration tasks are required before performing an Archive Operation using the Data Classification Enabler: { Create a storage policy (see Storage Policies for more information). { Verify that the DataArchiver Agent is already installed in order to see the DataClassSet. { Ensure that the DataClassSet is viewable in the CommCell Browser.

Page 23 of 65 Features - Data Classification Enabler

{ Create a subclient. See Subclients: DataClassSet Subclients for more information. z To specify volumes to be monitored, run the DcClient -start command (see DC Client Command Line Tool for Unix for more information). z To create a folder to hold the Data Classification databases, run the DcClient -relocate command (see DC Client Command Line Tool for Unix for more information). Cluster Specific Once you install the software on all the physical computers, do the following to ensure that failovers will work correctly: 1. Create the DcFailOverVolumes file under the /tmp directory.

2. Add a shared volume mount point to the DcFailOverVolumes file.

Page 24 of 65 Features - Data Classification Enabler

Setting Up Data Classification Enabler

Topics | How To | Related Topics

Overview Installing the Data Classification Enabler

z Database Considerations z Space and Performance Considerations z Services Enable and Configure the Agents or Components

z Exchange Agents z File Archiver for Unix Agent z File Archiver for Windows Agent z Online Content Indexing z SRM Windows File System Agent z Unix File System iDataAgents z Windows File System iDataAgents

Overview Setting up the Data Classification Enabler includes the following tasks:

z Installing and enabling Data Classification z Configuring Data Classification for the specific agent or component

Installing the Data Classification Enabler The Data Classification Enabler can be installed on Windows and Unix computers. For more information, see Deployment - Data Classification Enabler. Also, see System Requirements - Data Classification Enabler for the supported operating systems, applications and file servers. Once the Data Classification Enabler is installed and enabled, it performs an initial data collection of all the data and then creates SQL-like (meta) databases. Once this initialization is completed, the supported components can use Data Classification. Database Considerations Exchange To administer Exchange data, the Data Classification Enabler uses a couple of processes: enumeration and sink. The enabler uses enumeration to log in to the Exchange Server, parse each Exchange mailbox on the server, and create a map of the data in the Data Classification database. The enabler uses sink to hook to the Exchange server in order to capture the state changes (events) of the mailbox contents and to record this information in the Data Classification transaction logs. Once these logs reach the specified maximum size, they are consumed in the Data Classification database. As such, the Data Classification database includes a record of the data change events and a corresponding time stamp for each event.

Unix Each meta database contains information about the files in the associated volume. Thereafter, the Data Classification service constantly monitors all the files on these volumes, and it detects new volumes at a prescribed time interval. The service updates the databases and it keeps tracks of the updates (e.g. file

Page 25 of 65 Features - Data Classification Enabler additions, content update to files, etc.) made to each database; in effect, this provides almost a real-time view of the data in the system. By default, the meta database is located at the root of each mount point, and it is named .db.cv (e.g., for /home, it would be /home/.DATACLASS_1/.db.cv). Journals from the FSF driver is used to keep track of the updates to each meta database.

Windows Each meta database contains information about the files in the associated volume. Thereafter, the Data Classification service constantly monitors all the files on these volumes, and it detects new volumes at a prescribed time interval. The service updates the databases and it keeps tracks of the updates (e.g. file additions, content update to files, etc.) made to each database; in effect, this provides almost a real-time view of the data in the system. By default, the meta database is located at the root of each NTFS volume or mount point, and it is named [volume]_db.db (e.g., for C:\, it would be c_db.db). For mount points, the database name is [mountpoint]_db.db (e.g., for a mount point C:\mountpoint, the file is mountpoint_db.db, and it resides in the C:\mountpoint directory). Change Journal is used to keep track of the updates to each meta database.

Data Classification works with NTFS volumes but not with FAT volumes. New volumes that are added to your system are automatically recognized.

Space and Performance Considerations The meta databases created by Data Classification usually consume about 5% of the total space on the hard disk. Depending on the type of data and folder layout, the metafiles may consume additional space. For Data Classification on Unix, each Data Classification update record consumes about 256 bytes (this assumes an average short name length of 16 bytes and an average full path length of 256 bytes). For Data Classification on Windows, you can administer the size of the Data Classification databases by using the DC_CREATE_INDEX registry key. This key also allows you to administer other items associated with Data Classification, such as the time required for database initialization as well as backup and archiving speed for some agents. Services Data Classification services can be started or stopped using the . See Services for an overview. Data Classification services on Windows can also be started or stopped using the Data Classification Administration Utility or the Data Classification Console for Windows.

Enable and Configure the Agents or Components Depending on the agent or component, you can configure the Data Classification Enabler to do the following:

z Maximize the scan speed for data, resulting in faster backup, search, and SRM data collection operations (Windows File System, Unix File System, SRM Windows File System, Online Content Indexing for Windows File System) z Select backup, archive and Online Content Indexing data faster (Exchange Server 2003/2007 Mailbox, Exchange Mailbox Archiver, Online Content Indexing for Exchange) z Define archive rules based on file attributes and not just on volumes and basic attributes, such as size and modified times (File Archiver for Windows (Local File System Instance), File Archiver for Unix) z Select users and user groups to administer data to be archived (File Archiver for Windows (Local File System instance)) Exchange Agents These agents can use the Data Classification Enabler to log and use events generated by the Exchange

Page 26 of 65 Features - Data Classification Enabler server in order to select the eligible data for data protection operations or Online Content Indexing. The events keep track of what data has been added to, removed from, and changed in the Exchange Server. Tracking data using events is especially useful when preparing to run incremental backups or Online Content Indexing. Use the Data Classification Administration Utility or the Data Classification Console for Windows to administer event logging for Exchange mailbox data and to populate the Data Classification database. For more information, see Advanced Plugins. For the step-by-step procedures, see Administer Exchange Properties for Data Classification and Start Data Classification for Exchange to Populate the Data Classification Database with Exchange Metadata.

To use the Data Classification Enabler with an eligible Exchange 2007 agent, both the enabler and the agent must be installed on a proxy. To administer Exchange 2007 mailbox data using the Data Classification Enabler, you must configure a proxy computer for the enabler by populating the affected fields discussed in Administer Exchange Properties for Data Classification. Also, you must click the name of the proxy from the Exchange Proxy list on the Client Computer Properties (Advanced) tab. To use the Data Classification Enabler on a 2008 CCR cluster, you must configure a shared folder that will essentially serve as the third node in the cluster. To enable creation and use of this shared folder, ensure that your system is running the Distributed File System (DFS) and that it includes a DFS replicator. Be sure to add the shared folder to the replication group for all the nodes in the cluster. This setup will enable the data on all the nodes to be replicated. As such, in case of a failover, all the nodes will have the data.

Selected data can be backed up or migrated, as appropriate, by the supported agents. Before you perform either data management operation, be sure to select the Use Data Classification option from the Backup Set/Archive Set Properties dialog box in the CommCell Console. For a step-by-step procedure, see Use Classic File Scan or Data Classification.

If you are using Data Classification on an Exchange database that has been restored from a protected copy, be sure to stop the Data Classification services after the restore and then restart the services. This will repopulate the Data Classification database correctly.

File Archiver for Unix Agent

For this agent, there is no fallback scan method if the Data Classification Enabler is not available.

This agent can use the Data Classification Enabler to define archiving rules based on file attributes and not just on volumes and basic attributes, such as size and modified times. For example, you can use Data Classification to define the agent's subclient content to contain all files starting with 'A', all files modified after a specific date, etc. You can make the associated queries for these and more complex definitions by issuing SQL database-like commands from the CommCell Console against the metadata databases. For an overview, see Rules and Queries. File Archiver for Windows Agent Local File System Instance

For this agent, there is no fallback scan method if the Data Classification Enabler is not available.

This agent can use the Data Classification Enabler to define archiving rules based on file attributes and not just on volumes and basic attributes, such as size and modified times. For example, you can use Data Classification to define the agent's subclient content to contain all files starting with 'A', all files modified

Page 27 of 65 Features - Data Classification Enabler after a specific date, etc. You can make the associated queries for these and more complex definitions by issuing SQL database-like commands from the CommCell Console against the metadata databases. For an overview, see Rules and Queries. To use this capability, you must first configure a Local File System Instance. This agent can use the Data Classification Enabler to support domain users and user groups. You can authenticate against the domain the users whose files you want to archive. For more information, see Users and User Groups. Using Data Classification for this purpose is especially useful when you are archiving data for user groups across multiple volumes. Data Classification can archive data for users in these groups using rules that you define without the need for your specifying the exact paths to find this data. Online Content Indexing Online Content Indexing can content-index various data that are scanned or selected by the Data Classification Enabler. SRM Windows File System Agent This agent can use the Data Classification Enabler to scan file system data before data collection jobs. Such scans help expedite Analysis-level data collection jobs. Scans using Data Classification for this agent are enabled by default from the CommCell Console. For more information, see Agents - SRM Windows File System: Data Classification. For a step-by-step procedure, see Enable Data Classification Enabler for SRM. Once this setting is enabled, all Analysis level data collection jobs that you run subsequently will use Data Classification to gather data. Data Collection jobs will transition to traditional collection methods if any of the following conditions are true:

z Data Classification services are not available z Data Classification databases are currently being rebuilt Unix File System iDataAgents These agents can use the Data Classification Enabler to improve the scan speed of file system data before data management operations. If the enabler is not available, Classic File Scan is used to scan the data. Scans using Data Classification for these agents must be enabled from the CommCell Console. See Use Classic File Scan or Data Classification for a step-by-step procedure. Windows File System iDataAgents These agents can use the Data Classification Enabler to improve the scan speed of file system data before data management operations. If the enabler is not available, Change Journal or Classic File Scan is used to scan the data. Scans using Data Classification for these agents must be enabled from the CommCell Console. See Use Change Journal, Classic File Scan or Data Classification for a step-by-step procedure.

Setting Up Data Classification Enabler - How To

Topics | How To | Related Topics

Use Change Journal, Classic File Scan, or Data Classification Enabler for Backups (Exchange Server 2003/2007 Mailbox, Exchange Mailbox Archiver, Online Content Indexing for Exchange, Unix File System, Windows File System) Enable Data Classification Enabler for SRM (SRM Windows File System) Administer Exchange Properties for Data Classification (Exchange Server 2003/2007 Mailbox, Exchange Mailbox Archiver, Online Content Indexing for Exchange) Start Data Classification for Exchange to Populate the Data Classification Database with Exchange Metadata (Exchange Server 2003/2007 Mailbox, Exchange Mailbox Archiver, Online Content Indexing for Exchange)

Page 28 of 65 Features - Data Classification Enabler

Use Change Journal, Classic File Scan, or Data Classification Enabler Before You Begin

z For Unix, review Using Classic File Scan or Data Classification Enabler for Backups z For Windows, review Using Change Journal, Classic File Scan, or Data Classification Enabler for Backups Required Capability: See Capabilities and Permitted Actions To use Classic File Scan, Change Journal, or Data Classification Enabler: 1. From the CommCell Browser, right-click the desired file system backup set or archive set and click Properties. 2. From the General tab, select the Use Change Journal, Use Classic File Scan, or Use Data Classification. 3. Optionally, select Preserve File Access Time. 4. Optionally, if applicable, select Check archive bit during backups. 5. Click OK to save your changes.

Enable Data Classification Enabler for SRM Required Capability: See Capabilities and Permitted Actions To enable the Data Classification Enabler for SRM: 1. From the CommCell Browser, expand client computers, then right-click the icon for SRM Windows File System Agent for which you wish to enable Data Classification Enabler, select All Tasks and click Configure GXDC. 2. From the SRM Data Classification Options dialog box, select the Use Data Classification if present option to enable the Data Classification Enabler. Clear the Use Data Classification if present option to disable the Data Classification Enabler. 3. Click OK.

Administer Exchange Properties for Data Classification To administer Exchange properties for Data Classification: 1. From the Data Classification Administration Utility, click from the list the name of the Exchange server. 2. Click Advanced Plugins. 3. Click Exchange DC Journal Configuration. 4. Click Modify Settings. 5. Start populating the Exchange DC Journal Configuration dialog box as appropriate. { In the Domain Name\User Name and Password fields, type the account information that is required to access the affected Exchange server. The account should have administrator privileges. { In the Local path on Exchange server to generate Journal files field, type the path where the DC.INI file and the Data Classification database will be generated. This should be a local (and non-UNC) path. In a clustered environment, the file should be on a shareable drive.

Page 29 of 65 Features - Data Classification Enabler

{ In the Exchange Profile Name field, type the name of the profile that is associated with the appropriate administrator mailbox. { Click the Schedule to start Data Classification service later option to start the enumeration and sink later. Use the list provided to set the appropriate date and time. { In the AD Server List field, type the domain name for one or more non-default Exchange servers. Each must be separated by a semi-colon, even if only one name is entered. { In the Registration Timeout field, type the interval of time (in milliseconds) that the Data Classification service should wait for the sink to start on the Exchange server before timing out. Recommended value is 180000 (three minutes). { For the Support Deleted Items option, click the option to remove deleted messages from the index to prevent them from being searched by Online Content Indexing. { Click ProxySetup if you need to establish a proxy for Data Classification. { In the Network\UNC path to journal files field, type the path from the proxy machine to the DC.INI file and the Data Classification database. This path must point to the same location that is specified in the Local path on Exchange server to generate Journal files field above. Also, this path must always be a UNC path. In a clustered environment, the file should be on a shareable drive. { Click the Reuse Event registration Credentials option to reuse the credentials for accessing the affected Exchange server. If you click this option, type the account information that is required to access the affected Exchange server in the Domain Name\User Name and Password fields. The account should have administrator privileges. 6. Click Apply to enforce your inputs.

Start Data Classification for Exchange to Populate the Data Classification Database with Exchange Metadata To start Data Classification for Exchange to populate the Data Classification database with Exchange metadata: 1. From the Data Classification Administration Utility, click from the list the name of the Exchange server. 2. Click Advanced Plugins. 3. Click Exchange DC Journal Configuration. 4. Either accept the Exchange properties settings that are displayed or click Modify Settings and update the properties as appropriate. 5. Click Start Exchange DC. This should populate the Data Classification database with Exchange metadata.

Page 30 of 65 Features - Data Classification Enabler

DC Console

Topics | How To | Related Topics

Overview Add a Data Classification Computer Data Classification Reports Relocate a Data Classification Database Defragment Volumes on a Data Classification Computer Set the Monitor Status for Volumes on a Data Classification Computer Administer Filtered Volumes on a Data Classification Computer Administer Registry Values for a Data Classification Computer

Overview The DC Console is used to manage Data Classification services and file reports for Unix computers where the Data Classification Enabler is installed. Specifically, you can use the console to do the following:

z Add computers with Data Classification to the console's computer administration list z Query for file types and sizes on the computers and generate reports regarding the affected files

z Relocate a Data Classification database

z Defragment volumes on a Data Classification computer z Set the monitor status for volumes on a Data Classification computer z Administer filtered volumes on a Data Classification computer z Administer registry key values for a Data Classification computer The DC Console is available from the CommCell Console as a matter of course, but it can also be used as a standalone tool.

Add a Data Classification Computer The DC Console allows you to add computers with Data Classification to the console's computer administration list. See Add a Computer with Data Classification for step-by-step instructions.

Data Classification Reports You can query for files per their size or file name pattern matching on the computers with Data Classification and generate reports regarding the affected files.

The capabilities provided by this tab represent only a subset of the capabilities provided by Storage Resource Management (SRM). For more information, see SRM or your software representative.

See Generate Reports Regarding Attributes of Files on a Data Classification Computer for step-by-step instructions.

Page 31 of 65 Features - Data Classification Enabler

Relocate a Data Classification Database You can move a Data Classification database to another location on your computer. See Relocate a Data Classification Database for step-by-step instructions.

Defragment Volumes on a Data Classification Computer You can defragment (optimize) a Data Classification database on your computer. This is especially helpful for re-organizing highly-active databases. See Defragment Volumes on a Data Classification Computer for step-by-step instructions.

Set the Monitor Status for Volumes on a Data Classification Computer You can start or stop monitoring volumes on a Data Classification computer. See Set the Monitor Status for Volumes on a Data Classification Computer for step-by-step instructions.

Administer Filtered Volumes on a Data Classification Computer You can add or remove volumes to or from a Data Classification computer. See Administer Filtered Volumes on a Data Classification Computer for step-by-step instructions.

Administer Registry Values for a Data Classification Computer You can change the value of specific registry keys on a Data Classification computer, including the following:

z DB_FOLDER

When you install Data Classification on a Unix computer, a Data Classification database is created on each volume. You can use this key to create a centralized database directory to accommodate all these databases.

z REFRESH_PERIOD

This key indicates the period of time after which the Data Classification services will refresh all the monitored volumes. If any changes occur on the monitored volumes, the Data Classification services will attempt an incremental scan of the data after the refresh period. You can use this key to change the refresh interval. Depending on the time specified, if you add any new volumes, the services will "pick up the volumes" and start to monitor those volumes after the refresh period. Other registry keys whose value you should not change include the following:

z FILTER_OR_MONITOR

These modes are initially set by the Data Classification install and they cannot be changed.

z SCAN_PRIORITY

This key indicates the priority at which data scans will run. This is the same value that is supported by Unix.

z THROTTLE

This key indicates the number of volumes at a time that Data Classification will pick up and start to scan.

Page 32 of 65 Features - Data Classification Enabler

See Administer Registry Values for a Data Classification Computer for step-by-step instructions.

DC Console - How To

Topics | How To | Related Topics

Add a Computer with Data Classification Generate Reports Regarding Attributes of Files on a Data Classification Computer Relocate a Data Classification Database Defragment Volumes on a Data Classification Computer Set the Monitor Status for Volumes on a Data Classification Computer Administer Filtered Volumes on a Data Classification Computer Administer Registry Values for a Data Classification Computer

Add a Computer with Data Classification To add a Computer with Data Classification to the DC Clients list: 1. From the CommCell Console, click the Tools menu and then click Unix DC Console. 2. In the DC Console dialog box, right-click DC Clients and then click Add Client. 3. In the Data Classification dialog box, type the name of the computer with Data Classification in the space and then click OK.

The Unix DC Console will send an authentication request to the targeted client. If a response is not forthcoming after some time period, you will be prompted to try adding the client later.

4. Repeat this procedures for each computer that you want to add. The name of each computer should be displayed under DC Clients.

Generate Reports Regarding Attributes of Files on a Data Classification Computer To generate reports regarding attributes of files on a Data Classification computer: 1. From the CommCell Console, click the Tools menu and then click Unix DC Console. 2. From the list of clients, right-click the name of the Data Classification computer for whose files you want to generate a report and then click Reports. 3. From the Report Type list, click the filter for file selection. 4. In the Volumes field, select the volume(s) whose files you want to administer. 5. Click Reports Configuration. If you clicked File Name Pattern from the Report Type list, go to the next step. If you clicked File Size Range, skip the next step.

6. In the File Name Patterns tab, ensure that the name of each file on the affected computer is included in the File Namesspace. To remove a file, click the file name in the space and then click Remove. To add a file, type the name of the file in the Enter File Name Pattern space and then click Add. Finally, click OK. Skip the next step.

Page 33 of 65 Features - Data Classification Enabler

7. In the File Size Ranges tab, ensure that the minimum and maximum file size ranges of the files on the affected computer are included in the Min(KB)|Max(KB) space at the top of the dialog box. To remove a file size range, click the range and then click Remove. To add a file size range, type the minimum size and the maximum size of the files that should be included in the Enter File Size Ranges spaces and then click Add. Finally, click OK. 8. Click Generate Reports. A pie chart display based on your inputs will be displayed in Report Results. 9. Repeat this procedure for each computer for whose files you want to generate reports.

Relocate a Data Classification Database To move a Data Classification database for a computer to another location: 1. From the CommCell Console, click the Tools menu and then click Unix DC Console. 2. From the DC Clients list on the left-hand side, click the affected computer. 3. From the Volumes Data pane, right-click the affected volume and then click Relocate DB. 4. In the Relocate DB dialog box, type the new location for the Data Classification database in the Database Path field. 5. Click OK. 6. Repeat the appropriate steps for each affected volume on the computer. 7. Repeat this procedure for each computer whose Data Classification database you want to relocate.

Defragment Volumes on a Data Classification Computer To defragment volumes on a Data Classification computer: 1. From the CommCell Console, click the Tools menu and then click Unix DC Console. 2. From the DC Clients list on the left-hand side, click the affected computer. 3. From the Volumes Data pane, right-click the affected volume and then click Defrag. 4. Click OK. Volume defragmentation should take place. 5. Repeat the appropriate steps for each affected volume on the computer. 6. Repeat this procedure for each computer whose volumes you want to defragment.

Set the Monitor Status for Volumes on a Data Classification Computer To set the monitor status for volumes on a Data Classification computer: 1. From the CommCell Console, click the Tools menu and then click Unix DC Console. 2. From the DC Clients list on the left-hand side, click the affected computer. 3. From the Volumes Data pane, right-click the affected volume and then click Start Monitoring or Stop Monitoring as appropriate. 4. Click OK. The monitor status should take effect. 5. Repeat the appropriate steps for each affected volume on the computer. 6. Repeat this procedure for each computer whose volumes' monitor status you want to set.

Page 34 of 65 Features - Data Classification Enabler

Administer Filtered Volumes on a Data Classification Computer To administer filtered volumes on a Data Classification computer: 1. From the CommCell Console, click the Tools menu and then click Unix DC Console. 2. From the DC Clients list on the left-hand side, click the affected computer. 3. Go to the Filtered Volumes pane. 4. To add a filtered volume to the computer, right-click in the Filtered Volumes pane, click Add, type the path for the volume in the Entered Filtered Volume field, and click OK. The filtered volume, if valid, should be displayed in the pane. To remove a filtered volume from the computer, right-click the affected volume and then click Delete. The filtered volume, if valid, should be removed from the pane. 5. Repeat the appropriate steps for each affected volume on the computer. 6. Repeat this procedure for each computer whose filtered volumes you want to administer.

Administer Registry Values for a Data Classification Computer To administer registry values for a Data Classification computer: 1. From the CommCell Console, click the Tools menu and then click Unix DC Console. 2. From the DC Clients list on the left-hand side, click the affected computer. 3. From the Registry Values pane, right-click the affected registry key and then click Edit. 4. In the Edit Registry Values dialog box, indicate or type the desired value in the Registry Value field. { DB_FOLDER

Type the path to a centralized database directory to accommodate all the Data Classification databases.

{ FILTER_OR_MONITOR

To change the install mode to filter mode, type FILTER. To change the install mode to monitor mode, type MONITOR.

{ REFRESH_PERIOD

Type the period of time after which the Data Classification services will refresh all the monitored volumes. Default time is 60 minutes, and you can specify a minimum of 10 minutes; there is no restriction on the maximum number of minutes.

{ SCAN_PRIORITY

Type the priority at which data scans will run. Default is 10. This value depends on the operating system-level scan priority.

{ THROTTLE

Type the number of volumes at a time that Data Classification will pick up and start to scan. Default is 3. 5. Click OK. The change should be displayed in the pane. 6. Repeat this procedure for each registry key whose value you want to administer. 7. Repeat this procedure for each computer whose registry keys' status you want to administer.

Page 35 of 65 Features - Data Classification Enabler

Data Classification Console

Topics | How To

Overview Data Classification Computers and Services Data Classification Server Properties Data Classification Reports Exchange DC Journal Configuration Tab Administer Filtered Volumes on a Data Classification Computer Optimize a Data Classification Database Relocate a Data Classification Database

Overview The Data Classification Console is used to manage Data Classification services, server properties, and file reports for Windows computers where the Data Classification Enabler is installed. For some agents, the console is used to perform data scan-related operations. Specifically, you can use the console to do the following:

z Display the enabler status for all the drives on all the computers where the enabler is installed z Connect to specified computers that include the enabler to retrieve and change the enabler properties on these computers z Query for file types and sizes on the computers and generate reports regarding the affected files z Administer Exchange server and Data Classification properties to be used when logging events for data (mailbox messages) against the Exchange server z Optimize and relocate Data Classification databases (meta databases) The Data Classification Console is an option from the CommCell Console.

To use the Data Classification Console, you must log on to the computer as a User with Administration privileges.

See Start the Data Classification Console for step-by-step instructions on starting the console.

Data Classification Computers and Services You can do the following:

z Administer selected computers across domains that have Data Classification z Administer Data Classification services and processes for computers You can enable or disable Data Classification processes in both the collecting and monitoring states. If a process is disabled, all Data Classification services will be paused until the process is enabled again. When collection processes are enabled, Data Classification services will resume from the default starting point in the database. When monitoring processes are enabled, Data Classification services will resume from the last entry in the change journal. See the following procedures as appropriate:

z Add Computers with Data Classification to the Data Classification Clients List

Page 36 of 65 Features - Data Classification Enabler

z Administer Data Classification Services

Data Classification Server Properties You can view and administer various properties for Data Classification computers, including the following:

z Data Classification Process Priority You can indicate the priority at which Data Classification processes will run during initial database population.

z Runtime Scan Priority You can indicate the priority at which Data Classification processes will run after initial database population.

z Inactivity Wait Interval You can control the interval of time in seconds after local computer activity stops that Data Classification services should resume.

z New Volume Discovery Interval You can control how often Data Classification checks for new volumes.

z Monitor All NTFS Volumes You can indicate whether or not you want to monitor all the affected NTFS volumes per updates to these volumes. See Administer Data Classification Server Properties for a step-by-step procedure.

Data Classification Reports You can query for file types (extensions) and sizes on the computers with Data Classification and generate reports regarding the affected files. You can generate pie charts and bar graphs to display information. Wildcard characters within the extensions are not supported. You can include a maximum of 10 file types and file size ranges, and each file can be a maximum of 100 KB. You cannot include a file size range that overlaps or falls within a file size range that is already established. For example, if you have already included a file range of Min(KB) 1 and Max(KB) 7, you cannot include a file range of Min(KB) 2 and Max(KB) 6 or Min(KB) 6 and Max(KB) 10, etc. Note that any selections that you make for a session are not saved and are therefore effective only for that session.

The capabilities provided by this tab represent only a subset of the capabilities provided by Storage Resource Management (SRM). For more information, see the Storage Resource Management documentation or your software representative.

See Configure and Generate Reports Regarding Attributes of Files on a Data Classification Computer for a step-by-step procedure.

Exchange DC Journal Configuration Tab The Exchange DC Journal Configuration tab allows you to use the Data Classification Enabler to administer Exchange server and Data Classification properties to be used when logging events for data (mailbox messages) against the Exchange server. Such events including adding data, removing data, and

Page 37 of 65 Features - Data Classification Enabler sending data. It also allows you to start or stop populating the Data Classification database with metadata for these events. See the following procedures as appropriate:

z Administer Exchange Properties for Data Classification z Start Data Classification for Exchange to Populate the Data Classification Database with Exchange Metadata

Administer Filtered Volumes on a Data Classification Computer You can filter volumes from a Data Classification computer, and you can remove filters to add volumes back to the computer. See Administer Filtered Volumes on a Data Classification Computer for a step-by-step procedure.

Optimize a Data Classification Database The Data Classification Console allows you to defragment (optimize) a Data Classification database (meta database) on a selected computer. This is especially helpful for reorganizing highly-active meta databases. See Optimize a Data Classification Database for a step-by-step procedure.

Relocate a Data Classification Database You can move the metafiles (databases) created by the Data Classification services to a folder of your choice as long as the folder has already been created.

It is recommended that you not change the location of the metafile in a clustered environment since this would cause an inconsistency between the data in the database and the data on your computer. Nonetheless, if you do this, ensure that the new location is on the physical node and not on the virtual node or shared disk since Data Classification runs only on physical nodes. Also, it is recommended that you do not move the metafile location under the gxhsmcache folder, which is a hidden folder.

See Relocate a Data Classification Database for a step-by-step procedure.

Data Classification Console - How To

Topics | How To

Start the Data Classification Console Add Computers with Data Classification to the Data Classification Clients List Administer Data Classification Services Administer Data Classification Server Properties Configure and Generate Reports Regarding Attributes of Files on a Data Classification Computer Administer Filtered Volumes on a Data Classification Computer Relocate a Data Classification Database Optimize a Data Classification Database

Page 38 of 65 Features - Data Classification Enabler

Administer Exchange Properties for Data Classification Start Data Classification for Exchange to Populate the Data Classification Database with Exchange Metadata

Start the Data Classification Console To start the Data Classification Console: 1. Launch the the CommCell Console. 2. From the CommCell Console, click Tools > Data Classification Console. The initial view of the Data Classification Console should be displayed.

Add Computers with Data Classification to the Data Classification Clients List To add computers with Data Classification to the Data Classification Clients list: 1. Right-click Data Classification Clients. 2. Click Add Client. 3. Type the name of the client in the dialog box and then click OK. 4. Right-click Data Classification Clients and click Refresh to repopulate the list.

Administer Data Classification Services To stop or start Data Classification services: 1. In the Data Classification Clients list, right-click the name of the computer whose services you want to administer. 2. Click Start DC Services or Stop DC Services as appropriate. 3. Click Yes in the dialog box. Then click OK. To stop and then restart Data Classification services: 1. In the Data Classification Clients list, right-click the name of the computer whose services you want to administer. 2. Click Recycle DC Services to stop and then restart the services. 3. Then click OK.

Administer Data Classification Server Properties To administer Data Classification Server Properties: 1. To view the Data Classification Server Properties for any computer, click the name of the computer in the Data Classification Clients list and check the Registry Values pane on the right-hand side. The associated controls (properties) will be displayed in the Control field, and the corresponding value for each control will be displayed in the Value field. 2. To change a value for any property, right-click the property name and click Edit. Thereafter, use the list or radial button within the Registry Value field in the Edit Registry Values dialog box to set the appropriate value. Then click OK. Specifically:

Page 39 of 65 Features - Data Classification Enabler

{ Data Classification Process Priority (DCProcessPriority) Set to Low to allow Data Classification processes to run at low priority mode during initial database population. Set to BelowNormal to allow the same to run at idle priority.

{ Runtime Scan Priority (RunTimeScanPriority) Set to Normal to allow Data Classification processes to run at normal priority mode after initial database population. Set to Low to allow the same to run at low priority, and set to BelowNormal to allow the same to run at idle priority.

{ Inactivity Wait Interval (InactivityWaitInterval) Set the interval of time in seconds after local computer activity stops that Data Classification services should resume.

{ New Volume Discovery Interval (NewVolumeDiscoveryInterval) Set the interval of time in seconds after which Data Classification should check for new volumes.

{ Monitor All Volumes (MonitorAllVolumes) Indicate whether or not you want to monitor all the affected NTFS volumes per updates to these volumes. If you do want to do this, click Yes; if not, click No. 3. Right-click the affected computer and click Recycle DC services to apply the changes.

Configure and Generate Reports Regarding Attributes of Files on a Data Classification Computer To configure and/or generate reports regarding attributes of files on a Data Classification computer: 1. Do one of the following: a. If you want to configure and generate a report, or if you just want to configure a report, go to the clients list and right-click the name of the Data Classification computer for whose files you want to configure and/or generate a report. Then click Reports and go to the next main step. b. If you just want to configure a report without generating it, go to the clients list and click the name of the Data Classification computer for whose files you want to configure a report. Thereafter, click Reports in the menu bar, click Report Configuration, and skip the next three main steps. 2. From the Report Type list, click the filter for file selection. 3. In the Volumes field, select the volume(s) whose files you want to administer. 4. Click Reports Configuration. 5. Make changes in either or both of the following two tabs as appropriate: a. In the File Name Patterns tab, ensure that the extension of each file on the affected computer is included in the File Names space. To remove an extension, click the extension in the space and then click Remove. To add an extension, type the extension (without '.') in the Enter File Name Pattern space and then click Add. b. In the File Size Ranges tab, ensure that the minimum and maximum file size ranges of the files on the affected computer are included in the Min(KB)|Max(KB) space at the top of the dialog box. To remove a file size range, click the range and then click Remove. To add a file size range, type the minimum size and the maximum size of the files that should be included in the Enter File Size Ranges spaces and then click Add. When you are finished, click OK. 6. If you are generating a report, click Generate Reports. Based on your inputs, a bar graph or pie chart will be displayed in Report Results. Mouse-over the shaded areas to display the counts of the

Page 40 of 65 Features - Data Classification Enabler

affected items. 7. Repeat this procedure for each computer for whose files you want to configure and/or generate reports.

Administer Filtered Volumes on a Data Classification Computer To filter a volume from a Data Classification computer (Method 1): 1. From the list of clients, click the name of the computer whose volume you want to filter. 2. In the Volumes Data pane, right-click the name of the volume to be filtered and click Stop Monitoring. 3. A message indicating that monitoring of the volume has stopped should be displayed. Click OK. 4. Right-click the affected computer and click Recycle DC services to apply the changes. Click OK. 5. Right-click the affected computer and click Refresh. The filtered volume should now be displayed in the Filtered Volumes pane and removed from the Volumes Data pane. To filter a volume from a Data Classification computer (Method 2): 1. From the list of clients, click the name of the computer whose volume you want to filter. 2. In the Filtered Volumes pane, right-click and then click Add. 3. In the DC Services dialog box, type the letter for the volume to be filtered and then click OK. 4. A message indicating that the volume has been added to the Filter Volume List should be displayed. Click OK. 5. Right-click the affected computer and click Recycle DC services to apply the changes. Click OK. 6. Right-click the affected computer and click Refresh. The volume should now be displayed in the Filtered Volumes pane and removed from the Volumes Data pane. To remove a volume filter for a Data Classification computer: 1. From the list of clients, click the name of the computer whose volume filter you want to remove. 2. In the Filtered Volumes pane, right-click the name of the volume whose filter you want to remove and then click Delete. 3. A message indicating that you should recycle the Data Classficiation services to apply the change is displayed. Click OK. 4. Right-click the affected computer and click Recycle DC services to apply the change. Click OK. 5. Right-click the affected computer and click Refresh. The volume should now be displayed in the Volumes Data pane and removed from the Filtered Volumes pane.

Relocate a Data Classification Database To relocate a Data Classification database: 1. From the list of clients, click the name of the computer whose Data Classification database you want to move to another location. 2. In the Volumes Data pane, right-click the name of the volume that includes the Data Classification database to be relocated and then click Relocate Database. 3. In the Relocate Database dialog box, specify the new location for the database in the Database Path field. To do this, type the path in the space or click Browse to find the path. 4. Click OK. The Database field for the affected volume should indicate that the database has been

Page 41 of 65 Features - Data Classification Enabler

marked for relocation.

Optimize a Data Classification Database To optimize the Data Classification database: 1. From the list of clients, click the name of the computer whose Data Classification database you want to optimize (defragment). 2. In the Volumes Data pane, right-click the name of the volume that includes the Data Classification database to be relocated and click Optimize Database. A message indicating that the database has been marked for defragmenting should be displayed.

Administer Exchange Properties for Data Classification To administer Exchange properties for Data Classification: 1. From the list of clients, click the name of the Data Classification computer whose properties you want to administer. 2. Click Exchange DC Journal Configuration. 3. Either accept the Exchange DC Status settings that are displayed or click and update the values as appropriate. 4. Start populating the Exchange Configuration pane as appropriate. { In the Domain Name\User Name and Password fields, type the account information that is required to access the affected Exchange server. The account should have administrator privileges. { In the Local path on Exchange server to generate Journal files field, type the path where the DC.INI file and the Data Classification database will be generated. This should be a local (and non-UNC) path. In a clustered environment, the file should be on a shareable drive. { In the Exchange Profile Name field, type the name of the profile that is associated with the appropriate administrator mailbox. { Click the Schedule to start Data Classification service later option to start the enumeration and sink later. Use the calendar list provided to set the appropriate date and time. { In the AD Server List field, type the domain name for one or more non-default Exchange servers. { In the Registration Timeout field, type the interval of time (in milliseconds) that the Data Classification service should wait for the sink to start on the Exchange server before timing out. Recommended value is 180000 (three minutes). { For the Support Deleted Items option, click the option to remove deleted messages from the index to prevent them from being searched by Online Content Indexing. { Click ProxySetup if you need to establish a proxy for Data Classification. { In the Network\UNC path to journal files field, type the path from the proxy machine to the DC.INI file and the Data Classification database. This path must point to the same location that is specified in the Local path on Exchange server to generate Journal files field above. Also, this path must always be a UNC path. In a clustered environment, the file should be on a shareable drive. { Click the Reuse Event registration Credentials option to reuse the credentials for accessing the affected Exchange server. If you click this option, type the account information that is required to access the affected Exchange server in the Domain Name\User Name and Password fields. The account should have administrator privileges. 5. Click Apply Settings to enforce your inputs.

Page 42 of 65 Features - Data Classification Enabler

Start Data Classification for Exchange to Populate the Data Classification Database with Exchange Metadata To start Data Classification for Exchange to populate the Data Classification database with Exchange metadata: 1. From the list of clients, click the name of the Data Classification computer whose properties you want to administer. 2. Click Exchange DC Journal Configuration. 3. Either accept the Exchange properties settings that are displayed or update the properties as appropriate and then click Apply Settings. 4. Click Start Exchange DC. This should populate the Data Classification database with Exchange metadata.

Page 43 of 65 Features - Data Classification Enabler

DC Client Command Line Tool for Unix

Topics | Related Topics

Overview DcClient Command

z Get Information on Volumes States and Registry Key Values z Start Monitoring a Volume z Stop Monitoring a Volume z Defragment a Volume z Relocate a Database Volume z Add a Volume to be Monitored z Remove a Volume from being Monitored z Change Registry Key Values

Overview The DcClient command is command line tool used to administer Data Classification volumes on Unix. This command allows you to do the following, among other tasks: change the Data Classification registry key values regarding volumes, administer the Data Classification database, stop and start individual volumes and stop and start all volumes simultaneously.

You can also administer some of these items by using the DC Console.

DcClient Command This section discusses how to use the DcClient command.

Get Information on Volumes States and Registry Key Values The following command displays information on the current state of the volumes and registry keys: Usage: DcClient -getinfo

Start Monitoring a Volume The following command starts monitoring a Data Classification volume: Usage: DcClient -start

The Data Classification services will start monitoring the volume unless the services are restarted or the corresponding stop command is run. Stop Monitoring a Volume The following command stops monitoring a Data Classification volume: Usage: DcClient -stop

Page 44 of 65 Features - Data Classification Enabler

The Data Classification services will stop monitoring the volume. Defragment a Volume The following command defragments (optimizes) a Data Classification database. Usage: DcClient -defrag

Defragmentation is especially helpful for reorganizing highly-active databases. Relocate a Database Volume The following command moves a database to another directory. In essence, the command relocates the database from the current path to the specified new path. Usage: DcClient -relocate

Add a Volume to be Monitored The following command adds a Data Classification volume to be monitored: Usage: DcClient -monitor

You can add either existing volumes or new volumes for monitoring. Remove a Volume from being Monitored The following command removes a Data Classification volume from being monitored: Usage: DcClient -donot-monitor

Change Registry Key Values You can use the DcClient command with various registry key values to set or implement the following:

z Centralized Location of Data Classification Database z Refresh Time Interval for All Monitored Volumes z Volume Picking and Scanning z Volume Scan Priority Some of the registry values used with the DcClient command can also be administered using the DC Console. For more information, see Administer Registry Values for a Data Classification Computer.

Centralized Location of Data Classification Database The following command creates a centralized directory for all the Data Classification databases. Usage: DcClient -edit DB_FOLDER

When you install Data Classification on a Unix computer, a Data Classification database is created on each volume. You can use the DB_FOLDER key to create a centralized database directory to accommodate all these databases. If you want Data Classification to create databases under the root of each volume, run the following command: DcClient -edit DB_FOLDER NULL

Refresh Time Interval for All Monitored Volumes

Page 45 of 65 Features - Data Classification Enabler

The following command changes the time interval to refresh all monitored volumes: Usage: DcClient -edit REFRESH_PERIOD

The REFRESH_PERIOD key indicates the period of time after which the Data Classification services will refresh all the monitored volumes. If any changes occur on the monitored volumes, the Data Classification services will attempt an incremental scan of the data after the refresh period. You can use this key to change the refresh interval. Depending on the time specified, if you add any new volumes, the services will "pick up" the volumes and start to monitor those volumes after the refresh period.

Volume Picking and Scanning The following command sets the number of volumes at a time that Data Classification will pick up and start to scan. Usage: DcClient THROTTLE

Default is 3.

Volume Scan Priority The following command sets the priority at which data scans on the volume will run. Usage: DcClient -edit PRIORITY

The value for the PRIORITY key depends on the operating system-level scan priority.

Back to Top

Page 46 of 65 Features - Data Classification Enabler

Rules and Queries - Data Classification Enabler

Topics | How To

Overview Use query string as rule

Overview Data Classification uses the traditional rules established for the supported File Archiver Agents. In addition, it provides several unique rules. These rules are available only when Data Classification is enabled for the supported agent, and the rules apply only to local data. All the rules for Data Classification are configurable from the DataClassSet subclient properties Rules tab of the File Archiver Agent. The unique Data Classification rules include the following:

z Folders/Files Owned By - Allows you to select and exclude files belonging to specific users and user groups z File Paths - Allows you to select and exclude files based on file location and specific file characteristics (e.g., file extensions) z SQL Query Strings - Allow you to use SQL query-like commands to define more complex rules based on your requirements Whenever a DataClassSet subclient is created, a default set of rules is established. These rules are reflected in the various rules tabs. Also, the SQL query string, which has its own tab, is automatically disabled; however, the default set of rules is formulated into the SQL query string. At this point, you can start changing rules either from the various rules tabs or within the SQL query string. If you change rules from the various rules tabs, these rules will be formulated into the SQL query string (which is still disabled). However, if you choose to enable and edit the SQL query string, the rules reflected in the string will be enforced, and the rules previously established by the various rules tabs will be disabled. Subsequently, if you disable the rules in the SQL query string, the rules that were previously in effect via the various rules tabs will be enforced. Each DataClassSet subclient has its own rules, and these rules can be shared by other DataClassSet subclients; however, changing the rules in one DataClassSet subclient will not affect the rules in another DataClassSet subclient. See Migration Archiving - File Archiver Agents for more information.

Use query string as rule Warning: SQL query string rules are intended only for advanced users, and they should be changed only at the risk of such users. If in doubt, do not change rules using SQL query strings without assistance from your Software Provider. Also, whenever you are modifying a SQL query string, be sure not to remove the following special tokens:

z For Windows, do not remove GALAXY_DIR, WINDOWS_DIR, and SYSTEM_DIR z For Unix, do not remove SYSTEM_DIR (/tmp, /usr, /etc, /kernel, etc.)

Removing any of these tokens may yield undesired results. By default, the rules that you select from the various rules tabs are formulated into a query that is used during archiving. This query is available from the Rules/Advanced tab within DataClassSet subclient properties. You can change the settings in this query to extend rules capabilities beyond what is possible using just the various rules tabs.

Page 47 of 65 Features - Data Classification Enabler

You can use the following database fields to create customized user-defined queries:

z Windows: { shortName - the file name { fullPath - the fully qualified path and file name { attrs - the file or folder attributes stored as a single decimal value (see the attribute definitions in the table below) { fileSize - the file size (in bytes) { lowFreeVolume - the low water free space on the volume expressed as a percentage { highFreeVolume - the high water free space on the volume expressed as a percentage { username - the file owner expressed as a domain\user name if the computer is in a domain; if not, the file owner is expressed as computer name\user name { groupname - the name of an Active Directory group (Windows) z Unix: { NAME - the file name { ANAME - the fully qualified path and file name { SIZE - the file size (in bytes) { lowFreeVolume - the low water free space on the volume expressed as a percentage { highFreeVolume - the high water free space on the volume expressed as a percentage { USER - the file owner { GROUP - the name of the group For Windows only, the following table includes the SQL advanced query values for file attributes in decimal and the associated meaning for each value. You can combine these values as necessary.

FILE_ATTRIBUTE_READONLY 1 FILE_ATTRIBUTE_HIDDEN 2 FILE_ATTRIBUTE_SYSTEM 4 FILE_ATTRIBUTE_DIRECTORY 16 FILE_ATTRIBUTE_ARCHIVE 32 FILE_ATTRIBUTE_DEVICE 64 FILE_ATTRIBUTE_NORMAL 128 FILE_ATTRIBUTE_TEMPORARY 256 FILE_ATTRIBUTE_SPARSE_FILE 512 FILE_ATTRIBUTE_REPARSE_POINT 1024 FILE_ATTRIBUTE_COMPRESSED 2048 FILE_ATTRIBUTE_OFFLINE 4096 FILE_ATTRIBUTE_NOT_CONTENT_INDEXED 8192 FILE_ATTRIBUTE_ENCRYPTED 16384

The following operators are supported:

z For Windows, AND, OR, NOT, LIKE, IN, =, <, >, %, and & are supported. Also, all queries must begin with "find files where". For example:

Page 48 of 65 Features - Data Classification Enabler

find files where fullPath like 'c:\users\tom%'

z For Unix, AND, OR, NOT, LIKE, IN, =, <, >, %, and & are supported. Also, all queries must begin with “SELECT ANAME,SIZE,FILE_TYPE,CTIME,MODE from files where”. For example: “ sql query: SELECT ANAME,SIZE,FILE_TYPE,CTIME,MODE from files where ANAME LIKE '/lvfsdmdc/Logs1/%'

Before running a job, you can test your query to raise the confidence level that the correct files will be archived. See Test Your Query Before it Runs for a step-by-procedure. Back to Top

Rules and Queries - Data Classification Enabler - How To

Topics | How To

Configure Archiving Rules - File Archiver Agents Test Your Query Before it Runs

Configure Archiving Rules - File Archiver Agents Required Capability: Capabilities and Permitted Actions To configure archiving rules for the File Archiver Agents: 1. From the CommCell Browser, right-click the subclient whose archiving rules you want to configure, then click Properties from the shortcut menu. 2. Click the Rules tab of the Subclient Properties. 3. Click the File Rule tab (if not already selected). a. Enable the Archiving Rules. b. Configure the Archiving Rules. 4. Click the Stub Rule tab and configure the Archiving Rules. 5. If you are using the Data Classification Enabler: a. Click the Folders\Files Owned By tab. b. Configure Archiving Rules to include or exclude files that belong to specific users or user groups. Once added, files can be removed.

Follow the steps to look up a user in Use Account for Data Classification.

c. Click the File Paths tab. d. Configure Archiving Rules to include or exclude specific folders/files. Once added, files can be removed. As appropriate, use wildcards as discussed in the Note for File Paths. e. Click the Advanced tab. f. Configure Archiving Rules to configure an advanced query string as a rule. Accept the SQL query string displayed on the tab or use the space to edit the string. Optionally, the query can be saved to a file 6. Click OK to save your changes.

Page 49 of 65 Features - Data Classification Enabler

Test Your Query Before it Runs Related Topics

z Establish and Use Rules and Queries - Data Classification Enabler

z Create a New Subclient

z Configure Subclient Content Required Capability: See Capabilities and Permitted Actions To test your query before it runs: 1. After you create and save a DataClassSet subclient, open the DataClassSet Property dialog box, click Rules and then Advanced. 2. Click View Rules as Query. 3. In the View Rules as Query dialog box, click Save to File. 4. Create the file for the rules (e.g., c:\sql.txt) by providing the location and file name and then click Save. 5. On the client computer, open a command window and navigate to the /base folder where the software is installed. 6. If you are adding a user, run the appropriate gxdcquery -input -output command, where { is the SQL query input { includes the names of the files that would be selected for archiving

and press Return. For example:

gxdcquery -input c:\sql.txt -output c:\dc.txt

If you are adding a user group, run the appropriate

gxdcquery -input -groups -user -domain -pw -groups -output command, where

{ is the SQL query input { is the user name used to authenticate against the domain controller { is the domain name used to connect to a specific domain controller { is the password used to authenticate against the domain controller { includes the names of the files that would be selected for archiving

and press Return. For example:

gxdcquery -input c:\sql.txt -groups -user pilot -domain master -pw eingang -groups - output c:\dc.txt

The query will run and return the list of files that would be selected for archiving in the output file.

Back To Top

Page 50 of 65 Features - Data Classification Enabler

Users and User Groups - Data Classification Enabler

Topics | How To

Overview Active Directory User Authentication (Windows)

Overview Data Classification can support various types of users and user groups. Unix For Data Classification on Unix, ensure that you are using valid users and groups on the system. Windows Data Classification on Windows can look up files belonging to the following:

z Domain users z Local users Note that Data Classification on Windows supports files owned by domain user groups but not local user groups. Also, all domain users and domain user groups used by the Data Classification Enabler must always reside under the "Builtin" or "Users" organization unit. Data Classification on Windows allows you to archive data owned only by local users or by users who are members of the same domain as that of the client computer. It does not allow you to archive data owned by users who are members of a domain other than that of the client computer. The user name and password specified at the agent level allow Data Classification-enabled File Archiver for Windows Agent jobs to search Active Directory for user membership of groups that are specified in a Data Classification rule. If Data Classification subclient rules do not include user groups, no Active Directory authentication is required during an archive operation.

Active Directory User Authentication (Windows) The Data Classification Enabler on Active Directory for user information under two conditions:

z When performing a subclient content Browse for users or groups When you are defining subclients and you browse to look up users or user groups, you are prompted to provide a valid domain user name and password. This account can be any account that has permission to list Active Directory users or groups. Note that if the user name and password at the Enabler level have not yet been specified and a user or group Browse is conducted, the user name provided will be saved at the Enabler level for later use during subsequent archive jobs. Only one user name can be stored for this purpose.

z If an archive is performed and the subclient rule contains a group When a Data Classification-enabled data archive job runs, Active Directory will be queried for a list of users if the Data Classification subclient rule contains a "Group" entry. The user name and password used to perform this query are the ones stored at the agent level. See Use Account for Data Classification (Windows) for step-by-step instructions.

Back to Top

Page 51 of 65 Features - Data Classification Enabler

Users and User Groups - Data Classification Enabler - How To

Topics | How To

Use Account for Data Classification (Windows) Configure Archiving Rules - File Archiver Agents

Use Account for Data Classification (Windows) Required Capability: See Capabilities and Permitted Actions To use/change a user account to authenticate against the Active Directory domain controller to look up group membership: 1. From the CommCell Browser, expand the tree if necessary to view the desired enabler-level icon. 2. Right-click the agent icon and then click Properties from the short-cut menu. 3. From the General tab in the agent dialog box, click Authenticate Active Directory Domain Controller. 4. Type the domain name, user name, and password in the appropriate spaces. 5. Make any other desired changes and then click OK. To use/change a user account to authenticate against the Active Directory domain controller to look up a user or (if the authentication is not already saved) to look up group membership: 1. From the CommCell Browser, if you are creating a new DataClassSet (subclient), expand the tree if necessary to view the appropriate DataClassSet backupset-level icon. Then right-click the backup set, click All Tasks, click Create DataClassSet, and start to create the DataClassSet. If you are using an existing DataClassSet, right-click the appropriate DataClassSet subclient and then click Properties. 2. From the Rules tab of the DataClass Property dialog box, click Folders\Files Owned By. 3. In the Folders\Files Owned By tab, to include files owned by a specific user in the migration, click Include Users and then do one of the following: { Click Browse. { Click User and then Browse. { Click User Group and then Browse. Then type the appropriate domain name, user name, and password in the Authenticate Active Directory Domain Controller dialog box and click OK. 4. Click the appropriate user groups/users and then click Add. 5. Repeat the preceding two steps as necessary. 6. In the Folders\Files Owned By tab, to exclude files owned by a specific user in the migration, click Exclude Users and then do one of the following: { Click Browse. { Click User and then Browse. { Click User Group and then Browse. Then type the appropriate domain name, user name, and password in the Authenticate Active Directory Domain Controller dialog box and click OK. 7. Click the appropriate user groups/users and then click Add.

Page 52 of 65 Features - Data Classification Enabler

8. Repeat the preceding two steps as necessary. 9. Make any other desired changes and then click OK.

Configure Archiving Rules - File Archiver Agents Required Capability: Capabilities and Permitted Actions To configure archiving rules for the File Archiver Agents: 1. From the CommCell Browser, right-click the subclient whose archiving rules you want to configure, then click Properties from the shortcut menu. 2. Click the Rules tab of the Subclient Properties. 3. Click the File Rule tab (if not already selected). a. Enable the Archiving Rules. b. Configure the Archiving Rules. 4. Click the Stub Rule tab and configure the Archiving Rules. 5. If you are using the Data Classification Enabler: a. Click the Folders\Files Owned By tab. b. Configure Archiving Rules to include or exclude files that belong to specific users or user groups. Once added, files can be removed.

Follow the steps to look up a user in Use Account for Data Classification.

c. Click the File Paths tab. d. Configure Archiving Rules to include or exclude specific folders/files. Once added, files can be removed. As appropriate, use wildcards as discussed in the Note for File Paths. e. Click the Advanced tab. f. Configure Archiving Rules to configure an advanced query string as a rule. Accept the SQL query string displayed on the tab or use the space to edit the string. Optionally, the query can be saved to a file 6. Click OK to save your changes.

Back To Top

Page 53 of 65 Features - Data Classification Enabler

Services

Topics | How To | Related Topics

Overview Service Dependencies TCP Ports Used for Services

z Static Ports z Dynamic Ports Binding Services to Specific Network Interface Cards (NIC)

z Windows z Unix Service Control

z Service Control for Windows z Service Control on Windows Cluster z Running Services Using a Windows User z Service Control for Unix z Service Control for NetWare z Service Control for Content Indexing GxAdmin Tool

Overview Several services are required by the software to function. For example, all computer configurations minimally require the Base services to be running. CommServe services are installed exclusively on the CommServe computer, and MediaAgent services are installed exclusively on the MediaAgent computer. The following table describes the various CommCell components and the appropriate services that are required. Note that these services are automatically installed when the appropriate software is installed on the computer.

CommCell components Service Service Name Service Name (as Description Group (as (as displayed displayed in displayed in in the Windows Task the Service Windows Manager) Control Local Services Manager) dialog box)

Client only (any system) Base services Bull Calypso EvMgrC Forwards events Client Event generated on the local Manager machine to the CommServe. In additio helps the CommServe browse the application data on local machine.

Bull Calypso CVD Provides the ability to Communications fetch or save metadata Service the CommServe when data protection or data recovery operations are

Page 54 of 65 Features - Data Classification Enabler

progress.

CommServe only Base services Bull Calypso EvMgrS Responsible for Server Event communicating with Manager CommCell Console and receive the events from the Clients and/or MediaAgents.

Bull Calypso CVD Provides the ability to Communications fetch or save metadata Service the CommServe when data protection or data recovery operations are progress.

CommServe Bull Calypso AppMgrSvc Provides access to serv services Application and client configuration Manager for local and remote processes. This service essential for the CommServe.

Bull Calypso Job JobMgr Responsible for running Manager and controlling jobs an also communicate with the available resources

Bull Calypso MediaManager Responsible for control Media & Library the hardware devices t Manager are part of a CommCel

Bull Calypso QSDK Responsible for servicin Commands command line requests Manager and is therefore essent for command line operations.

Bull Calypso SRMServer Responsible for sending Storage and receiving data to a Resource from the SRM clients. Manager

MediaAgent only Base services Bull Calypso EvMgrC Forwards events Client Event generated on the local Manager machine to the CommServe. In additio helps the CommServe browse the application data on local machine.

Bull Calypso CVD Provides the ability to Communications fetch or save metadata Service the CommServe when data protection or data recovery operations are progress.

MediaAgent Bull Calypso cvmountd Responsible for services Media Mount interacting with the Manager hardware devices that (GxMMM) attached to the local ho

Page 55 of 65 Features - Data Classification Enabler

and are part of the CommCell.

Migration Archiver Agents DataArchiver Bull Calypso GXHSMService Installed on clients with Services HSM Recaller Migration Archiver Age Responsible for archivin or recovering the files based on rules defined the migration archiving operation.

Data Classification Data Bull Calypso GXDCService Installed on Clients wit Agents Classification Data (Windows), DcSvc the Data Classification services Classification (Unix) Agent. Responsible for Enabler classifying data which i in turn used by the appropriate agent.

ContinuousDataReplicator CDR Services Bull Calypso CVRepSvc Installed on Clients wit Agents Replication ContinuousDataReplica Service Responsible for replicat data from one client computer to another client computer.

VSS Provider Agents VSS Provider Bull Calypso VSS_SWPROV_SVC Makes use of the Volum Service VSS Provider Shadow Service feature Service ( the Windows Server 20 operating system.

Client only (non- This service Bull Calypso GxClusPlugin Provides notification CommServe Virtual is not Cluster Plugin regarding whether or n Machines) managed by the cluster group goes the Service into an active or passiv Control state. This service is Manager. essential for system The Cluster functionality. Administrator must be used to control this service.

Client only Data Bull Calypso GXSHMServiceNTAP Recall files on a NAS Archiver HSM NAS Share. Services Recaller Service

For the Solaris File System iDataAgent, if the Bull Calypso Communications Service is on an IPv4-only stack (e.g., you do not have a local host IPv6 configured), be sure to do the following before you run a data protection operation: 1. Enable the IPv6 stack. 2. Change nPreferredIPFamily to 1 (i.e., force IPv4).

3. Remove or comment out ‘::1’ from /etc/inet/ipnodes.

4. Alter startup to run on just the local host IPv6. For example:

ifconfig lo0 inet6 plumb

Page 56 of 65 Features - Data Classification Enabler

route add –inet6 ::1/128 localhost ifconfig lo0 inet6 up

Service Dependencies When a system has more than one CommCell component, the service dependencies are as follows:

Using the Service Control Has the following effect: Manager to do the following:

Stop all services used by the Stops all services on that system. system

Start all services used by the Starts all services on that system. If the DataArchiver agent is system installed, starting all services starts the GXHSM recaller service.

Stop the Base services Stops all services on that system because all services depend on the Base services.

Start the Base services Starts only the Base services. Restarting the other services can be done individually or by restarting all services simultaneously.

Stop the CommServe services Stops only the CommServe services.

Start the CommServe services Starts the Base and CommServe services.

Stop the MediaAgent services Stops only the MediaAgent service.

Start the MediaAgent services Starts the Base and the MediaAgent services.

Stop DataMigrator services If the DataArchiver agent is installed, stopping this service stops the GXHSM service for this agent.

Start DataMigrator services If the DataArchiver agent is installed, starting this service starts the GXHSM service for this agent.

Stop Data Classification services If the Data Classification Enabler is installed, stopping this service stops the GXDCService (Windows) or DcSvc (Unix) and all child GxDC (Windows) processes for this enabler.

Start Data Classification services If the Data Classification Enabler is installed, starting this service starts the GXDCService (Windows) or DcSvc (Unix) and all child GxDC processes for this enabler.

Stop CDR services If ContinuousDataReplicator is installed, stopping this service stops the CVRepSvc Service for this agent.

Start CDR services If ContinuousDataReplicator is installed, starting this service starts the CVRepSvc Service for this agent.

Stop VSS Provider Service If the VSS Provider is installed, stopping this service stops the GxVSSProv Service for this agent.

Start VSS Provider Service If VSS Provider is installed, starting this service starts the GxVSSProv Service for this agent.

TCP Ports Used for Services Base Services provide the key communications and control link between components. These services are

Page 57 of 65 Features - Data Classification Enabler assigned registered network port numbers and are identified in the /Windows/System32/Drivers/etc/Services file as Static Ports and Dynamic Ports.

Static Ports

For types of static ports, see Network TCP Port Requirements. Dynamic Ports Cvd sessions also use free ports between 5000 and 32767 for communication during data protection and data recovery jobs. The system will dynamically assign a number of free ports to be used by each job to allow parallel data movement. The client, CommServe, and MediaAgent all send job related communications using that port number. Once the job is finished and no other job is pending, the dynamic ports are released. For more information about ports used by the system, see Network TCP Port Requirements.

Binding Services to Specific Network Interface Cards (NIC) By default the system binds the services to all the available NICs, if it is not configured. You can however, bind the services to a specific NIC using the steps described below. Note that this operation is not recommended for clustered computers. (In a clustered environment, failover will not work if the services are bound to a specific NIC.) Windows 1. Create the IPsToBind.txt file in the \Base folder. 2. Add the IP address or the interface name associated with the NIC cards(s) that must be used. There must be one entry per line, as shown in the following example: 123.45.67.895 interface1.company.com

3. Save the file and then stop and start the services.

Note that if the IPsToBind.txt file is created, at least one valid IP address must match the resolved IP address of the interface name provided for the Client or else services will not start.

Unix 1. Create the sBindToInterface registry key on the computer and provide the host name or IP address of the interface to which all services should bind. 2. Stop and start the services. (Note that the system also allows you to define the interface pairs for data transfer between any two computers. See Data Interface Pairs for more details.)

Service Control Service Control for Windows The Service Control Manager can be used to stop and start services used by the system on Windows. Services on any Windows client computer within the CommCell can also be stopped and started remotely from another Windows client computer within the CommCell. The Service Control Manager window includes the following fields: Computer The host computer of the services. Services Allows you to select either All Services,

Page 58 of 65 Features - Data Classification Enabler

Base Services, CommServe Services, or MediaAgent Services. The All Services option starts (or stops) all services on the local computer, regardless of the components you have installed. Base Services Either stopped or running. CommServe Either stopped or running. Services MediaAgent Either stopped or running. Services DataMigrator Either stopped or running. Services Data Either stopped or running. Classification Services CDR Services Either stopped or running. VSS Provider Either stopped or running. Service Start/Stop Data If selected, and all services on the local Classification computer are started (or stopped), then all Services With Data Classification Enabler services will All Services start (or stop) as well. Auto-Start If selected, all services applicable to the Services when OS local system will start automatically when Starts the system is started. If cleared, the services must be started manually. Retrieve Remote If selected, the Computer field is populated Clients with the names of all the remote Windows client computers within the CommCell. From this field you can remotely stop or start services from any selected Windows client computer within the Computer field.

This feature is not support for Clients installed with an IP address, and Clients on Windows 64-bit computers.

If a client computer within the CommCell is running on a Windows Server 2008 Core Operating System, the Service Control Manager must be launched remotely from another Windows client computer within the CommCell. From the remote client computer, you will be able to start/stop services of the client.

Service Control on Windows Cluster A clustered environment that is not a Windows cluster (VERITAS, Polyserve, etc.) requires that each physical node bind only to that node's specific IP address. Each physical node needs an IPsToBind.txt file in the Base directory. This will force the services on each node to bind to the node's IP address and not the virtual machine IP address. This is not necessary in a Windows clustered environment. Running Services Using a Windows User You can create a User (and not the local system account) to run services and operations only for the Windows File System iDataAgents and SQL Server iDataAgent. By default, services run as a local system account. The created User will run services to back up and restore files and folders regardless of ownership, permissions, encryption, or auditing settings. The User will use several built-in groups, including Backup Operator, Administrator, and Local Administrator. These groups have the necessary permissions and user rights defined. Only a member of the Administrator group can assign users as Backup Operators. Warning: You may be required to edit the registry. However, before you do this, back it up and ensure that you understand how to restore it if a problem occurs. For information about how to do this, view

Page 59 of 65 Features - Data Classification Enabler

Windows Help on the "Restoring the Registry" from Microsoft Help topic in Regedit.exe or the "Restoring a Registry Key" Help topic in Regedt32.exe.

See User Accounts and Passwords: Considerations When Using a Windows User to Run Operations for more information along with the following procedures:

z View or Modify User Rights Assignments on a Workgroup or Member Server z View or Modify User Rights Assignments on a Domain Controller z Set up User Permissions and Rights on a Windows Workgroup or Member Server z Set up Rights on a Windows 2000 Domain Controller z Set up Registry Permissions on Windows 2000 z Set up Folder Permissions Service Control for Unix The services can be stopped or started from a Unix system. However, only one instance of start/stop services at a time is allowed on the system. If you attempt more than one such instance, the appropriate error message is displayed. The following commands can be used to start and stop services:

The -instance option now refers to product installation instances. Such instances have an independent set of binaries, use different network ports, and may talk to different CommServes. Instances have nothing to do with virtual machines; they allow you to have two independent installations of the software on the same machine.

Command Usage

Calypso -all|-instance Brings up services on all configured instances (-all). The - [-force] start instance switch can be used to start services on a specific instance only. The software will refuse to start if it detects partially installed patches. In such cases, you can either install the latest service pack or start the software with the -force option and use Automatic Update to push patches from the CommServe.

Calypso -all|-instance Stops services on all configured instances (-all). The -instance stop switch can be used to stop services on a specific instance only.

Calypso -all|-instance This is the same as "Stop" followed by "Start." restart

Calypso -all|-instance Lists all running services on all instances or just . list

Calypso -all|-instance Provides information about the client installation and for all status configured instances on the client. The -instance switch can be used to provide information for a specific instance only.

Calypso -csname Displays the name of the CommServe and the instance. For a multi-instance installation, displays the name of all the affected CommServes and instances.

Calypso help Displays this help message.

If services go down during a process (e.g. install, backup, etc.), the Calypso start command will not restart services unless the command is included in the crontab file. See Ensure Restarting of Services Using crontab for step-by-step instructions

Service Control for NetWare Services used by the system can be stopped or started

Page 60 of 65 Features - Data Classification Enabler from the NetWare Server using the Load/Unload commands, or from a remote Novell Client PC using the NetWare Service Manager. The NetWare Service Manager window includes the following fields:

NW Servers Allows you to select the name of the NetWare server for service control. Services Allows you to select either All Services, Base Services, or MediaAgent Services. The All Services option starts (or stops) all services on the local computer, regardless of the components you have installed. Refresh Services Allows you to refresh the selected services. NLM Status Displays the current status of the NetWare Loadable Module (NLM).

Service Control for Content Indexing For information on controlling the services for Content Indexing, see Content Indexing Services.

GxAdmin Tool GxAdmin is a real-time CommCell administration tool used to view, monitor, and administer various services and processes of the CommCell components installed in the local Windows machine as well as remote Windows clients. This tool is available on Windows computers in the /base folder. The tool contains the following tabs: General The general tab displays the CommCell deployment details such as instance name, CommServe Host name, client name, and the client Host name. It also displays the configuration details of each CommCell component installed in the selected client. The following details are displayed:

z Software version z Installation directory z Available free space z Location of the Job Results (if applicable) By default the details of the local client is displayed. See Using GxAdmin to View Remote Clients to view remote client details using the GxAdmin tool. Services The services tab displays the services for each CommCell component installed with the instance name and the status of the service. The individual services can be managed, see Using GxAdmin to Start/Stop

Page 61 of 65 Features - Data Classification Enabler

Services for step-by-step instructions. Processes The processes tab lists the currently running CommCell processes with details such as start time, memory used, thread count, and handle count. The log details of the process can be stored in a dump file and used for troubleshooting purposes. See Using GxAdmin to Create Process Dump for step-by-step instructions.

Back To Top

Page 62 of 65 Features - Data Classification Enabler

Frequently Asked Questions - Data Classification Enabler

Choose the following topic:

z Overview

z Services/Processes

z CommCell Console Operations

z Volumes

z Metafiles

z Users/User Groups

z Mount Points

Overview This section contains typical questions and answers regarding various aspects of Data Classification. These questions and answers are organized by sections identifying various related items. Where appropriate, links to related information are included.

Services/Processes

z After I install Data Classification, why does the initial scan not collect any data?

If there is activity on the system (e.g., use of the keyboard or mouse) during the initial scan, the scan will not run until the system is idle for at least 30 seconds. To prevent this, during the install, either select a later date and time to run the initial scan or do not use the computer for some time depending on the amount of data on your system. See Install Data Classification: Administer Data Classification for more information.

z For Windows, why are there multiple instances of GXDC.exe running in the ?

GXDC.exe is a "child process" for the Data Classification Enabler service (GXDCService.exe). Each instance of GXDC.exe monitors a volume on your local system. If you have five volumes on your system, there will be five instances of GXDC.exe running. If you remove a volume, its corresponding GXDC.exe process goes away.

z For Windows, can I change the priority of my GXDC.exe processes?

Service priority can be changed to low, normal or idle. Changing the priority will cause the GXDC.exe process to re-start. If the priority is modified during the initial scan phase, a warning message is displayed and, if the user insists, the job stops, and scanning starts all over again. See Administer Data Classification Server Properties for more information.

z What happens when I disable Data Classification?

Monitoring of all volumes (by GXDC.exe for Windows) stops, and no further updates are made to the metafile databases.

z What happens when I disable Data Classification during an initial scan?

Page 63 of 65 Features - Data Classification Enabler

Upon re-enabling Data Classification, the initial scan will be restarted from the beginning. Any data collected thus far will be discarded.

CommCell Console Operations

z Can I modify the priority of the Data Classification services?

Yes. You can select your desired priority (low, normal, or idle) from the Data Classification Administration Utility. See Administer Data Classification Server Properties for more information.

z Can I change the location of my metafile?

Yes, from the Data Classification Administration Utility. See Change the Metafile Location for more information.

z How do I create a new DataClassSet subclient?

Right-click the DataClassSet and click the option to create a DataClassSet subclient. See Subclients for more information.

z Where can I modify the SQL query for a specific DataClassSet?

From the CommCell Console, go to subclient properties and click Rules > Advanced to view and modify the SQL query. However, modifying the query is recommended only for advanced users. If the modified script is incorrect and an archive is run, improper results may be obtained. See Rules and Queries - Data Classification Enabler for more information.

z Can I go back to using the rules instead of the SQL Query?

Yes. Deselect the check box in the Advanced tab of the subclient content. However, consider saving the SQL query changes to a file before saving the subclient.

Volumes

z I created a new volume on my local system but the metafile database was not created. What should I do to correct this?

Restart the Data Classification Enabler services from the Service Control Monitor. See Services for more information.

z What happens if I add a new volume after the install is completed?

The new volume should automatically be detected and a new metafile database will be created.

Metafiles

z Can I change my metafile database location on a cluster?

Yes. However, for clusters, we recommend that the metafile database exists at the root of each volume (default location). Moving the metafile database to a central location on a physical node will prevent access to it by other nodes in the case of a failover condition. See Change the Metafile Location for more information.

Page 64 of 65 Features - Data Classification Enabler

z What happens if the Data Classification services on the client are not running and I change the metafile database location?

A new metafile database will be re-created from the scan phase in the new metafile database location once the services are restarted.

Users/User Groups

For all items in this section, see Users and User Groups - Data Classification Enabler for more information.

z Does Data Classification support local user groups?

No. Only local users are supported.

z Does Data Classification support domain users and domain user groups?

Yes.

z Are there any limitations on where the user and user groups should reside in the Active Directory Domain controller?

The user and the user groups should always reside under the “Builtin” or “Users” organizational unit.

z What happens if I change the location of the user/user group from "Builtin" to "Users" or vice versa after the install?

Nothing adverse will happen; archives and recoveries will still succeed.

z Do I have to be an administrator to authenticate against the Active Directory Domain?

No. Any user can authenticate as long as the user exists in the domain where the client computer resides.

Mount Points

z Are mount points supported?

Yes. Mount points for volumes having no drive letter are supported. Each such mount point has its own metafile database. For mount points, the metafile database name is "[mountpoint]_db.db" (e.g., for a volume mounted on C:\mountpoint, the file is "mountpoint_db.db", and it resides in the C:\mountpoint directory).

Page 65 of 65