Retain Storage
Revision as of 16:45, 8 January 2015
Retain uses a hybrid storage system. All message headers and metadata are stored in the database, but all message text and attachments are stored as files on the file system in the location listed here (and its subdirectories).
This is similar to GroupWise, where the OFFILES directory stores Binary Large Objects (BLOBs, i.e., anything larger than 2KB). In the case of Retain, this is transparent to the user, and you may choose whether to encrypt this data. If it is stored in a location accessible ONLY to the Retain Server, then encryption is not necessary and we recommend not using it.
Level 1
Retain supports changing the location or engine used to store the archived mail.
To change the storage settings that were created during the initial setup of Retain, simply select the storage engine of your choice.
If the Standard Engine is selected, the Standard Engine window is displayed. It shows the default storage location, which is set under Server Configuration, on the Storage tab.
The other three options are:
- Legacy Engine
- Centera Storage Device
- NetApp SnapLock
Enhanced Standard Engine
The Enhanced Standard Engine is the standard storage option and should be selected for all new installations of Retain. The storage engine utilizes locally accessible drives and media. When selecting this option, make sure that storage requirements will be easily met and that capacity can be increased and managed as needed.
The enhanced engine also allows for background compression to be used. Background compression is recommended for social media and mobile modules, to help keep the size of the data store down.
Legacy Engine
The legacy engine works as a seamless connection to the 2.x line of Retain, and is available as a bridge for upgraded devices and legacy support systems.
Centera Storage Device
The Centera Storage option simply requires you to enter the storage server IP address or DNS name, and the authentication method.
The authentication options are PEA and Credentialed. The PEA option requires the full path to the PEA file. For credentialed authentication, simply enter the username and the password for the Centera server.
NetApp SnapLock
NetApp is fully supported with SnapLock for storage. Note the information in the option. To enforce WORM rules you MUST set the Job Expiration feature. Also set the IP address, username, password, and the correct path to the server. THE PATH MUST BE SET IN NETAPP FORMAT.
Partitions
Retain also supports defining a different storage location through the partitioning option. Partitioning is only available when Storage is set to the ‘local’ drive. To access the partitions option, click on “partitions” in the Enhanced or Standard Engine window. Ideally, this option would only be used to archive to a new, larger location when the current storage path has been filled, or to separate mail storage between years (e.g., /archive/2008; /archive/2009).
A Retain system without a specified partition location will store all mail in the default storage location. This is defined in the Server Configuration menu, under the Storage tab mentioned previously. The default location is set during the initial install.
READ THE BOLD WARNING ON CHANGING PARTITIONS!
You may only specify one new partition at a time. After you have entered the new partition information, click ‘save changes’. Creating or changing a partition requires Tomcat to be restarted. When adding a partition on Linux, the path needs to be created manually on the Linux server before specifying the path within Retain. Ensure that Tomcat is given permissions to the new path.
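As a rough illustration of the Linux preparation step, the following Python sketch creates a partition directory and checks that the current process can read, write, and traverse it. The function name is hypothetical and not part of Retain. The check is only meaningful if the script is run as the account Tomcat runs under, which is an assumption about your environment.

```python
import os


def prepare_partition_dir(path: str) -> bool:
    """Create the directory if it is missing, then report whether the
    current process can read, write, and enter it.

    Hypothetical helper: run it as the Tomcat service account so the
    result reflects the permissions Tomcat will actually have.
    """
    os.makedirs(path, exist_ok=True)
    return os.access(path, os.R_OK | os.W_OK | os.X_OK)
```

If the function returns False, adjust ownership or permissions on the path before entering it in Retain.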
Once Tomcat has been restarted, the new partition can be viewed, edited, or deleted from this window. NOTE: This does not move any of the data stored in the original location configured under Server Configuration | Storage.
Partitions are listed with their Begin Date, Name, and Path. The Begin Date is the creation date. A partition is used to store all archived mail from the moment it is created.
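The Begin Date rule can be sketched in a few lines of Python. This is only an illustration of the behavior described above, not Retain's implementation: the function name, the tuple layout, and the default path are assumptions, and the sketch assumes the archive time of a message decides which partition receives it.

```python
from bisect import bisect_right
from datetime import datetime

DEFAULT_PATH = "/retain/archive"  # hypothetical default storage location


def partition_for(archived_at, partitions, default_path=DEFAULT_PATH):
    """Return the path that would receive mail archived at `archived_at`.

    `partitions` is a list of (begin_date, path) tuples. The partition with
    the latest begin date not after `archived_at` is chosen; if no partition
    has started yet, the default storage location is used.
    """
    ordered = sorted(partitions)
    begins = [begin for begin, _ in ordered]
    index = bisect_right(begins, archived_at)
    return ordered[index - 1][1] if index else default_path
```

For example, with partitions beginning on 1 January 2008 (/archive/2008) and 1 January 2009 (/archive/2009), mail archived in June 2008 maps to /archive/2008, and mail archived before either Begin Date maps to the default location.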
Because of how Retain indexes and tracks messages that are archived, deleting a partition can lead to catastrophic data loss. Do not delete an active storage partition or location without first consulting support.
If a partition is deleted, Retain will expect the archived mail that was stored in the deleted partition to be in the default storage location (defined in Server Configuration, under the Storage tab). Removing a designated partition in the management console does not delete the storage directories or data on the hard drive or SAN, but it does disconnect Retain from the storage location.
All partition changes are logged in the partition.log file, located in the backup directory specified in Server Configuration (by default this is located in [retain storage location]/backup).
How It Works
How Retain Determines File Uniqueness
Retain creates a "hash" of the message body and its attachments; however, it does not include the message metadata in the hash calculation. Thus, a single BLOB on disk would be created for two messages that have identical message bodies and attachments but whose date/time/recipients are different. In such cases, the database would get a new entry for each of those messages, but only one file on disk would exist.
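The idea can be illustrated with a minimal Python sketch. The hash algorithm Retain actually uses is not stated here, so SHA-256 is an assumption, and the `Archive` class, `content_key` function, and in-memory dictionaries are hypothetical stand-ins for the file system and database.

```python
import hashlib
from dataclasses import dataclass, field


def content_key(body: bytes, attachments: list[bytes]) -> str:
    """Hash the body and attachments only; metadata is deliberately excluded."""
    digest = hashlib.sha256()  # assumed algorithm, for illustration only
    for part in [body, *attachments]:
        # A length prefix keeps the boundaries between parts unambiguous.
        digest.update(len(part).to_bytes(8, "big"))
        digest.update(part)
    return digest.hexdigest()


@dataclass
class Archive:
    blobs: dict[str, bytes] = field(default_factory=dict)  # stands in for files on disk
    rows: list[dict] = field(default_factory=list)         # stands in for database entries

    def store(self, metadata: dict, body: bytes, attachments: list[bytes]) -> str:
        key = content_key(body, attachments)
        # Only the first message with this content writes a blob.
        self.blobs.setdefault(key, body + b"".join(attachments))
        # Every message gets its own database entry pointing at the blob.
        self.rows.append({**metadata, "blob": key})
        return key
```

Storing the same body and attachments twice with different recipients yields two entries in `rows` but a single entry in `blobs`.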
See KB, "Where Data is Stored in Retain" for more information.