HubStor Inc.

How To Exclude Specific File Extensions (Types) From Archiving


This article walks through how to exclude specific file types from being archived (ingested) into HubStor. This is done by using Exclusions within the Connector Write Policy.

In this scenario, we want to exclude .txt and .rpt files that reside under the root path C:\Files from being archived.


Click on the Write Policy Tab at the top of the Connector Properties page. 



Click the 'Add Grouping' button.



Provide a Name
Choose 'Add Clause'
From the drop-down, choose 'MetadataSpecificValue', then choose OK


Choose 'ItemType' from the drop-down
Click 'Add Value'
Enter the file extension. Only one file extension can be added per line, so click 'Add Value' once for each extension that should be excluded. If there are many extensions to exclude, add them to a .csv file (for example, using Excel) and use the 'Import' button to add them all to the list at once. Once all file extensions are added, click OK.
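For a long list of extensions, the .csv file can also be generated with a short script instead of Excel. The following is a minimal sketch, not a HubStor tool: it assumes the 'Import' button accepts one extension per row with a leading dot, which this article does not confirm, so check the result against a value entered by hand before importing. The function names and the output file name are hypothetical.

```python
import csv
from pathlib import Path


def normalize_extension(ext: str) -> str:
    """Trim whitespace, lowercase, and ensure a single leading dot.

    Assumption: the import expects values such as ".txt"; adjust this
    function if the extension should be entered without the dot.
    """
    cleaned = ext.strip().lower().lstrip(".")
    if not cleaned:
        raise ValueError("empty file extension")
    return "." + cleaned


def write_extension_csv(extensions, csv_path):
    """Write one normalized, de-duplicated extension per row; return the rows."""
    rows = []
    for ext in extensions:
        value = normalize_extension(ext)
        if value not in rows:
            rows.append(value)
    with open(csv_path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        for value in rows:
            writer.writerow([value])  # one extension per line
    return rows


if __name__ == "__main__":
    # Hypothetical output file name; the extensions come from this scenario.
    write_extension_csv([".txt", ".rpt"], Path("excluded_extensions.csv"))
```

De-duplicating and normalizing before the import keeps the list in the dialog short and avoids entries that differ only in case or stray spaces.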



The end result is a grouping that contains a single 'ItemType' clause listing each excluded extension (in this scenario, .txt and .rpt).



Click OK to save all changes. That concludes the process. The next time the connector crawl runs, it will skip files with these extensions and will not archive them.
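Before the next crawl, it can be useful to preview which files under the root path the exclusion is expected to skip. The sketch below runs locally and is independent of HubStor: it does not reproduce the connector's own matching logic. Case-insensitive matching on the final extension and recursion into subfolders are assumptions made for illustration, and the function names are hypothetical.

```python
from pathlib import Path

# Extensions from this scenario, stored in lowercase with a leading dot.
EXCLUDED_EXTENSIONS = {".txt", ".rpt"}


def is_excluded(file_name, excluded=EXCLUDED_EXTENSIONS):
    """Return True if the file's final extension is in the excluded set.

    Assumption: matching ignores case, so "REPORT.RPT" counts as ".rpt".
    """
    return Path(file_name).suffix.lower() in excluded


def preview_exclusions(root, excluded=EXCLUDED_EXTENSIONS):
    """List files under root (recursively) whose extension is excluded."""
    return sorted(
        str(path)
        for path in Path(root).rglob("*")
        if path.is_file() and is_excluded(path.name, excluded)
    )


if __name__ == "__main__":
    # Root path taken from the scenario in this article.
    for match in preview_exclusions(r"C:\Files"):
        print(match)
```

Comparing this list with what the connector actually archives after the next crawl is a quick way to confirm that the Write Policy behaves as intended.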

