About Web Filter
Web Filter monitors HTTP traffic on your network to monitor user behavior and block inappropriate content. Web Filter also appeals to customers who require an added level of protection or are subject to regulations, for example Web Filter helps libraries comply with the Children's Internet Protection Act). Need to block Pornography or Hate Speech on your network? Web Filter is your answer.
- Real-time classification and updates: When your users visit a site, Untangle sends the URL to the cloud to be categorized. When the data is returned, Untangle keeps a temporary local cache of the site and category to speed up the process the next time the URL is requested. This data is then used to block or allow users access to the site they have requested, all without any appreciable increase in load time. If a site is not categorized upon request, it is autocategorized by our partners at zVelo and put into a queue to be verified by a human. Because this is done dynamically, new sites and updated URLs are allowed or blocked according to your settings without additional intervention, plus you have the option of requesting recategorization of sites.
- HTTPS Filtering: Web Filter has multiple techniques to deal with HTTPS, SSL-encrypted HTTP. HTTPS traffic is encrypted so only some information is visible and this information is used to categorize the session. More information on how this is down below.
- Detailed categorization: Web Filter offers over 140 categories and over 450 million categorized sites. The Web Filter database is over 100 times larger and more accurate. With over 10 times the number of categories it also has better granularity. The abundance of categories means that you can narrow your scope - maybe you want to block websites related to Sex, but allow sites dealing with Sexual Education or Pregnancy. With Web Filter Lite you would have to block 'Pornography' category and allow any specifics sites through, while with Web Filter you could set the separate categories appropriately and not worry about manually unblocking anything.
- Additional features: The ability to force SafeSearch on search engines that support it, and more!
This section reviews the different settings and configuration options available for Web Filter.
Block Categories allows you to customize which categories of sites will be blocked or flagged. Categories that are blocked will display a block page to the user; categories that are flagged will allow the user to access the site, but will be silently flagged as a violation for event logs and Reports. These block/flag actions operate the same way for all of the different Web Filter options.
Site Lookup allows you to find the categorization of a URL. Clicking it brings up a dialog. In Site URL specify the URL to find and click Search to find the URL's categorization.
If you feel the current categorization is incorrect, check Suggest a different category, select a new category from the list, and click Suggest to submit the category change for consideration.
- NOTE: This is only a suggestion and may not be accepted. If accepted it may take a few days to become active.
Under Blocked Sites you can add individual domain names you want to be blocked or flagged - just enter the domain name (e.g. youtube.com) and specify your chosen action. This list uses URL Matcher syntax.
Block File Types
The Block File Types section allows you to block files by file extension - just select (or add) your chosen file extension, check your preferred action, and save. This list uses Glob Matcher syntax.
Block MIME Types
The Block MIME Types section allows you to block files by MIME types - just select (or add) your chosen file extension, check your preferred action, and save. This list uses Glob Matcher syntax.
Pass Sites is used to pass content that would have otherwise been blocked. This can be useful for "unblocking" sites that you don't want blocked according to block settings. Any domains you add to the Passed Sites list will be allowed, even if blocked by category or by individual URL - just add the domain and save. Unchecking the pass option will allow the site to be blocked as if the entry was not present. This list uses URL Matcher syntax.
If you add an IP address to this list, Web Filter will not block any traffic from that IP regardless of the blocked categories or sites. Just add the IP and save. Unchecking the pass option will have the block/pass lists affect the user as if they were not entered into the Passed Client IPs list. This list uses IP Matcher syntax.
- If you have a few users that need to completely bypass Web Filter controls, consider using pass lists. If you have users that simply need different Web Filter settings, you should set up a separate rack using Policy Manager. When using this feature, please remember that DHCP IPs can change, so you'll probably want to set up either a Static IP or a Static DHCP Lease for the machine in question.
The Advanced section allows you to configure additional web filter options.
- Process HTTPS traffic by SNI (Server Name Indication) if present: If this option is enabled, HTTPS traffic will be categorized using the "Server Name Indication" in the HTTPS data stream, if present. More details in #HTTPS Options.
- Process HTTPS traffic by hostname in server certificate when SNI information not present: If this option is enabled and SNI information is not present, the certificate is fetched from the HTTPS server and the server name on the certificate will be used for categorization and filtering purposes.
- Process HTTPS traffic by server IP if both SNI and certificate hostname information are not available: If this option is enabled and neither of the previous options worked, HTTPS traffic will be categorized using the IP address. More details in #HTTPS Options.
- Enforce safe search on popular search engines: When this option is enabled, safe search will be enforced on all searches using supported search engines: Google, Yahoo!, Bing, Ask.
- Block pages from IP only hosts: When this option is enabled, users entering an IP address rather than domain name will be blocked.
- Pass if referers match Pass Sites. When this option is checked, if a page contains external content from any site in Pass Sites, that external content will be passed regardless of other block policies.
- Block Google applications: When this option is enabled, only domains listed in Domain are allowed to access Google applications such as Gmail. All others are blocked by Google. Multiple domains can be specified, separated by commas such as:
- untangle.com, domain.com.
- NOTE: HTTPS Inspector must be installed and running with the Inspect Google Traffic configured to Inspect.
- Unblock: This section can be used to add a button to allow users to bypass restrictions on a case-by-case basis.
- If Unblock is set to None no users will be allowed to bypass the block page. If Unblock is set to Temporary users will be allowed to visit the site for one hour from the time it is unblocked. If Unblock is set to Permanent and Global then users will be allowed to visit the site and unblocked sites will be added to the permanent global pass list so it will always be allowed in the future.
- You also have the option of setting a password to Unblock; it can either be the existing Administrator password for the Untangle or you can set a new, separate password only for the Unblock feature.
- Clear Category URL Cache: This option will clear the local cache of categorized sites and URLs. After clearing the cache all new web visits will be looked up fresh using the categorization service. The cache automatically cleans itself as entries become old or stale, so this is mostly for testing.
The Reports tab provides a view of all reports and events for all traffic handled by Web Filter.
All Web Filter reports can be accessed using the Select Reports window. All pre-defined reports will be listed along with any custom reports that have been created.
Reports can be searched and further defined using the time selectors and the Conditions window at the bottom of the page. The data used in the report can be obtained on the Current Data window on the right.
Pre-defined report queries:
|Web Filter Summary||A summary of web filter actions.|
|Web Usage (all)||The amount of total, flagged, and blocked web requests over time.|
|Web Usage (scanned)||The amount of total, flagged, and blocked web requests over time.|
|Web Usage (flagged)||The amount of flagged, and blocked web requests over time.|
|Web Usage (blocked)||The amount of flagged, and blocked web requests over time.|
|Top Categories (by request)||The number of web requests grouped by category.|
|Top Categories (by size)||The sum of the size of requested web content grouped by category.|
|Top Flagged Categories||The number of flagged web requests grouped by category.|
|Top Blocked Categories||The number of blocked web requests grouped by category.|
|Top Sites (by request)||The number of web requests grouped by website.|
|Top Sites (by size)||The sum of the size of requested web content grouped by website.|
|Top Flagged Sites||The number of flagged web requests grouped by website.|
|Top Blocked Sites||The number of blocked web requests grouped by website.|
|Top Domains (by request)||The number of web requests grouped by domain.|
|Top Domains (by size)||The sum of the size of requested web content grouped by domain.|
|Top Flagged Domains||The number of flagged web requests grouped by domain.|
|Top Blocked Domains||The number of blocked web requests grouped by domain.|
|Top Hostnames (by requests)||The number of web requests grouped by hostname.|
|Top Hostnames (by size)||The sum of the size of requested web content grouped by hostname.|
|Top Flagged Hostnames||The number of flagged web request grouped by hostname.|
|Top Blocked Hostnames||The number of blocked web request grouped by hostname.|
|Top Clients (by requests)||The number of web requests grouped by client.|
|Top Clients (by size)||The sum of the size of requested web content grouped by client.|
|Top Flagged Clients||The number of flagged web request grouped by client.|
|Top Blocked Clients||The number of blocked web request grouped by client.|
|Top Usernames (by requests)||The number of web requests grouped by username.|
|Top Usernames (by size)||The sum of the size of requested web content grouped by username.|
|Top Flagged Usernames||The number of flagged web request grouped by username.|
|Top Blocked Usernames||The number of blocked web request grouped by username.|
The event viewer provides a view of all web events and how they are handled by Web Filter. It can be used to view traffic on the network in real time or as a debugging tool to view how Web Filter is operating.
|All Web Events||Shows all scanned web requests.|
|Flagged Web Events||Shows all flagged web requests.|
|Blocked Web Events||Shows all blocked web requests.|
|All HTTP Events||Shows all scanned unencrypted HTTP requests.|
|All HTTPS Events||Shows all encrypted HTTPS requests.|
|Unblocked Web Events||Shows all unblocked web requests|
|All Query Events||Shows all search querires processed by Web Filter.|
Conditions can be used to filter the traffic information shown in reports and events. Each condition has a corresponding column that can be viewed in the events viewer. Multiple conditions can be added to drill down and inspect Web Filter data. For a list of conditions, refer to the http_events table in Global DB Schema.
As described briefly above, there are two HTTPS options.
- Process HTTPS traffic by SNI (Server Name Indication) if present.
- Process HTTPS traffic by IP Address when SNI information not present.
If Process HTTPS traffic by SNI (Server Name Indication) if present encrypted port-443 traffic will be scanned by Web Filter. Most modern browsers on modern OSs will send the hostname of the server in cleartext - this is called "Server Name Indication" or SNI. SNI is an optional cleartext field in the HTTPS request that shows the hostname of the server. If this option is enabled and the SNI information is present in the HTTPS request, this hostname will be used as the URL for this request and all categorization, block lists, and pass lists, will be processed as if this were a regular HTTP request to that URL.
If the SNI-based categorization determines the page should be blocked the session is reset. If the SNI-based categorization determines the page should be passed (and/or flagged) then the session is allowed and the appropriate event based on the SNI information is logged ("https://example.com/").
For example, if the user visits "https://wellsfargo.com/welcome" in the browser, Web Filter will see "wellsfargo.com" as the SNI information. If enabled, the request will be handled exactly like "http://wellsfargo.com" would be. If "Banking" is blocked it will be blocked, unless "wellsfargo.com" is in the pass list or the client IP is in the client IP pass list. If "wellsfargo.com" is blocked it will be blocked, unless "wellsfargo.com" is in the pass list or the client IP is in the client IP pass list.
If Process HTTPS traffic by IP Address when SNI information not present is disabled and no SNI information is present the session will be allowed as there is no information available to process the traffic. If Process HTTPS traffic by IP Address when SNI information not present is enabled and no SNI information is present the session will be processed and categorized by IP address. If the IP-based processing and categorization of the web requests determines the session should be blocked, the session is reset and no more processing of this session will be done. If the IP-based processing and categorization determines the page should be passed (and/or flagged) then the session is allowed and the appropriate event based on its IP is logged ("https://188.8.131.52").
For example, if the user visits "https://wellsfargo.com/welcome" in a non-SNI enabled browser, then there is no SNI information for Web Filter to use. In this case if Process HTTPS traffic by IP Address when SNI information not present is enabled Web Filter will use the IP address instead. So it will process/categorize this web request as 'http://184.108.40.206' if 220.127.116.11 is the IP of wellsfargo.com. This will still often result in correct categorization for dedicated web servers, but does poorly when using generic cloud computing servers that offer a wide variety of websites.
Note: When blocking HTTPS traffic, block pages can not be shown. The HTTPS encryption prevents man-in-the-middle spoofing of data required to display the block page. The connection will simply be reset and the browser will display an error.
Note: Neither HTTPS process (IP-based nor SNI-based) can read the URI information as it is not sent in cleartext. As such the URI will not be used as part of the categorization and the URI is assumed to be "/" when evaluating block/pass rules.
To see the HTTPS categorization in action use the "All HTTPS Events" query in the event log.
Web Filter FAQs
How do Web Filter and Web Filter Lite work?
Web Filter and Web Filter Lite both transparently scans HTTP traffic in order to block or log specific activity. Websites can be blocked or logged based on Category (Pornography, Social Networking, etc.), URL (facebook.com, youtube.com, etc.), MIME Type or File Type (.exe, .mp3, etc..). Web Filter has a more robust feature set that is explained in a FAQ entry below.
Can I use both Web Filter and Web Filter Lite?
We do not recommend running both Web Filter and Web Filter Lite at the same time - if you have access to the trial or full version of Web Filter, we recommend using it rather than Web Filter Lite.
Is Web Filter really better than Web Filter Lite?
Web Filter is the same as Web Filter Lite except it is based on SiteFilter technology. Web Filter is better than Web Filter Lite in many ways:
- HTTPS Filtering
- Many More categories (141 vs. 15)
- Larger database (450+ million URLs vs ~1 million)
- Dynamic categorization of new sites
- SafeSearch enforcement
- Password option for the Unblock feature
Can I install Web Filter/Web Filter Lite on a single computer to use as Parental Control software?
No - Untangle is designed to operate as a gateway or transparent bridge for an entire network and is not meant to filter the computer it is installed on. Installing Untangle will wipe out all existing data on the PC it is installed to. For filtering a single PC, other Internet filter/Parental Control software can be used.
Why is a site not being properly displayed even though I added it to the Pass List?
It's common for a web site to display links, banners and content from other web sites as part of their web pages. There are two easy methods to re-integrate the content while maintaining your access controls. A good example is Facebook - when you go to 'facebook.com', much of the site is loaded from 'fbcdn.net', which also must be put on the pass list for it to display properly. To fix these problems, just look in the Event Log for domains that are still being blocked when you load the site.
Can I block all web sites except certain ones?
Yes, simply block all categories (including "Uncategorized"). Then add whatever sites you'd like to pass to the Pass List. Please be aware that the complex nature of the web and the fact that many applications communicate over HTTP can make this approach difficult.
Why block both MIME Type and File Types?
In an ideal world, both pieces of information would always be present for every web request. However, some sites use incorrect content types or extensions. The behavior of operating systems (Windows vs. Mac) is also different when given only file extension or content type. To be safe, both lists should be used.
What kind of reporting features do Web Filter and Web Filter Lite offer?
Web Filter and Web Filter Lite provide network- and user-based reporting. Data from these apps is fed into Reports to show high level trends such as peak network usage hours as well as allowing administrators to drill down to the individual user level for activity monitoring.
Can I grant privileged access to some users while still blocking sites for everyone else?
There are several ways to accomplish this:
- Policy Manager can be used to create multiple racks, which allows you to have separate filtering settings for individuals or groups of users. The easiest example is a school, where you would want Teachers to have more relaxed internet filter settings than the students. Different settings can be applied to any individual or group in your organization such as CEOs, Administrative Assistants or Accounting Departments.
- The Passed Client IPs List allows you to exempt specific users from all filtering inside the Web Filter/Web Filter Lite applications.
- The Unblock option displays a button that, when clicked, will allow users to bypass the block page. Web Filter has an additional option to require a password for this.
Can I let users access certain sites during lunch?
You can leverage Policy Manager to set up specific filtering settings for different days or time periods, such as allowing Facebook during breaks or after work hours.
Windows Vista/7 computers showing "No Internet Access" but everything is fine. Whats going on?
Make sure you're not blocking access to the domain www.msftncsi.com; this is part of a test that Microsoft runs to see if there is an active internet connection. Once you've verified this domain is not blocked, simply restart the PC and that should take care of it.
How do I submit a mis-categorized or uncategorized site?
You can go to zvelo and submit the correct (or new) categorization. It will be reviewed immediately by a human. Once the new categorization takes effect you may need to flush your category cache in Web Filter to see the new categorization.
Does Web Filter use a lot of memory and CPU?
If your Untangle Server is operating well without Web Filter, then you won't see much of a difference if you run Web Filter. Web Filter doesn't use much memory, and its cloud-based architecture adds very little to CPU utilization.
How do real-time updates work?
When a client first vists a site, Web Filter accesses the zveloDB to get the categories the site is under to make a decision to block or pass based on your configuration. The category information is also written to a local cache so it doesn't have to be checked the next time a user visits that site.
How long does Web Filter cache category information for sites?
Several days. Web Filter flushes non-frequently used cache. The website that you visit daily will not be cleared from cache.
Can I add additional categories?
Custom categories are not available, however we provide over 140 categories for granular control over what your clients can access. If you feel there are categories that we can add to make it even better, just let us know.
How should I handle false positives?
While the fastest way to allow clients to access a site that is currently blocked is to add the site to your pass list, you can request recategorization of sites here - the turnaround time is usually less than two days.
Can I use Web Filter to block HTTPS/SSL sites?
Yes - because Web Filter has access to a separate database of IP addresses, it can categorize HTTPS traffic based on the destination IP address. This is not done by individual domain, but by category - for example, if you simply block 'facebook.com'. Please note that this does not mean Web Filter can parse HTTPS as it is encrypted. Categorization is done via IP address. This means other forms of blocking like URL, file-type, mime-type, etc can not be done on HTTPS as the stream is encrypted and these require parsing of the HTTP protocol.
Why can i access a site using HTTPS when I've added it to the block list?
Web Filter scans and categorizes HTTPS traffic by IP address because the session itself is encrypted and cannot be scanned. As a result, if you add "example.com" to the block list and go to "https://example.com" it will not be blocked because Untangle can only see the IP address. However, if you block the category "example.com" is in, then go to "https://example.com" it will not connect and you will see a block event in the Event Log.
Why is Web Filter still blocking an HTTPS site even after I added it to the pass list?
This should only be a problem with older browsers that do not provide SNI information in the HTTPS stream - if your browser provides SNI information, adding the domain to the pass list should allow the site to load. Older browsers that do not provide SNI information can run into this problem, however. If this is the case, it is because Web Filter does categorization of HTTPS traffic by IP address. HTTPS encrypts the hostname and request, so all we can see is the destination IP. This means if https://example.com/ is getting blocked, adding "example.com" to the passlist will have no effect because HTTPS is categorized by IP address. If you add the IP address of example.com to the passlist then HTTPS traffic to example.com will be allowed.
Why did 'Youtube for Schools' disappear?
Google/Youtube stopped supporting the their youtube for schools features. This features relied on Untangle adding an identification header to HTTP requests and then youtube would enforce the policy on the server. Since this feature is no longer supported by their servers the feature has been removed.