Process HTML content in messages
If a message contains HTML, the filter parses the HTML part as well as the plain-text (text/plain) part, so the tokens also include data from the HTML content. This makes the filter more likely to detect spam that is sent as HTML, because the tokens and patterns found in the HTML of known bad messages contribute to the calculated probability of spam.
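As a rough illustration of the idea, the following Python sketch tokenizes both the plain-text part and the HTML part of a message. It is not MailEnable's actual tokenizer: the names `tokenize`, `HtmlTokenCollector`, and `process_html`, the word pattern, and the `html:` token prefix are all assumptions made for this example.

```python
import re
from html.parser import HTMLParser

# Assumed word pattern for this sketch; the real filter's rules are not documented here.
WORD = re.compile(r"[A-Za-z0-9$'-]+")


class HtmlTokenCollector(HTMLParser):
    """Collects tokens from tag names, attribute names/values, and text."""

    def __init__(self):
        super().__init__()
        self.tokens = []

    def handle_starttag(self, tag, attrs):
        # The markup itself becomes a token, so HTML patterns can be learned.
        self.tokens.append(f"html:{tag}")
        for name, value in attrs:
            self.tokens.append(f"html:{tag}:{name}")
            if value:
                self.tokens.extend(w.lower() for w in WORD.findall(value))

    def handle_data(self, data):
        # Visible text inside the HTML is tokenized like plain text.
        self.tokens.extend(w.lower() for w in WORD.findall(data))


def tokenize(plain_text, html_text=None, process_html=True):
    """Return tokens from the plain-text part and, optionally, the HTML part."""
    tokens = [w.lower() for w in WORD.findall(plain_text)]
    if html_text and process_html:
        collector = HtmlTokenCollector()
        collector.feed(html_text)
        collector.close()
        tokens.extend(collector.tokens)
    return tokens
```

With `process_html=False` only the plain-text words are returned; with it enabled, tag and attribute tokens such as `html:a` and the words inside link targets are added to the token list.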
Spam calculation method
When a message is split into its tokens (words) for analysis, each token in the message is assigned a probability of being spam or non-spam.
MailEnable can be configured to use one of several methods for combining these token probabilities into the final probability of the message being spam.
The methods are as follows:
Measure highest and lowest percentiles of the most frequent tokens - Only the tokens that occur most frequently in the message are aggregated to measure the probability of the message being spam. With this option, a message containing multiple instances of a spam token will most likely be classified as spam.
Measure all tokens in the message - All tokens occurring in the message are aggregated to calculate the probability of the message being spam. This is the recommended method because it provides a more balanced calculation.
Measure tokens within the highest and lowest percentiles - Only the tokens in the message that most strongly indicate spam or non-spam are considered. With this option, a legitimate message containing the word 'viagra' is more likely to be detected as spam.
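The difference between the three methods can be sketched in Python. This is an illustration only: the exact formula MailEnable uses is not stated here, so the sketch assumes a common naive Bayes style combination of token probabilities, and the names `combine`, `score`, `limit`, and `unknown`, the method labels, and the default values are hypothetical.

```python
from collections import Counter
from math import prod


def combine(probs):
    """Combine per-token spam probabilities (assumed naive Bayes style formula)."""
    if not probs:
        return 0.5  # no evidence either way
    # Clamp so a single 0.0 or 1.0 cannot cause a division by zero.
    probs = [min(max(p, 0.01), 0.99) for p in probs]
    spam = prod(probs)
    ham = prod(1.0 - p for p in probs)
    return spam / (spam + ham)


def score(tokens, token_probs, method="all", limit=15, unknown=0.4):
    """Score a message using one of three token-selection strategies.

    token_probs maps a token to its learned spam probability; tokens not
    in the map get the assumed neutral-ish value `unknown`.
    """
    counts = Counter(tokens)

    def p(token):
        return token_probs.get(token, unknown)

    if method == "all":
        # Every distinct token in the message contributes.
        chosen = list(counts)
    elif method == "most_frequent":
        # Only the tokens that occur most often in the message contribute.
        chosen = [t for t, _ in counts.most_common(limit)]
    elif method == "most_extreme":
        # Only the tokens whose probability is farthest from 0.5 contribute.
        chosen = sorted(counts, key=lambda t: abs(p(t) - 0.5), reverse=True)[:limit]
    else:
        raise ValueError(f"unknown method: {method}")
    return combine([p(t) for t in chosen])
```

For a message such as `["the", "meeting", "agenda", "the", "viagra", "the"]` with a high learned probability for `viagra` and low ones for `meeting` and `agenda`, the `most_extreme` strategy with a small `limit` is dominated by `viagra`, while `all` lets the legitimate tokens pull the score down. This mirrors why measuring all tokens gives a more balanced result.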
Spam Honeypot Email Addresses (Edit address list)
Define email addresses that never receive legitimate mail, so that messages sent to them can be sampled as spam.
Ham Addresses (Edit address list)
Define "ham" (legitimate) email addresses whose messages are sampled as non-spam.
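One way to picture how the two address lists feed the filter is the sketch below, which labels an incoming message as a spam or ham sample based on its recipient and records its tokens. This is an assumption about the general approach, not a description of MailEnable's internals; `SampleStore`, `label_for`, and `add_message` are hypothetical names.

```python
from collections import Counter


class SampleStore:
    """Keeps per-token sample counts for spam and ham messages."""

    def __init__(self, honeypot_addresses, ham_addresses):
        # Addresses are compared case-insensitively in this sketch.
        self.honeypots = {a.strip().lower() for a in honeypot_addresses}
        self.hams = {a.strip().lower() for a in ham_addresses}
        self.counts = {"spam": Counter(), "ham": Counter()}

    def label_for(self, recipient):
        """Return 'spam', 'ham', or None if the recipient is on neither list."""
        r = recipient.strip().lower()
        if r in self.honeypots:
            return "spam"
        if r in self.hams:
            return "ham"
        return None

    def add_message(self, recipient, tokens):
        """Record a message's distinct tokens as a sample, if it qualifies."""
        label = self.label_for(recipient)
        if label is not None:
            self.counts[label].update(set(tokens))
        return label
```

Mail for any address that is on neither list is ignored for sampling, so only the listed addresses influence the learned token counts.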