A fundamental axiom of probability is that if we have two events and from the same probability space, and is a subset of (, then . This should be intuitive: occurring is one of the ways that can occur, but there are also other ways can happen without , so its probability must be higher. This axiom is known as the monotonicity of probability measures; in this course, for the sake of brevity I typically refer to this as the principle of inclusion.
A great deal of statistical theory involves inequalities, and a typical scenario in which this arises is: we know that , therefore . This is intuitively simple, but it is easy to make mistakes with respect to the sign. As a mnemonic device, I suggest “if large side gets larger, or the small side gets smaller, the probability goes up”, but use whatever works for you.
As a more specific example, suppose , , and are random variables. Then by combining the triangle inequality and the monotonicity of probability measures, we have