I no longer study statistics, but the best 'proof' of this formula that I came across was to simply draw a Venn diagram and think about what it is the conditional probability represents. I don't think there exists a proof so much as that formula is the fundamental definition of conditional probability. Nonetheless, I'll try:
Take your two events, A and B, which exist in the same sample space.
Now, you want to find
P(A∣B), or, the probability that A will happen given that B has already occurred. Now there only exists a small region where both A and B can occur simultaneously,
A∩B(Sorry, that should say A∩B, not A and B)However, there is still a probability of B simply occurring again, that means you're calculating the probability of
P(A∩B) while also
restricting your sample space to P(B).
In simplistic terms, you're trying to calculate the proportion of the area which
A∩B takes up inside
B in your Venn diagram. Replace areas with probabilities and you have your equation!
Hopefully this makes sense (and is correct). If any statisticians want to call me out on anything, feel free!