What does it mean if observations are iid, and why is this a desirable property?

The abbreviation iid stands for independent, identically distributed, and refers to instances of a random variable that are samples in such a fashion. This means that each observation is sampled from the same underlying probability distribution and is a separate instance that has no dependency on any other observation. This is a desirable quality for simplifying the variance structure (and providing validation of independence necessary for most linear models) and the computation of the likelihood function.

An example of a random sample that is not iid would be drawing from a deck of cards without replacement, as once a card is drawn, it cannot be placed back into the deck. Thus, subsequent draws would have a different probability of coming up based on what is remaining compared to if they were drawn with replacement.

Author

Help us improve this post by suggesting in comments below:

– modifications to the text, and infographics
– video resources that offer clear explanations for this question
– code snippets and case studies relevant to this concept
– online blogs, and research publications that are a “must read” on this topic

Leave the first comment

Partner Ad
Find out all the ways that you can
Contribute
Here goes your text ... Select any part of your text to access the formatting toolbar.