Let θ denote the true ability of a student in a certain subject, representing the probability of astudent answering correctly an arbitrary question and let X1, . . . , Xn be the marks to n questions the student answered, taking values either 0 or 1. We model them as n i.i.d. Bernoulli(θ) randomvariables. Then, we approximate a student’s ability by its ML estimate.
a) We can approximate the distribution of
(θ(X) - θ) / ((θ(X)(1−θ(X)))/n)0.5
(Where θ(X) is the Maximum likelihood estimate)
by a standard gaussian N(0,1) distribution, under distribution P0. Using this result, construct the 95 percent confidence interval for θ.
b) How big should n be so that we can estimate the true ability within 0.01 marks, with probabilityat least 95%? Justify your answer.
I have no idea where to start here. Why are we approximating that scary looking distribution in the first place, and what does it mean by 'under distribution P0 ? How do we start to construct an interval?