Statistics 1 Histograms! (Estimate median)

    Hey, as the title says, I was wondering how I would calculate an estimate of the median from a histogram.
    Can I use the formula (n+1)/2 and then look at the frequencies?
    If I can, what do I put as the median? Would I put, for example, 25-30, because the data is continuous?
    Thanks

    Find the total number of samples (e.g. 60).

    Then halve that number (30) and look for the bar which contains the 30th value, counting up from the smallest.

    For example (class interval - frequency):
    0<x<5 - 10
    5<x<10 - 15
    10<x<15 - 22
    15<x<20 - 13

    The cumulative total for 0<x<10 is 25, but we want the 30th value. This lies in 10<x<15, so the answer is found by interpolating within that class.

    At x=10, y=25
    At x=15, y=47

    So we need to find the value of x when y=30.

    This can be done by similar triangles, or by finding the equation of the line [y-y1=m(x-x1)] and substituting the value of y=30.
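
The interpolation described above can be sketched in Python. This is only a minimal illustration, not an official method: the function name `estimate_median` and its two arguments are made up for this example, and it assumes the median position is n/2, as in the post, and that values are spread evenly within each class.

```python
from itertools import accumulate


def estimate_median(boundaries, frequencies):
    """Estimate the median of grouped data by linear interpolation.

    boundaries  -- class boundaries, one more entry than frequencies
    frequencies -- number of values in each class
    (Both names are hypothetical, chosen for this sketch.)
    """
    total = sum(frequencies)
    target = total / 2  # median position, taken as n/2 as in the post
    # cumulative[i] is the cumulative frequency at boundaries[i]
    cumulative = [0] + list(accumulate(frequencies))
    for i, freq in enumerate(frequencies):
        if freq > 0 and cumulative[i + 1] >= target:
            lower, upper = boundaries[i], boundaries[i + 1]
            # Fraction of the way through the class, scaled by the class width
            return lower + (target - cumulative[i]) / freq * (upper - lower)
    raise ValueError("frequencies must contain at least one positive count")


median = estimate_median([0, 5, 10, 15, 20], [10, 15, 22, 13])
print(round(median, 2))  # 11.14
```

The same function works for any set of class boundaries, including unequal widths, because it only uses the cumulative frequencies at the two ends of the median class.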

    Sorry, I don't get the part about y. I don't understand how you worked out y given that the original values are unknown. Could you explain please?

    Ok, assume we had the data above.
    0<x<5 - 10
    5<x<10 - 15
    10<x<15 - 22
    15<x<20 - 13

    The total number of samples is 60, so the median is the 30th value.

    If you take the cumulative frequencies:
    0<x<5 = 10
    0<x<10 = 25
    0<x<15 = 47
    0<x<20 = 60

    Since the median is the 30th value, it must lie between the cumulative frequencies 25 and 47, i.e. in the class 10<x<15.
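
As a rough check of the cumulative frequencies and the median class, here is a short Python sketch. The variable names are just for illustration, and the median position is assumed to be n/2 as above.

```python
from bisect import bisect_left
from itertools import accumulate

# Class boundaries and frequencies from the example in this thread.
boundaries = [0, 5, 10, 15, 20]
frequencies = [10, 15, 22, 13]

# Running totals: the cumulative frequency at each upper class boundary.
cumulative = list(accumulate(frequencies))

# Median position for grouped data, taken as n/2 as in the post.
position = cumulative[-1] / 2

# Index of the first class whose cumulative frequency reaches the position.
index = bisect_left(cumulative, position)
median_class = (boundaries[index], boundaries[index + 1])

print(cumulative)    # [10, 25, 47, 60]
print(median_class)  # (10, 15)
```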

    Let x be the measured value (height, etc) and y be the cumulative frequency.

    Look at this diagram of cumulative frequency. [Image not available: cumulative frequency y plotted against x.]



    As you can see, at x=10 the CF is 25, and at x=15 the CF is 47.

    The median position, 30, lies between 25 and 47. The black horizontal line is drawn at CF = 30. It meets the diagonal line drawn from the lowest point (x=10, y=25) to the highest point (x=15, y=47). You are interpolating across this bar as if the cumulative frequency rises in a straight line between x=10 and x=15.

    So you can find an equation of this diagonal line.

    y-y_1 = m(x-x_1)
    y-25=m(x-10)

    And find m:
    m = \frac{\text{change in } y}{\text{change in } x} = \frac{47-25}{15-10} = \frac{22}{5}

    So,
    y-25=\frac{22}{5}(x-10)


    Then sub in the value y=30 and find x. (Obviously x is somewhere between 10 and 15.)

    x = 11.14 (to 2 d.p.)
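
For anyone who wants the substitution written out, the steps would look roughly like this. The symbols L, F, f and w in the last line are labels introduced here for illustration (lower class boundary, cumulative frequency below the class, class frequency and class width); they are not from the original posts.

```latex
\begin{align*}
30 - 25 &= \frac{22}{5}(x - 10) \\
x - 10 &= 5 \times \frac{5}{22} = \frac{25}{22} \\
x &= 10 + \frac{25}{22} = 11.1363\ldots \\[6pt]
\text{In general: } x &= L + \frac{\tfrac{n}{2} - F}{f} \times w
  = 10 + \frac{30 - 25}{22} \times 5
\end{align*}
```

The general form is the similar-triangles version of the same calculation, so both routes give the same value.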

    Oh OK, thanks.

    No problem.

    Hi. The example quoted in this thread has a histogram with a fairly 'standard distribution' shape. How do you go about finding the median for a histogram that is not quite so 'standard distribution' in shape, or am I perceiving it wrong?
 
 
 